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Editorial 


Introduction 


In the 1930s and 1940s, especially during the Second World 
War, h was a primary objective to obtain robust control of 
physical systems having variable characteristics and unpre¬ 
dictable disturbances both internal and external. Actually, 
the rather recently used term 'robust’ was not used in that 
period, but it was considered m terms of sensitivity and 
return difference which were well dehned. 

Long before the age of digital computers, design 
techniques were mostly limited to SISO systems. Neverthe¬ 
less, rather efficient, though approximate, graphical methods 
were developed for designing multiple feedback loops in 
multivariable systems including those having sampled data 
and even different types of nonlinearities. 

However, during the 1960s and 1970s powerful digital 
computers became available and attention shifted from the 
robust benefits of feedback control to rigorous, but 
previously unfeasible, exact mathematical techniques for 
obtaining optimal control of physical systems represented by 
exact, invariable mathematical models where feedback 
control was not needed for robustness except for reducing 


the effects of disturbances which were usually not even 
considered in early studies. Application of the resulting exact 
control laws to actual, variable systems, especially adaptive 
control systems with unmodeled dynamics, were not 
successful. Thus the need for robust feedback techniques was 
resurrected, but this time to include complex, MIMO 
variable systems using necessary computer design methods, 
the derivation of which have become a primary and 
practically essential area of developing research. 

Therefore, the Automatica Editorial Board and 1 are very 
grateful to the Deputy Editor-in-Chief and his Guest Editors 
for creating this Special Issue on Robust Control to provide 
some of the latest directions and a pierspectivc of 
developments in robust control techniques which will be 
necessary to provide the potential advantages of advanced 
control components and systems in the future. 

George S. Axelby 
Editor-in-Chief 
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Editorial 


Special Issue on Robust Control 


The idea to publish a Special Issue of Automatica devoted 
to Robust Control came up several years ago. I first talked to 
Peter Dorato about it during the IFAC Congress in Munich 
in 1987. The propiosal was approved during the Automatica 
Editorial Board nuceting just before the 1990 IFAC Congress 
at Tallinn, and the issue now appears not long before the 
1993 Congress in Sydney. 

I had the pleasure of working with a fine team of Guest 
Editors, consisting of J, Ackermann, R. F. Curtain, P. 
Dorato, B. A. Francis and H. Kimura. After the calls for 
papers late in 1990 and early in 1991, eventually 47 papers 
were received, several of which had been invited. The 
Special Issue consists of four invited papers, eight regular 
papers, and six brief papers. Moreover, a bibliography on 
robust control is included. Three papers were accepted but 
eventually were not published mainly because they were too 
long. They will appear later in regular issues of Automatica. 

Robust control is the most interesting current research 
theme in Che control held. Everyone knows about the 
difficult phase control theory went through in the late 1950s 
and early 1960s, when many mistakenly believed good 
control to be synonymous with optimal steering. Things took 
a better turn with the rise of linear optimal control theory in 
the 1960s, which at least included the idea of feedback. It 
lasted until the late 1970s, however, before a significant 
proportion of the research force of the control community 
returned to the central issues of control: feedback, and the 
robustness it can achieve. 

Obviously the Special Issue does not cover the theme of 
robust control exhaustively. The editorial team is satished, 
though, that a representative selection of papers was 
assembled. 

Sadly, the issue opens with a paper whose principal author 
is no longer alive. The work of K. Wei is commemorated in 
an “In Memoriam”, written by his former Ph D. adviser B. 
Ross Barmish. 

The span of the invited papers is wide but nevertheless the 
papers do not cover all the aspects or methodologies tor 
robust control system analysis and design. A paper on 
Quantitative Feedback Theory was invited but did not make 


it, in common with few submitted papers on the subject. 

The invited papers are all long. Barmish and Kang survey 
extreme point results. The paper of M. A. Dahleh and 
Khammash brings the fi story. Kaminer, Khargonekar, and 
Rotea put mixed H 2 lH^ control in a convex optimization 
framework. Packard and l>oylc need approximately 50 pages 
to set down their work on the complex structured singular 
value. 

The “Call for Papers" specifically requested applications 
papers and case studies. The paper by Chiang, Safonov, 
Madden and Tekawy and Garg deal with design studies 
of aircraft control problems. The subject of the design study 
by Milanese, Fiorio, and Maian is more unusual, and 
concerns the robust performance of a high accuracy 
calibration device. Limcbeer, Kascnally and Perkins present 
new methodology that is subsequently applied to a 
distillation column control problem, which was also the 
center of attention M a session of the 1991 IEEE Control and 
Decision Conference at Brighton, U K. 

The remaining regular papers and the brief papers cover a 
variety of theoretical aspects of robust control, ranging from 
robust root clustering to nonlinear uncertain systems. 

The “Bibliography on Robust Control" by Dorato, Tempo 
and Muscato lists and classifies several hundreds of selected 
books and journal articles that appeared in the period 
1987-1991. It supplements earlier compilations by Dorato. 

Automatica's Editorial Board hopes to serve the control 
community at large with this and other special issues. A 
Special Issue on “The Challenge of Computer Science in 
Industrial Applications of Cx^ntrol", to be published jointly 
with the IEEE Transactions on Automatic Control, is being 
prepared by A. Benveniste and K. J. Aslrdm, for publication 
during the second half of 1993. B. Wahlberg and T, 
Sdderstrdm are working on a Special Issue on “Signal 
Processing m Control", scheduled for January 1994. The 
Editorial Board invites proposals for other Special Issues. 

Huibert Kwakernaak 
Deputy Editor-in-Chief 
A utomatica 
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In Memoriam: Kehui Wei (1946-1992) 



Kehui Wei was born in Nancheng in the Chinese 
province of Jiangxi in 1946. As a child, he 
showed high academic potential skipping two 
years of elementary school and demonstrating 
aptitude and interest in Physics and Mathemat¬ 
ics. From a young age, he aspired to become a 
scientist—Albert Einstein was his role model. 

From 1963 to 1968, he studied Physics 
(specializing in microwaves) at Chengdu Instit¬ 
ute Radio Technology. The last two years of his 
studies were interrupted by the Chinese Cultural 
Revolution and, for the next decade, he worked 
in the Jin-Jiang Electronics Company in 
Chengdu, Sichuan. During that period, he was a 
technician by day and a “closet scholar’’ by 
night. Colleagues at Jin-Jiang characterized 
Kehui as “a short thin guy with a pair of 
basketball shoes hanging around his neck”. 
When recounting these difficult days, he always 
put a positive spin on the events. He felt that he 
made enormous intellectual progress during this 
time and was extremely proud of his role in 
sharing child raising responsibilites with his wife 
Zhuoli—he apparently took periods of leave 
from his job to give undivided attention to his 
daughter Wei-Wei. Kehui was an outstanding 
father. 

After the cultural revolution, he was one of 
the first candidates to pass a very selective 


national examination qualifying him for study 
abroad. In 1980, leaving his family behind, he 
commenced graduate studies in Electrical Engi¬ 
neering al the University of Rochester where he 
received the Master of Science degree in 1982. 
Under the guidance of B. Ross Barmish, he 
worked towards his Ph.D. degree from 1981 to 
1984. 

Kehui’s time in Rochester involved day-to-day 
interactions with contemporaries including Kris 
Hollot, Ian Petersen and Alberto Galimidi. He 
was known in the department for doing some of 
his best research while pacing the hallways in a 
cloud of smoke. Kehui was also known for his 
tennis matches and Florida adventures over 
Christmas with his friend, Deng Zhi Fang, the 
son of Chinese paramount leader, Chairman 
Deng Xiao Ping. 

In 1984, he moved with his advisor to the 
University of Wisconsin-Madison where he 
completed his Ph.D. dissertation “Robust 
Stabilization and Pole Assignment for Linear 
Time-Invariant systems” in 1986. The SISO 
version of this work was published in a book 
which includes contributed papers from the 1985 
MTNS; the MIMO version appeared in 1988 in 
Automatica. During the Madison period, he 
interacted frequently with contemporaries in¬ 
cluding Minyue Fu and Ruxiang Qian. His spirits 
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were enormously uplifted when his wife and 
daughter travelled to the U.S.A. to join him 
after more than four years of separation. With 
Zhouli’s presence, this also marked the time 
when he improved his dietary habits—a variety 
of foods were substituted for his daily fast 
noodles. Soon thereafter, the birth of his second 
daughter, Linda, marked another joyous event 
in his life. His Ph.D. dissertation includes a note 
of thanks to his wife Zhuoli who “sacrificed her 
career and personal life to accommodate her 
husband’s pursuit”. 

Throughout the entire decade of his research 
in the control area, Kehui continually revisited 
the robust stabilization problem. He could not 
accept the fact that a “clean” and general 
solution had not been given even for the 
problem of simultaneous stabilizability of three 
LTl plants with an LTI output feedback 
controller. For 15 months immediately following 
his Ph.D. he continued this line of research as a 
Postdoctoral Associate under the supervision of 
R. K. Yedavalli at the University of Toledo in 
Ohio. The research effort was also expanded to 
include unmodelled dynamics and polynomial 
transformation methods. In describing Kehui in 
jest, colleagues from Toledo recount his 
fascination with fax machines. Finally, he had a 
simple method for quick transmission of Chinese 
characters to all those long lost friends and 
relatives back home. While in Toledo, he was 
also known for his joviality at social gatherings 
and the use of a bike as his primary mode of 
transportation. 

Over the last four years of his life, Kehui was 
employed at DLR (Institute for Dynamic Flight 
Systems) in Oberpfaffenhofen, Germany. Work¬ 


ing with colleagues such as Georg Gruebel and 
Juergen Ackermann, he continued to pursue his 
line of research in robust control publishing 
more than 10 papers in major journals. He was 
also known at DLR for his versatility. For 
example, in a short period of time, he became 
quite involved in the problem of instantaneous 
torque control of a PM stepper motors with a 
PWM voltage source invertor—an important 
topic associated with the development of a new 
generation of robot-gripper motors. 

DLR researchers described Kehui Wei as a 
person who was full of creative ideas, extremely 
hardworking (foregoing vacations) and paying 
little attention to his failing health. He was 
optimistic about his health until the very 
end—carrying our reviews of journal articles up 
to a few days before he passed away. Given his 
optimism, his sudden death came as a shock to 
his wife and two daughters. 

Throughout the entire DLR period, Kehui 
continued to doggedly pursue a solution to the 
robust stabilization problem in addition to his 
other responsibilities. His last piece of work 
appears in this issue. It is a paper written in 
collaboration with T. Tsujino and T. Fujii. The 
work was completed only a couple of months 
before he passed away. 

On 4 June 1992, we lost an excellent scientist, 
a fine colleague and good friend. Kehui Wei will 
live on through his publications the fond 
memories of our interaction with him and his 
dream to find an elegant and practical solution to 
the robust stabilization problem. 

B. R. Barmish 

J. E. Ackermann 
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On the Connection Between Controllability and 
Stabilizability of Linear Systems with Structural 
Uncertain Parameters* 

TARO TSUJINO,t TAKAO FUJIIt and KEHUI WEI+t 

Robust stabilizability is equivalent to controllability invariance for a certain 
class of interval systems. 

Key Wards Controllability; robust control; slate feedback; linear systems, control systems; system 
theory; control theory. 


Abstract —In this paper, we investigate the problem of 
robust .stabilization of interval systems by state-feedback It 
is shown under certain conditions that robust stabilizability is 
equivalent to controllability invariance. 

1. INlRODUniON 

In recent years, the robust stabilization 
problem has attracted a considerable amount of 
interest in the field of robust control. Various 
kinds of necessary and sufficient conditions have 
been derived so far for the existence of robust 
stabilizing controllers. One example is a 
condition gi /en in terms of the Pick matrix using 
interpolation theory (Kimura, 1984). Another is 
a type of condition given in terms of the 
solutions to Riccati equations both in 
problem (Doyle et ai, 1989) and in quadratic 
stabilization problem (see Khargonekar ct al, 
(1990) and the references therein). 

For a certain class of linear interval systems, 
conditions for robust stabilization are given in 
terms of a geometric pattern with respect to the 
location of uncertain parameters both in the 
quadratic stabilization problem (Wei, 1990) and 
in the robust stabilization problem (Wei, 1991, 
1992). These conditions are easy to check, but 
not necessarily easy to understand from a system 
theoretic point of view. In particular, the 
connection of these conditions with more 
familiar notions in the linear system theory, e.g. 
controllability and so on, are not so clear. Along 
this line Petersen (1987) defined a notion of 


♦Received in final form 7 July 1992. This paper was not 
presented at any IFAC meeting. 
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controllability invariance in that a linear 
uncertain system is controllability invariant if it 
is controllable in usual sense for each fixed value 
of uncertain parameters and discussed its 
connection with complete quadratic stabi¬ 
lizability with an arbitrary degree of stability. 
However, he did not succeed in making a 
complete clarification of the connection between 
these two notions. 

The purpose of this paper is to investigate the 
robust stabilization problem for interval systems 
along the direction in Petersen (19ft7). Based on 
the results in Wei (1990, 1991, 1992) we restrict 
attention to uncertain systems having a specific 
structure which is described in greater detail in 
Section 2. Along this line we show, with the help 
of the results in Wei (1990, 1991, 1992), that 
the notion of controllability invariance plays an 
important role in this problem. We establish that 
controllability invariance is necessary and 
sufficient for robust stabilizability, and is 
necessary hut not sufficient for quadratic 
stabilizability. Thus, the main contribution of 
this paper is to give an intuitively appealing 
interpretation for the condition for robust 
stabilizability given in Wei (1991, 1992). 

2 SYSTEMS AND DEFINITIONS 

We consider a single input time-invariant 
interval system (or interval system for short) 
(A, b), which is a set of systems described by the 
state equation 

( 1 ) 

where jc e R" is the state; w e R is the control; 
the entries of A and b are unknown but bounded 
in given compact sets; i.e. A = (a^j} and 
d,j > a,j > g,,; b = {b,} and B,^b,^ t,. Note that 
the entries oi A, b vary independently. We will 
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write a ,;*0 (fr, ® 0 ) if a,, = 0 *^ = 0 (resp. 6i = 
tfi = 0). The entry Otj or b, is called a 
sign-invariant entry if a,, x gj/ ■> 0 or b, x bf>0 
and a sign-varying entry if a,^ x g,, < 0 or 
SiXb,<0. Noting Theorem 3.1 in Wei (1990) 
that a robustly stabilizable system must have at 
least the same number of sign-invariant entries 
in the system matrices as the system order, we 
restrict our attention to a class of interval 
systems which is called a standard system as 
defined below. 


every entry pu of the matrix is either 0 or 1. Let 
Z denote the set of all standard systems (A, b) as 
in Definition 1. For a given pattern matrix P, we 
define as a subset of Z determined by the 
following rule: A standard interval system 
(A, 6 ) G Zp if py = 0 implies m ,,» 0 for any i, j. 

According to the above definition, in order to 
check if an interval system (A, b) g Zp we only 
need to check if it is a standard system and in 
addition m,j * 0 when p,, = 0 . 


Definition 1. (Wei, 1991, 1992) An n x (n + 1 ) 
interval matrix M is called the associated matrix 
of the interval system (A, 6 ) if 

Af = [A b]. (2) 

Furthermore an interval system (A, b) is called a 
standard system if the associated matrix M = 
{m,f} has the property that fn„+, is a 
sign-invariant entry for each i, 1 < i < n - 1 . 

Definition 2. (Wei, 1991, 1992) An interval 
system (A, b) is said to be stabilizable if there 
exists a linear static state-feedback control law 
u = kx with fc 6 R" such that the characteristic 
polynomial of the closed-loop system 

fis) = det {si-A- bk), (3) 

is a Hurwitz invariant polynomial; i.e. all the 
roots of the uncertain polynomial f{s) are in the 
strict left half of the complex plane. 


Definition 5. (Wei, 1991, 1992) An n x (n-f-1) 
pattern matrix P = {p,,} is said to have a 
generalized antisymmetric stepwise configuration 
if the following conditions hold: 

( 1 ) p„+, = 1 for all i = 1 , 2 , . . . , n. 

(2) If p 5: /i + 2 and p^p = 1, then p^v = 0 for all 
u ^v, uSp -1 and v^h. 

(3) det(P'’)=Pi 2 P 2 i • • Pnn+i, wherc P' is the 
right submatrix of P defined by 


Pll Pl3 


pr ^ 


P22 P 2 ^ 


P\n■^-\ 

Plrt^X 


LPn 2 Pn^ * ‘ * Pn/i-Fl J 


( 5 ) 


The following lemma shows a necessary 
condition for controllability invariance and will 
be used later for proving one of the two key 
propositions, i.e. Proposition 1 (see the Appen¬ 
dix for its proof). 


Definition 3. Consider an interval system (A, b) 
and assume that it is a standard system. Then the 
interval system (A, b) is said t 6 be controllability 
invariant if the pair (A, f>) is controllable in 
usual sense for any fixed value of uncertain 
parameters, that is, 

rank [A—.r/ b] = n, (4) 

for every j g C and every a„ e [a,,, a,,], b, g 

[b., b,]. 

Remark 1. The notion of “controllability in¬ 
variance” defined above, which is essentially the 
same as that defined in Petersen (1987), is a 
natural extension of the familiar “control¬ 
lability” in the linear system theory. 


3 PRELIMINARY RESULTS 
First we state the definition of that condition 
mentioned in the Introduction, which is given in 
terms of a geometric pattern. 

Definition 4. (Wei, 1991, 1992) An n x (n -(- 1 ) 
matrix P= {p,f} is said to be a pattern matrix if 


Lemma. If every interval .system (A, b) in Z,, is 
controllability invariant, then the following 
conditions hold. 

(1) If b* ^ 0, then a,, = 0(/ sy, 1 sy s fc, 

l<i<n) and = 0 . ( 6 ) 

( 2 ) U v^u + 2 and a„„^ 0 , then a,, = 0{i^}, 

1 :<y < u,l < u - 1) and + , =0. (7) 

The following are two key propositions for 
deriving one of the main re.sults, i.e. Theorem 1. 

Proposition 1. Let P be a given pattern matrix. 
Every interval system (A, b) in Z^ is control¬ 
lability invariant if and only if the matrix P has a 
generalized anti-symmetric stepwise configur¬ 
ation. 

A proof is given in the Appendix. 

Proposition 2. (Wei, 1991, 1992) Let P be a 
given pattern matrix. Every interval system 
(A, b) in Zp is stabilizable if and only if the 
matrix P has a generalized anti-symmetric 
stepwise configuration. 
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4. MAIN RESULTS 

This first main result stated below is a direct 
consequence of Propositions 1 and 2. 

Theorem 1. Every interval system (A, h) in is 
stabilizable if and only if every system {A, b) in 
is controllability invariant. 

Remark 2, This result has the following 
interpretation. Suppose a standard interval 
system is controllability invariant, then for each 
fixed values of the uncertain parameters, there 
exists a stabilizing feedback law which may 
depend on the uncertain parameter. However, 
the theorem guarantees that the feedback law 
can be chosen to be independent of the 
uncertain parameters, in other words, it depends 
only on the upper and lower bounds of the 
uncertain parameters. Conversely, if every 
standard interval system in 2^, is robustly 
stabilizable, then controllability invariance must 
hold. 

In the following we introduce another notion 
of robust stabilizability. 


This system is controllability invariant, but not 
always quadratically stabilizable for any <122 tind 
fci. 

5. CONCLUSION 

Theorems 1 and Corollary 1 lead to the fact 
that as far as a certain class of interval systems is 
concerned, the notion of controllability in¬ 
variance defined here is necessary and sufficient 
for stabilizability of the interval systems and is 
necessary but not sufficient for quadratic 
stabilizability. By this fact we have connected 
robust stabilizability with a natural extension of 
the familiar notion of controllability in the linear 
system theory. Thus, we have found that this 
notion plays an important role in this robust 
stabilization problem. The future research is to 
clarify the meaning of controllability invariance 
defined here in the context of robust stabilization 
problem and also to investigate its connection 
with other notions of controllability, e.g. the 
feedback controllability as defined in Petersen 
(1990). 


Definition 6. (Hollot and Barmish, 1980; Barm- 
ish, 1985) An interval system {A, b) is said to be 
quadratically stabilizable via linear control if 
there exists a linear static feedback control 
u = kx, P>0 and orX) such that the following 
condition holds: for all (A, b) 
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Lix) = x^A'P + PA^]x + lx'Phkx ^-a ||^||% 
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APPENDDC; PROOFS 

BeJow we denote sign-invariant entries hy or 0. 

A. Proof of Lemma 

Suppose that the following system is controllability 
invariant, 




"o 

■ ■ 0 " 


”o“ 


^hp 







■ 

' ■ ® _ 


A 


(A.l) 


Then by Definition 3 we can assume that the rank Condition 
4 holds for the case where B, = ] (i = 1, . . ., n) and A and b 
contain only two sign-varying entries, and a^^,, or and 
which vary sufficiently largely. Define the matrix 


Af=[A-j/ b], 


(A.2) 


and denote as the (n - k -i-1) x (n - jt -f 1) lower right 
hand submatrix of N. 

For the former part we suppose that « 0, Then 

clearly by (A.l) rank [A - s/J = n - 1 for s taking the 
eigenvalues of A and hence N^ is of full rank, or, detN, #0. 
We also note that the characteristic polynomial of A for the 
system £^/ is described by 


(-■O'* 




' + a,«, } = 0. 


(A.3) 


for li^u. Below we find conditions for the following two 
cases such that detA^i #0 for all the roots s of the equation 
(A.3), or all eigenvalues of A. 

Case 1; /c = 1, meaning that 6, is a sign-varying entry. 

We furthermore separate the case depending on the 
location of a^„,. 

(1) The case with u = l, meaning that there exists a 
sign-varying entry in the first column of A. For the matrix 


1 

—s 


0 


0 0 

0 

-s 1 


(A.4) 


we have 


(A.5) 


det A^i = 1 + bi(-.v)" ‘ 

Substituting the eigenvalues of A into s in (A.5) yields 

(A.6a) 
(A.fib) 


dct/Vi = l, (a = 0), 


The latter equation implies that for a^, 9^0 and fi, ^0, there 
exists a„, and b^ such that dclN, = 0. Hence if b, ^0, then 
a„, ■() (w “= 1,. . . , n). 

(2) The case with u > 1, meaning that there docs not exist a 
sign-varying entry in the first column of A. Define 




1 

— 5 
0 


0 0 


0 -j 1 


(A.7) 


(A.8b) 


then we have 

det - 1 + + fl„}, 

(u#nort;#2), (A.8a) 

detN, = r + [-ir*\{(-sr-' + a^), 

(u==n and t) = 2). 

For j = 0, we have 

detN, = 1, (u#noru^2), (A.9a) 

detNj = 1 + (-l)"^‘biA^(i; = n and u ^ 2). (A.9b) 
For s = V(-l)“' we have 

dttN^ = l, (u=#noru^2), (A. 10a) 

detNi = l, (m - n and or u = 2). (A. 10b) 

Equation (A.9b) implies that if b|#0, then a„ 2*0, where 
a „2 is the lower left hand corner of Combining (1) and 
(2) as above, we see that this lemma is valid for it = 1. 

Case 2: k>l, meaning that is a sign-varying entry. 
Define 




1 

— 5 
0 


0 


I 

-s 


0 
0 
0 

0 

• ■ 0 

0 -J 1 


(A.ll) 




then we have 


det N, = del Nf^. 


(A. 12) 


(1) When A/* does not contain i.c. v^k, we have 


dciyV* = l + bA(-A’)” 


(A.13) 


Since A has nonzero eigenvalues, b^^O implies a. »() 

(u = l_it) by (A.13). 

(2) When contains a^,, i.c v >k, it becomes 




1 

-,V 


0 




0 * 
0 
0 
0 

0 -.y 1 


(A. 14) 


Like in Case 1, we can show that b^^^O implies that the 
lower left hand corner of Nf^ must be zero, or a„*.^i *0. 

For the latter part we consider the case of a^p ^ 0 and 


b* ■ 0 as described below; 


'o 1 

• o" 




“v ■ 




1 




0_ 


_1_ 


(A.15) 


First we show that this case can be reduced to the case of 
A;,^ « 0 and b* # 0 as treated in the former part. For proving 
this, we need the next claim, which is easy to verify, 

Claim 1. An interval system (A, b) is controllability invariant 
if and only if an interval system (A^,b^l defined below 
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controllability invariant. 


a- 


Considering the case of u^p-1, and define 

(^hpf ^hp) as follows. 


rank (^4 fr) = n, (A.20) 

for any h^. By considering the case where the first column 
of A take a value of 0, we sec 




itino aimiluriftr i-»f fA 1 / A 1 £\ 


use of Oaim 1 we sec that controllability invariance of the 
system (A. 15) is equivalent to controllability invariance of 
the system (A. 17), which belongs to the class of systems as 
treated in the former case (i.e. « 0, 6;^ ^0). We therefore 

conclude from the discussion for the former part that if 
(thp ^ 0, then “ 0 tor all ]:sii^p-l, and 

“0. This completes the proof of Lemma. 


B. Proof of Proposition 1 

Necessity. Suppose that a standard system described below 
is controllability invariant 

fa,, e, a„ ■ J|„ ”1 r ^1 1 


This implies the two conditions of Lemma, which include 
Condition 2 of Definition 5 as a subset. It thus remains to 
show Condition 3 of Definition 5 under the assumption, i.e. 

rank[A“.v7 h] = n, (A. 19) 


fln 

**22 ^2 


<l|n b. 


^detAir^O, fA.2l) 


for any b,. Expanding M with respect to the first column 
of M yields 


det M — A(d,) 4- A(a 22 ) • -f A(a„ 2 ) 

(A.22) 

where A(*) is the colactor of ♦. All terms of the above 
equation except for the first term contain uncertain entries, 
varying independently, hence these terms must be equal 
to 0 Repeating the same discussion for the remaining first 
term leads to the following equation. 

deliW = fLl9. • . (A 2.3) 


Therefore the conditions of Definition 5 hold for the System 
(A, h), namely this system has a generalized anti-symmetric 
stepwise configuration. 

Sufficiency, Suppose that Af has a generalized anti¬ 
symmetric stepwise configuration. Then by Condition 2 of 
Definition 5 the matrix 


N-lA-i/ h\, 


for every j e C and every a,j, b, e R. Taking j = 0 in (A. 19) 


has the form: 



where #9 is a sign-invariant entry, and ♦ is a sign-varying entry. 

Ut the (h p) entry of N be the lowest and rightmost has a full rank both for ,r = 0 and for s*0. by taking 

sign-varying entry in the upper part of sign-invariant entries Condition 3 of Definition 5 into consideration and by 

(we set fi = 0 if there is no sign-varying entry in the upper elementary transformation, respectively, 

part of sign-invariant entries). In the sequel, we prove that N (1) The case with s — 0. 


AUTO 29:1-B 
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Taking 5 = 0 in (A.25) yields 


h 

0 0 * * • 

0 0 19 * 


p-1 p p+1 

♦ * * 0 

• 0 


0 

0 


0 

0 




0 ^ • 0 
OfdOOOOOO 
• e 0 


0 h 

0 h + l 
0 


(A.26) 


0 0 • 
0 0 ■ ■ 0 • 

4i * • ■ 4< * 

+ * • • * * 

* 

4t * * * 


* 0 

* 


0 

6 0 

♦ 0 


* * 


* Hi 


1 


Eliminating the first column of N above yields the right 
matrix M of the associated matrix M, whose determinant is 
equal by Condition 3 of Definition 5 to the product of all 
diagonal elements 00 ■ ' 0. So S has a full rank. 


(2) The case with s ^ 0. 

By shifting the first h columns of N to the right by one 
column and at the same time moving the (h + l)th column to 
the first column, we obtain 


h h + [ 


p-1 p p + 1 


♦ -s 0 * f 

0 ~s 6 * 


* * 0 ■ • 0 
* 0 0 




0 0 
> - j 0 


—5 • • • 

0 61 0 0 

♦ - j 0 0 


0 

0 


4> 


♦ 


• * 0 

0 0 0 


0 h 

0 /i + l 

0 


(A.27) 


0 0 

*-j 6 0 

♦ -A- 0 

* ~ s 


* * * 


0 

0 

0 

0 

0 

* — s 0 


P - 1 
P 


It is then easy to see that the determinant of (n x n) right for every seC and every a,^, b, e R. Hence (A, b) is 
submatrix is equal to a nonzero value 61"'^ x By (1) controllability invariant. This completes the proof 

and (2) as above, it follows 

rank[A-j7 h] = n. 


(A.28) 
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Abslncl —This paper surveys a subset of the body of 
research which was sparked by Kharitonov's Tlieorem. The 
focal pH)int is extreme point results for robust stability and 
robust performance. That is, we give conditions under which 
satisfaction of a performance specification is ascertained for a 
family of systems by only checking a finite subset of the 
extreme members of this family. The results which arc 
surveyed apply mainly to systems with structuied real 
parametric uncertainty. In addition, a number of coun¬ 
terexamples are given to illustrate cases for which an extreme 
point result does not hold. For such cases, a solution via the 
so-called Edge 'Fheorem is often possible. 

1. INTRODUCTION 

This survey paper concentrates on a subset of 
the major developments in robust control over 
the last decade—new extreme point results 
which facilitate robustness analysis in the 
presence of real parametric uncertainty. When 
we refer to an extreme point result in this paper, 
we mean roughly the following; It is a result 
which enables us to infer that some desired 
property of a control system is robustly satisfied 
by checking the satisfaction of this property on a 
finite subset of the “extreme” systems which 
may arise. For example, if the bounding set for 
the uncertain parameters is an ^-dimensional 
box Q and we want to have a system with 
damping ratio ^ <0.707 for all q eQ, then the 
following question is of interest; If the damping 
ratio specification is satisfied for each of the 2^ 
parameter settings associated with the vertices of 
Q, does it follow that this specification is also 
satisfied for all qeQl If so, we say that an 

* Received 1 November 1991; received in final form 10 
June 1992. This paper was not presented at any IFAC 
meeting. This paper was recommended for publication in 
revised form by Guest Editor J. E. Ackermann. 

t This work was supported by the National Science 
Foundation under Grant Number ECS-9111570. 

t Department of Electrical and Computer Engineering, 
University of Wisconsin-Madison, Madison, WI 53706, 
U.S.A. 


extreme point result holds. Most of the extreme 
point results obtained to date apply to the 
Robust Stability Problem. However, over the last 
few years, a number of extreme point results 
have emerged for various Robust Performance 
Problems as well. The results which we survey 
are confined to these two problem areas. In 
addition, this paper includes a number of 
counterexamples to conjectures which one is 
tempted to make. That is, in many cases, a 
certain robustness condition holds at the 
extremes but fails to hold for the entire family of 
interest. 

To put this survey in historical perspective, we 
begin by noting that robustness analysis with real 
parametric uncertainty received considerable 
attention in the imf>ortanl pioneering work of 
researchers such as Horowitz (1963), Siljak 
(1969) and Ackermann (1980). The explicit topic 
of this survey, however, is clearly motivated by 
the seminal theorem of Kharitonov (1978a). 
Once Kharitonov’s Theorem came to light in the 
control community via the papers of Barmish 
(1983) and Bialas (1983), the pace of robustness 
research on real parametric uncertainty seems to 
have accelerated rapidly. Finally, it is also 
important to mention a second line of research 
which is also motivated by Kharitonov’s 
Theorem but is only minimally covered here: 
the line of research beginning with the 
celebrated Edge Theorem of Bartlett et al 
(1988). Given the fact that inclusion of a survey 
of edge results would probably double the length 
of this exposition, we opt for a sharper focus and 
concentrate exclusively on extreme points; for a 
more complete exposition covering the entire 
area, see the forthcoming textbook by Barmish 
(1993), Since this survey deals with “robustified 
versions” of a number of age old problems about 
polynomials, an important background reference 
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to mention is the classical book by Marden 
(1966). 

Throughout this paper, q = (qu <} 2 , • ■ ■ > 9 r) ^ 
represents a vector of real uncertain 
parameters. For example, in the well-known 
track-guided bus problem of Ackermann (1985), 
qi can represent a coefficient of friction between 
the tires of a vehicle and the surface of the road. 
This parameter is uncertain because it can be 
rainy on one day and dry on another. For 
robustness analysis problems involving many 
uncertain parameters, a major challenge is to 
come up with a method of testing for satisfaction 
of the performance specification without having 
to deal with a large number of contingencies. A 
naive approach to robustness analysis involves 
carrying out a classical analysis for many 
combinations of {qi, q 2 , . ■ . , qf) obtained by 
gridding the admissible range of each q,. In 
practice, however, this approach becomes 
computationally intractable as the number of 
uncertainties ^ increases. Furthermore, if one 
wants to know how various controller para¬ 
meters affect performance, it can be argued that 
this important information is lost in a gridding 
process. In fact, when carrying out robustness 
analysis, even an “intelligent” gridding process 
such as a branch-and-bound algorithm encoun¬ 
ters this difficulty. The desire to avoid gridding 
further motivates the search for extreme point 
results. 

By and large, the body of extreme point 
results which have emerged applies to feedback 
control problems with structured real uncertainty. 
Moreover, almost all extreme point results apply 
to either independent uncertainty structures or 
affine linear uncertainty structures. On a few 
occasions, multilinear uncertainty structures are 
also considered in the sequel. We now explain 
these terms. 


Uncertainty structures 

The paradigm for the discussion to follow 
begins with an uncertain SISO plant. Rather 
than using the “usual” P(s) notation to describe 
the plant, we use the notation P(s, q) to 
emphasize the dependence of plant coefficients 
on the uncertain parameter vector q ; we write 


Pis,q) = 


q) 

D{s, q )' 


where N{s, q) and D(s, q) are uncertain 
polynomials. That is, each of these polynomials 
has coefficients which are functions of q. To 
describe an uncertain polynomial, it is con¬ 
venient to write 

n 

p{s. <?) = S a,iqy- 


An uncertain polynomial is said to have an 
affine linear uncertainty structure if a^q) is an 
affine linear function for i = 0 , 1, 2 ,. . . , n; that 
is, a linear function plus a constant. For 
example, observe that 0 ,( 9 ) = 4^i-I -692 — 7 ^ 3 -I- 
5 is an affine linear function. We say that the 
uncertain plant P{s, q) has an affine linear 
uncertainty structure if both the numerator 
N{s, q) and the denominator D(s, q) each have 
affine linear uncertainty structures. 

A special case of an affine linear uncertainty 
structure is an independent uncertainty structure. 
In this case, each component q, of q enters into 
only one coefficient. For example, the uncertain 
polynomial p{s, q) = s^ + {A + qi + 6 ^ 2 )^^ + (7 + 
^3 + 2q^s -I- (5 -I- q^) has an independent uncer¬ 
tainty structure. When dealing with an inde¬ 
pendent uncertainty structure, we generally 
lump uncertainties and simply write 

n 

pis, q) = 'Z 


Notice that in this case, we allow a zeroth 
component of q. In the case of rational 
functions, the uncertain plant P{s, is said to 
have an independent uncertainty structure if 
both numerator N{s, q) and denominator 
D{s, q) have independent uncertainty structures 
and, in addition, no uncertain parameter 
entering into the numerator enters into the 
denominator and vice versa. To emphasize this 
uncertainty decoupling between numerator and 
denominator, we introduce a second uncertain 
parameter vector r e and write 


q, r) = 


q) 

D{s, r) 


E qts‘ 


S r,s' 


1=0 


We conclude this subsection with one last 
definition: The uncertain polynomial p{s,q) = 

n 

E aXq)s‘ is said to have a multilinear uncertainty 
1-0 

structure if a,(q) is a multilinear afhne function 
for I = 0 , 1 , 2 , . . . , n; that is, if all but one 
component q^ of q is fixed, then a,{q) is affine 
linear in q/,. A simple example of a multilinear 
affine coefficient function is given by a,(q) = 
5^19293 + 2^2 - 693 + 492^3 + 5. 


2 FURTHER MOTIVATION 
Given that we restrict most of our attention in 
this survey to affine linear and independent 
uncertainty structures, it can rightfully be argued 
that the problems discussed here are special 
cases of more general problems considered 
elsewhere in the literature. For example, as 
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pointed out by Chen et al. (1992), a robust 
stability problem with affine linear uncertainty 
structure can be viewed as a rank one ^ 
problem. Hence, it is correct to say that many of 
the results surveyed here are solutions to rather 
special cases of problems which can be attacked 
using the method of computation associated with 
theories such as those in Doyle (1982) or 
Safonov (1982). This leads us to consider two 
questions. 

The first question which we address is: Are 
there advantages associated with extreme point 
solution when other techniques are available? To 
answer this question, we begin by noting that the 
computational complexity associated with ex¬ 
treme point solution can be dramatically lower 
than other solution methods. Perhaps the best 
example illustrating this point is Kharitonov’s 
Theorem (see Section 5). For an nth order 
polynomial with independent uncertainty struc¬ 
ture and interval bounds for the uncertain 
parameters q,, Kharitonov’s Theorem tells us 
that only four polynomials need to be tested to 
ascertain robust stability. 

Without knowing Kharitonov’s Theorem, one 
might erroneously conclude that a difficult 
nonlinear program must be solved in order to 
ascertain robust stability. To illustrate, let A,iq) 
denote the iih leading principal minor of the 
Hurwitz matrix for the uncertain polynomial 
p{s, q) and observe that A^q) is typically a 
nonlinear function of q. In view of the fact that 
robust stability is equivalent to positivity of all 
minors A,( 9 ), we can considei a nonlinear 
programming problem with objective function 

y(9) = inin ^Xq), 

I 

and constraint qeQ. Clearly, robust stability is 
guaranteed if and only if 

min7(^) ^ 0 . 

In other words, without knowing Kharitonov’s 
Theorem, common sense considerations might 
lead to an overly complicated nonlinear pro¬ 
gramming formulation. A similar motivation can 
be given for some of the special cases of ju 
theory. Once one recognizes the rank one 
structure, computational tractability is achieved 
in the sense that a “precise” global solution to 
the fi optimization is guaranteed via convex 
programming. 

Further motivation for development of an 
extreme point theory is derived from the fact 
that only a few special cases of the robust 
stabilization problem have been solved for 
systems with structured real uncertainty; e.g. see 


the two plant solution in Vidyasagar and 
Viswanadham (1982), the solution for minimum 
phase systems in Kwakernaak (1982) and 
Barmish and Wei (1985), the solution for 
systems with a single uncertain parameter 
entering affinely into the plant in Khargonekar 
and Tannenbaum (1985) and the solution for 
systems satisfying a so-called matching condition 
as in Leitmann (1979). The key point to note is 
that an extreme point result reduces the problem 
of stabilizing infinitely many plants to the 
problem of stabilizing finitely many plants. If we 
restrict attention to compensators C( 5 ) with 
some fixed structure parametrized by a vector 
K = K 2 . ‘ , Kp) and consider the Hurwitz 

matrix inequalities A,(</, K )>0 for stability of 
the closed loop characteristic polynomial with q 
restricted to the finite set of extreme points, a 
synthesis becomes possible for small values of p. 
For example, if C(.y) = K, 4-K 2 /.V is a PI 
compensator, we can display the satisfaction set 
for these inequalities in the (K,, ^ 2 ) plane; e.g. 
see Barmish et ai (1992) for further details. 

With regard to line of research being 
surveyed, another important question to ask is: 
How rich is the class of ‘real world” control 
problems which can be attacked using a 
robustness theory based on affine linearity of the 
uncertainty structure? Although this class is 
admittedly “thin” in some appropriately defined 
problem space, we argue that in a pragmatic 
sense, the engineer often has little choice other 
than turning to existing theory for the affine 
linear case. To elaborate on this point, consider 
the following typical applications scenario: The 
engineer derives the characteristic polynomial 
p{s, q) for feedback system by performing a 
calculation which is rather complicated (perhaps 
facilitated using a symbolic manipulator). The 
coefficient functions a,(q) involve rather ”ugly” 
nonlinear expressions in the components q, of q \ 
it is not unusual for a,{q) to have hundreds of 
terms! Subsequently, the goal is to obtain some 
sort of robustness margin but it turns out that 
existing literature is of questionable use because 
the a^(q) are so complicated. In this situation, we 
argue that a “pragmatic” solution might involve 
selection of some fixed nominal q* e Q followed 
by a robustness computation assuming inde¬ 
pendent coefficient variations; For example, one 
can begin with the nominal closed loop 
polynomial 

PU, 9*) = i: aXq*), 

f ^'0 

and study percentage variations which can be 
tolerated in the coefficients before instability 
occurs. Within the framework of this survey, this 
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is easily accomplished by working with the 
uncertain polynomial 

n 

P{s, q) =p{s, q*) + 2 qi^‘> 

(-0 

with variable uncertainty bounds 

-ra,(q*)^q,^ra,(q*). 

Now, by computing the largest value of r, call 
it for which robust stability is guaranteed, 
we obtain a robustness margin measured in terms 
of a percentage error with respect to nominal 
coefficients. To gain additional information about 
the degree of robustness, one can also perform a 
small gain calculation with the point of view that 
the nominal closed loop system is corrupted by 
unmodelled dynamics instead of complicated 
parametric uncertainty. 

To conclude this section on motivation, it is 
also worth noting that afhne linear uncertain 
structures are often quite useful for overbound¬ 
ing or approximating more general uncertainty 
structures. To illustrate, for the case of the 
multilinear uncertainty structures, the Mapping 
Theorem (for example, see Zadeh and Desoer 
(1963)) provides a description of the convex hull 
of the coefficient set. For this case, the theory to 
follow can be applied to this convex hull. 

3 NOTATION AND TECHNICAL JARGON 
In this brief section, we provide the notation 
and technical jargon which is used in the sequel. 

Convention for real vs complex coefficients 
Throughout this paper, the standing assump¬ 
tion is that all polynomials have real coefficients. 
On the few occasions when we assume complex 
coefficients, we state this assumption explicitly. 
In this regard, our point of view is as follows: in 
most cases, the results given here are easily 
modified to obtain complex coefficient ana¬ 
logues. Since the proof modifications for the 
complex coefficient case are usually straightfor¬ 
ward, this survey avoids double citations; i.e. 
one citation for the first paper to provide a result 
and a second citation for a minor extension to 
the complex case. 

Bounding boxes and extreme points 
Throughout this paper, the uncertain para¬ 
meter vectors q and r are confined to 
multidimensional boxes Q and R, respectively. 
The box description is motivated by applications 
in which each component of the uncertain 
parameter vector is known within given bounds. 
To describe these boxes, we assume com¬ 
ponentwise bounds 

q7^q^^qt\ r;:^r^:sr;. 


Since the focal point of this paper is extreme 
point results, it is important to introduce a 
notation for the extreme points of Q and /?. To 
this end we use superscript notation q* and r' to 
denote the /th extreme points of Q and R, 
respectively. Associated with each extreme point 
of the uncertainty bounding sets is a distin¬ 
guished polynomial or plant. For example, if 
is an extreme point of Q, then p(s, ^*') = 

n 

2 is the associated extreme polynomial. 

Similarly, if {q*\ r'^) is an extreme point pair for 
Q X then P{s, q'\ r*^) is the associated 
extreme plant. If and R^W^, notice 

that there are extreme plants at most. 

Interval polynomial families and polytopes of 
polynomials 

If p{s, q) is an uncertain polynomial having 
independent uncertainty structure, we hence¬ 
forth refer to the set ?/= {p(-, q):q eQ) as an 
interval polynomial family. For convenience, we 
use the notation 

pis, q) = 'Z [q7, q7]s‘. 

I 0 

When dealing with a plant P{s, q, r) with 
independent uncertainty structure, we adopt the 
phraseology of Chapellal and Bhattacharyya 
(1989) and simply call {P{-, q, r):q ^ Q-,r e 
/?} an interval plant family. 

If an uncertain polynomial p{s, q) has an 
affine linear uncertainty structure, then the 
associated coefficient set 

aiQ) = {Mq). a^iq )— , aniq))'q eQ), 

is a polytope in In fact, by standard 

arguments from convex analysis (for example, 
see Rockafellar (1970)), we can express a(Q) as 
the convex hull of the generators a{q')\ i.e. 

a( 0 ) = conv {< 1 ( 9 ')}. 

Hence, wc create an association between 
polytopes in and families of polynomials 

with affine linear uncertainty structures. We note 
that every polynomial in the family ^ = 
{pi’s q) ’q ^ Q} can be expressed as a convex 
combination of the p{s,q*), i.e. ^ = 
conv {pis, q')). This further justifies calling the 
family a poly tope of polynomials. 

Monicity, properness and invariant degree 
When dealing with a robustness analysis 
problem, technical subtleties arise if the 

n 

uncertain polynomial pis, g) = E u,iq)s* can 

i»0 

drop in degree; i.e. if a„iq) = 0 for some 
admissible value of q. We opt to keep the 
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exposition in this survey at a more simple 
technical level via imposition of an invariant 
degree assumption wherever appropriate. To 
this end, when working with a feedback system, 
whenever needed, we assume monicity of the 
plant denominator and strict properness of the 
plant. This guarantees invariant degree of the 
closed loop polynomial for large classes of 
feedback systems with a proper compensator. 
For an example of a paper dealing with the 
subtleties of degree dropping, see Sideris and 
Barmish (1989). 


4. INTERPLAY BETWEEN CONTROL RESULTS 
AND POLYNOMIAL RESULTS 

Throughout the literature on extreme points, 
there is constantly an interplay between 
polynomial results and control vSystem results—a 
new result involving robustness of polynomials is 
“dressed” in control clothing and conversely, the 
control clothing is often “removed” in order to 
isolate fundamental polynomial problems. In 
large measure, this interplay is motivated by a 
fundamental lemma which we provide below. 

To motivate this lemma, let P{s, q) be an 
uncertain plant with atfine linear uncertainty 
structure which is connected in a teedback 
configuration with a compensator C(.v) as shown 
in Fig. 1. Now, if the compensator is expressed 
as the quotient of polynomials 


C{s) = 


K(s) 

A(.9) ’ 


the closed loop transfer function is 

p ( ) = _ 

■*’ ^ N{s, g)A/.(.v) + f){s, q)l\{x) 
DclA, 9 ) 

By taking note of the fact that the mappings 
from q to the coefficients of Nci{s,q) and 
q) are affine linear, the following lemma 
is easy to prove. 


Lemma 4.1. (Affine linear uncertainty preserva¬ 
tion.) Consider an uncertain plant connected in 
feedback configuration as in Fig. 1 and assume 
that P{s, q) has an affine linear uncertainty 



Y{») 


Structure. Then it follows that the uncertain 
closed loop transfer function Pchi^, q) also has 
an affine linear uncertainty structure. 

The lemma above tells us that an affine linear 
uncertainty structure is preserved in going from 
the open loop to the closed loop. This provides 
significant motivation for development of rob¬ 
ustness results at the level of uncertain 
polynomials with affine linear uncertainty struc¬ 
tures; i.e. there is typically an interpretation of a 
polynomial result in terms of the feedback 
system in Fig. 1. It should also be noted that 
there are many variations on Lemma 4.1 which 
can be provided. For example, if the compen¬ 
sator C(5) is connected in cascade with the plant 
and a unity feedback is used, the affine linear 
uncertainty structure is again preserved. More 
generally, the preservation of the affine linear 
uncertainty structure is guaranteed for all 
transfer functions of practical interest. For 
example, a simple computation of the closed 
loop sensitivity results in 

" 1 + Pis, q)C(s) 

Dis. 9)D,Cv) 

^(■5. (l)Kis) + D(s. 9)0,(j) ■ 

It is easy to verify that if P{s, q) has an affine 
linear uncertainty structure, then 5(5, q) also has 
this same structure. A similar result holds for the 
complementary sensitivity function. 


.*) ROBUST STABILITY AND KHARITONOV’S 
THEOREM 

We begin with some standard definitions: A 
fixed polynomial p(s) is said to be stable if all its 
roots lie in the open left half plane. Given a 
family of polynomials ^ = {pi-, q) 'q € Q}, we 
say that 9^ is robustly stable if all polynomials in 
are stable. If all polynomials in ^ have the 
same degree, then we .say that has invariant 

n 

degree\ i.e. if p{s, q)= T, ai(q)s\ then, recalling 

, oO 

the discussion in Section 3, 9^ has invariant 
degree if and only if fl„(^) =^0 for all q eQ. 

The point of departure for the literature being 
surveyed is Kharitonov’s seminal theorem. For 
an interval polynomial family 9** described by 

n 

p{s,q)= E [qrfq^]s\ we extract four distin- 
»-(» 

guished members 

= qo + qU + qiA + q^s^ + qts 

+ q^s^ H 

= qii + qT^ + qU^ + q^A + q^x* 

+ q;s'^ + 


Fig. 1 Clas.sical Feedback configuration 
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K3(s) = qo +qiS + q 2 S^ + q3S^ + q^J^ 

+ q?s^+-..-, 

^4(5) = q? + qis + + qU^ + ^4 

+ qis^ + ---\ 

which are referred to as the Kharitonov 
polynomials. For example, if 

p(s, q) = 5 ** +[0.74, 1.26Jj’ + [0.75. 1.25^ 

+ [1, 2]J-' + [2.75, 3.25]j" + [8.75, 9.25]s 
+ [0.76, 10.25], 

then the Kharitonov polynomials are 

K,(j) = j‘ + 1.26s’ + 1.25s" + s^ + 2.75s- 
+ 9.25s+ 10.25; 

K^is) = s‘ + 0.74s’ + 0.75s" + Iv’ + 3.25s^ 

+ 8.75s+0.76; 

Kjis) = s" + 1.26s’ + 0.75s" + s’ + 3.25s- 
+ 9.25s+ 0.76; 

K^{s) = s* + 0.74s’ + 1.25s" + 2s’ + 2.75s" 

+ 8.75s + 10.25. 

Theorem 5.1. (Kharitonov (1978a)): Let be 
an interval polynomial family with invariant 
degree. Then ^ is robustly stable if and only if 
its associated four Kharitonov polynomials are 
stable. 

The power of Kharitonov’s theorem is derived 
from the fact that we can determine if ^ is 
robustly stable by checking only four fixed 
polynomials. In other words, four distinguished 
extreme points of Q tell the entire story. A nice 
proof of the theorem is given by Minnichelli et 
al. (1989); the paper by Dasgupta (1988) 
provides the essential geometric concepts which 
pave the way for the proof. 

The Dasgupta geometry 

Understanding of the extreme point nature of 
Kharitonov’s solution is enhanced by considering 
the interval polynomial geometry of Dasgupta 
(1988). To this end, we temporarily freeze the 
frequency; say s=/a)o with fixed WoeR. To 
avoid trivialities, we also suppose that p(s, q*) is 
stable for at least one q* e Q. Next, for the 
interval polynomial family we construct the 
associated four Kharitonov polynomials Xi(s), 
K^is), K 2 (s) and ^ 4 ( 5 ) and generate a rectangle 
p(/tOo> Q) with vertices K.Oai,,) as ,hown in Fig. 
2. This rectangle is called the value set because it 
corresponds to the set of possible values that 
p{jtiOaf q) can assume as q ranges over Q-, i.e. 

PU(^o, Q) = {p(jiOo, q):qeQ}. 


KiOwo) A'i(;mo) 



Kiijuhj) 


Fin 2 Simplified description of the Kharitonov rectangle for 
frequency 


Using simple ideas about continuous dependence 
of roots of p(a, q) on q, it is easy to show that if 
^ has invariant degree and at least one stable 
member p(s, then robust stability is 

equivalent to satisfaction of the Zero Exclusion 
Condition 


Oipijo), Q), 

for all to ^ 0. In fact, the equivalence between 
zero exclusion and robust stability is not specific 
to interval polynomials. The Zero Exclusion 
Condition remains valid under much weaker 
conditions, e.g. if has invariant degree, Q is 
pathwise connected and p{s, q) has coefficients 
depending continuously on q. This idea goes 
back at least as far as the paper by Frazer and 
Duncan (1929). 

The basic geometry associated with the zero 
exclusion condition is more fully demonstrated 
in Fig. 3. For ^) = [0.25, 1.25].v^+ 

[2.75, 3.25]r -f [0.75, 1.25].v + [0.25, 1.25], the 
rectangular value set /7(/fn, (2) is displayed for 
0<(i>!<l. Using the four Kharitonov polyno¬ 
mials, we can also interpret zero exclusion in 
terms of positivity of a specially constructed 
frequency dependent function H{(o), With the 
aid of the geometry in Fig. 2, the result in 
Barmish (1989) is easily established; if an 
interval polynomial family has invariant degree 
and at least one stable member, then positivity of 



Fig. 3. Motion of the Kharitonov rectangle for 0^ oj 2 1. 
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the function 

H{(o) - max {Re K 2 (jof), -Re K^(j(i)), 
ImK^ijo)), -ImATaOa))), 
for all w 2:0 iv equivalent to robust stability of 

Polygonal value sets 

The geometrical ideas above also carry over to 
the more general framework of polytopes of 
polynomials; If ^ = {pi.-.q)-q^Q) is a 
polytope of polynomials, then the value set 
p(/ct>oi Q) af frequency oio is the polygon in the 
complex plane with generating set q^)) \ 

i,e. p(y(Uo, Q) = conv {p(ycuo, q')). The ideas 
underlying value set analysis go all the way back 
to the book by Zadeh and Desoer (1963). After 
laying dormant for over 20 years, we attribute 
the revival of these geometrical ideas to Saeki 
(1986) and de Gaston and Safonov (1988). 

Refinements, embellishments and extensions 

One of the few complex coefficient results 
which we mention is the followup paper by 
Kharitonov (1978b). For interval polynomials 
with complex coefficients with box bounds for 
both real and imaginary part, the “magic 
number” is eight instead of four—robust stability 
is guaranteed if and only if eight distinguished 
extreme polynomials arc stable. 

We now describe some results concerning the 
strengthening of Kharitonov’s Theorem for low 
order polynomials. For interval polynomial 
families with invariant degree and order n < 5, 
special properties of the rectangular value set 
can be exploited to obtain the results of 
Anderson et al (1987). If n = 3 and the lowest 
order coefficient bound satisfies q^ > 0 , then 
is robustly stable if and only if the single 
Kharitonov polynomial /C 4 (.^) is stable. If n = 4 
and qo > 0 , is robustly stable if and only if the 
two Kharitonov polynomials Ki(s) and K^{s) are 
stable. Finally, for n = 5, 9/^ is robustly stable if 
and only if the three Kharitonov polynomials 
K^{s), K'^{s) and K 4 (i') are stable. Note that the 
assumption <70 >0 above is not restrictive 
because a necessary condition for stability is that 
all coefficients have the same sign. 

A simple control theoretic interpretation of 
Kharitonov’s Theorem is given in Ghosh (1985) 
where an interval plant family with pure gain 
comp)ensator C{j) =/C is considered. With 
configuration as in Fig. 1, it is easy to see that 
the denominator DciX?. q) = Rb/{s, q) - 1 - D{s, q) 
of the closed loop transfer function still has an 
independent uncertainty structure. Hence, 
closed loop stability can be studied using 
Kharitonov’s Theorem. This type of reasoning 


leads to an eight polynomial test for robust 
stability—four polynomials for and four 

polynomials for /C< 0 . Furthermore, this same 
analysis implies that the “worst case” gain 
margin is achieved by a Kharitonov polynomial; 
see Section 9 for further discussion. 

To conclude this section, we mention an 
extension of the theory which applies to 
multilinear uncertainty structures satisfying a 
so-called even-odd decoupling condition: We 
assume that any uncertain parameter q^ which 
enters into an even order coefficient fl 2 i(^) of 
p(s, q) does not enter into any odd order 
coefficient, and conversely, if qf^ enters into an 
odd order coefficient of p{s, q), then it 

does not enter into an even order coefficient. 
Using this condition, the paper by Kharitonov 
(1979) provides the following result: Suppose 
{p{', q):q eQ) is a family of polynomials 
which has invariant degree, multilinear uncer¬ 
tainty structure and satisfies the even-odd 
decoupling condition. Then ^ is robustly stable 
if and only if p{s, q') is stable for all extreme 
points q' of Q. Underlying a simple proof of this 
result is the basic fact that the value set p{jv\ G) 
remains rectangular when the even-odd decou¬ 
pling condition is satisfied; sec also Panier ct al. 
(1989) for further extensions. 

6. OTHER ROOT LOC ATION REGIONS 
After reading Kharitonov’s Theorem, an 
obvious question to ask is: To what extent can 
extreme point results be given for root location 
regions other than the strict left half plane? For 
example, for discrete time control problems, the 
focal point is the unit disc. More generally, given 
a family of polynomials = (p(-, q)\q € Q) and 
a desired open root location region c C, we 
say that ^ is robustly 9/)-stahle if, for each q e Q, 
p{s, q) has all its roots in We reserve the 
phrase robust stability for the special case when 
^ is the open left half plane. For the special case 
when is the open unit disc, we use the phrase 
robust Schur stability. 

We call 91) a weak Kharitonov region if the 
following condition holds: Given any interval 
polynomial family having invariant degree, 
stability of p{s, q^) for all extreme points q' of Q 
implies robust li)-stability of Taking n to be 
the degree of all polynomials in 9/^, we call 9^ a 
strong Kharitonov region if there exists an index 
set 7(^2)) c {1, 2, 3, . . . , 2"^*} with cardinality 
independent of n and associated labelling of 
extreme polynomials (pCv, ^')} such that 9d- 
stability of p{s, q') for i€l(9d) implies robust 
2^-stability of The understanding above is 
that the labelling scheme for the extreme 


AUTO 29:1-C 
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polynomials may depend on n. For example, in 
Kharitonov’s Theorem, by appropriate labelling 
of extreme points, we can take /(S) = 
{1,2,3,4}. From the point of view of 
computational complexity, it is obvious that 
results involving strong Kharitonov regions are 
quite powerful when the cardinality of is 
small. 

When studying the robust Schur stability 
problem, the first obvious conjecture to make is 
that the open unit disc is a weak Kharitonov 
region. However, a number of authors provide 
counterexamples to such a conjecture. For 
example, in Bose and Zeheb (1986), the interval 
polynomial family described by p(s, q)=s‘* + 
(-^, - i has all extremes with roots 

in the open unit disc. However, the “intermedi¬ 
ate” polynomial p*(s) = 5 “ -t- i is a member 
of the family and has two root s = ±y 1.3025 
which are outside the unit disc. Although the 
open unit disc fails to be a weak Kharitonov 
region, the paper by Petersen (1989) indicates 
that there are other important classes of weak 
Kharitonov regions. Such classes include regions 
such as shifted half planes, damping cones and 
circles which are entirely contained in the open 
left half plane. Petersen also goes on to show 
that the intersection of any two of his regions is 
once again a weak Kharitonov region. In the 
theorem below, a much more general charac¬ 
terization of weak Kharitonov regions is given. 

Theorem 6.1. (Rantzer (1992b)) Suppose 3? c 
C. Then ^ is a weak Kharitonov region if S') and 
1/3) = {z:zd = 1 for some d e3>} are convex. 

In fact, the paper by Rantzer actually gives an 
even stronger result: For the case of complex 
coefficient polynomials, convexity of both 3) and 
1/3) is also necessary in order for 3 to be a weak 
Kharitonov region. Note that Rantzer’s theorem 
enhances our understanding of a number of facts 
which we already know. For example, consistent 
with the fact that general extreme points are not 
available for the robust Schur stability problem, 
take 3 to be the open unit disc and notice that 
1/3S = {z: |z| > 1} is not convex. On the other 
hand if S' is a half plane or a convex cone, then 
its reciprocals are easily seen to be convex. This 
is consistent with the results of Petersen (1989). 

More on robust Schur stability problems 

Given that the unit disc is not a weak 
Kharitonov region, a number of authors have 
considered the following question; For an 
interval polynomial, what reasonable strength¬ 
ening of hypotheses enables us to infer robust 
Schur stability from Schur stability of the 


extreme points? In addressing this question, the 
first point to note is that it is no accident that a 
fourth order polynomial is used above to 
demonstrate that the unit disc is not a weak 
Kharitonov region. In fact, for monic polyno¬ 
mials of order n^3, the papers by Cieslik 
(1987) and Kraus et al. (1988) tell us that robust 
Schur stability is guaranteed if and only if all of 
the extreme polynomials p{s, q‘) are Schur 
stable. For higher order polynomials, an extreme 
point result of Hollot and Bartlett (1986) applies 
to the special case when roughly half the 
coefficients of p{s, q) are fixed. Since p(l, q)^0 
is necessary for Schur stability, without loss of 

generality, the assumption E q,~>0 is imposed 

i-O 

in the theorem below. 

Theorem 6.2. (Hollot and Bartlett (1986)) 
Consider the discrete time interval polynomial 

n 

family ^ described by p{s,q)= E 

and in addition, assume that and 

f-O 

q7 = for /= [nl2\ + 1, \nl2\ -1-2, . . . , n 
where [«/2j denotes the largest integer less than 
or equal to n/2. Then P? is robustly Schur stable 
if and only if all of the extreme polynomials 
pis, q^) are Schur stable. 

To conclude this subsection, we mention the 
result of Soh (1990) for robust Schur aperiodi- 
city: We say that the interval polynomial family 
{pi'> q)'’q ^Q) is robustly aperiodic with 
respect to the unit disc if, for all q eQ, all roots 
of p{s, q) are real, positive, distinct and lie in 
(0,1). Within this framework, Soh identifies two 
distinguished extreme polynomials p{s, q^) and 
p(s, q^) which tell the entire story; i.e. SP is 
robustly aperiodic with respect to the unit disc if 
and only if all roots of pis, q') and pis, q^) lie in 
(0,1) and are distinct. 

Damping cones 

When dealing with an nth order interval 
polynomial and a weak Kharitonov region 
note that a stability test can require working with 
as many as 2”'^' extreme polynomials. We now 
describe a pair of results which indicate that 
damping cones enable us to carry out a robust 
stability test with much smaller number of 
extremes. In the two theorems below, we refer 
to the damping cone 9) shown in Fig. 4. 

Theorem 6.3. (Foo and Soh (1989)) Given a 
damping cone 9) and an nth order interval 
polynomial family ^ having invariant degree. 
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Fig 4 Damping cone as a ® region. 


there exists a distinguished subset of, at most, 
2 (n + 1 ) extreme polynomials {p(j, q'),p(s, q^), 

. . . , p(s, q^^"*'^)} having the following pro¬ 
perty: ^ is robustly S-stable if and only if 
p(s, q') is S-stable for i = 1, 2.2(« -I- 1). 

The paper by Foo and Soh (1989) also 
includes a simple recipe for construction of the 
distinguished extreme polynomials. For ex¬ 
ample, for n = 3, there are eight distinguished 
extremes given by 

pis, q') = q^ +qts +q7s^ + qt'i\ 
p(s, q^) = qo +q'is +q^s^ + q;s''- 
pis,q^) = q^, +qU' -^qU^ + q'^s'-, 
p(s, q*) = qu +qTs + q7s" + qts^; 
pis, = + qis + q 2 S- + qU'^\ 

pis, q^) = 9,7 -I- qts + qis^ -t- q^s'; 
pis, q'^) = 9,7 + 97*5 + 97 .r + 97 .s’; 
pis, 9*^) = 9,T + qis + q 2 s' + q^s\ 

In a later paper by Soh (1992), it is shown that 
only four ot the eight specified complex 
coefficient polynomials of Soh and Berger 
(1988b) are required when is the damping 
cone above. 

To motivate the next theorem, we take note of 
the fact that the open left half plane is the only 
strong Kharitonov region which we have 
identified thus far. The next result, due to Soh 
and Foo (1990) and Rantzer (H^O), strengthens 
an earlier result by Soh and Berger (1988a). We 
see below that there is a large class of damping 
cones which also enjoy the strong Kharitonov 
region property. 

Theorem 6.4. (Soh and Foo (1990) and Rantzer 
(1990)) Consider an nth order interval polyno¬ 
mial family ^ having invariant degree and a 
damping cone ® with angle (p - {nj 912 ) 71 ^ Jill 
where n, and 1 x 2 are relatively coprime positive 
integers. Then, there exists a distinguished 
subset of at most 2 fi 2 extreme polynomials 
{p{s, 9 ‘), p(s, , p(j, having the 


following property: ^ is robustly ®-stable if and 
only if q') is 2 -siMe for r = 1 , 2 ,. . . , 2 /i 2 . 

It is interesting to compare Theorems 6.3 and 
6.4. If the objective is to minimize the number of 
extremes to be tested, one cannot say that either 
theorem is uniformly better. For extimple, if 
n=5 and 0 = 27r/3, then Theorem 6.3 requires 
up to 12 extremes but Theorem 6.4 requires six 
extremes. On the other hand, if /i = 9 and 
0 = then Theorem 6.3 requires up to 20 
extremes and Theorem 6.4 requires up to 360 
extremes. For further results aimed at reducing 
the number of extremes to be tested, see Soh 
and Foo (1991). 

7 . THE EDGE THEOREM A BRIDGE TO 
FURTHER RESULTS 

There are a number of extreme point results in 
the literature whose proofs rely on the 
celebrated Edge Theorem of Bartlett et al. 
(1988). Looselv speaking, the Edge Theorem 
tells us the following: For robust ^-stability 
problems with affine linear uncertainty struc¬ 
tures, we need only check IS)-stability of all 
polynomials associated with the edges of the Q 
box. If all polynomials corresponding to edge 
points of Q are 2)-stable, then the entire 
polynomial family is robustly 2)-stable. Since the 
edges of Q are only one-dimensional, we obtain 
a dramatic reduction in computational com¬ 
plexity. On the downside, however, note that if 
then this box has as many as 
edges. We now make these ideas more preci.sely. 

Given a polytope of polynomials 
{p(‘» ^ define its exposed edges of 

this set of functions via the natural isomorphism 
between and its coefficient space repre¬ 
sentation; i.e. recalling the discussion in Section 
3 , if a(q) is the coefficient vector for p(s, q), the 
coefficient set a(Q) is a polytope obtained by 
taking the convex hull of the a{q'). We now use 
the exposed edges of the polytope a{Q) to define 
exposed edges of ^ in the obvious way. That is, 
for each a* which lies on the exposed edge of 
fl((?), we obtain some q*eQ and an edge 
polynomial p{s, q'^) in ^ with coefficient vector 
a{q*) = a*. When we refer to the exposed edges 
of a{Q) above, we mean the one-dimensional 
exposed faces; e.g. see Rockafellar (1970). 
Using the affine linear uncertainty structure and 
some standard convex analysis arguments, it is 
not difficult to prove that every polynomial lying 
on an exposed edge of ^ is obtained from an 
exposed edge of Q. However, an exposed edge 
of Q does not necessarily induce an exposed 
edge of 9^. We are now prepared to provide the 
most well-known version of the Edge Theorem. 
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Fig. 5. Fictitious plant for .solution of an edge problem. 


Theorem 7.1. (Bartlett et al. (1988)) Let ^ be a 
polytope of polynomials with invariant degree 
and assume that 3i cC is open and simply 
connected. Then is robustly ^-stable if and 
only if each of the exposed edges of 9 ** is 
S)-stable. 


When applying the Edge Theorem, it is often 
easier to work with the exposed edges of Q in 
lieu of the exposed edges of By working with 
exposed edges of Q, however, we accept the 
possibility of redundancy in the robustness test; 
i.e. we may end up testing a superset of the 
exposed edges of In summary, by working 
with edges, we reduce the robust ^-stability 
problem for a polytope of polynomials to a finite 
number of one-dimensional edge problems. 
Each of these one-dimensional edge problems 
can be solved by a variety of classical methods. 
For example, if q'' and q'^ are two extreme 
points of Q whose convex combination describes 
an exposed edge of Q, then a robust i^-stability 
test amounts finding the roots of the polynomial 

- (1 - 9 ") + ^p{i', q'% 

for A 6 [0, 1]. By dividing by kp{s, q'-) above, it 
becomes clear that the edge problem reduces to 
a classical root locus plot for the fictitious plant 




pi^’ q") 
pis, q'‘)' 


which is compensated via unity feedback as 
shown in Fig. 5. 


8 THE 16 PLANT THEOREM 
To motivate the results surveyed in this 
section and the next, we begin with an interval 
plant family described by 

Pis, q, -, 

1=0 

and return to the case when the desired root 
location region ^ is the open left half plane. 
Recalling the notation P{s, 9 *', r*^) for the plant 
associated with the extreme point pair r'^) e 
Qx R, the fundamental question which we now 
address is: If a compensator C(j), connected as 


in Fig. 1, stabilizes each of the extreme plants 
P(s,q'\r‘^), does it follow that C( 5 ) robustly 
stabilizes the entire family 


Thirty two edges 

In order to address this question, we bring the 
result of Chapellat and Bhatacharyya (1989) into 
play. That is, if the closed loop polynomial has 
invariant degree, we do not need to check all the 
edges in order to ascertain robust stability. It is 
necessary and sufficient to check a set of 32 
distinguished edges. These 32 edges are dubbed 
Kharitonov segments in Chapellat and Bhat- 
tacharyya (1989); note that the number 32 is 
independent of m and n. These distinguished 
edges are obtained as follows: We freeze the 
plant numerator at some Kharitonov polynomial 
and consider four denominator edges of 
the form (1 - A)Z),,(^) + where D,^{s) and 

D,^{s) are Kharitonov polynomials for the 
denominator having either the same even order 
coefficients or the same odd order coefficients. 
Similar, if the plant denominator is fixed at some 
Kharitonov polynomial D,,{s), we obtain four 
numerator edges of the form (l-A)N,,(j) + 
AA^,,(j) where N,^{s) and N,^{s) are Kharitonov 
polynomials for the numerator having either the 
same even order coefficients or the same odd 
order coefficients. 

We see below that the A sweep associated with 
these edge stability problems can be eliminated 
when the compensator is first order; i.e. instead 
of checking 32 edges, one can work with 16 
specially designated extreme plants. 

Theory for first order compensators 

The papers by Hollot and Yang (1990) and 
Barmish et al. (1992) provide an avenue for 
extreme point analysis in lieu of the 32 edges 
above. We continue to consider an interval plant 
family {P{’, q, r):q e Q\r e R}, assume all 
members of ^ are strictly proper and for 
simplicity, we take D{s, r) to be monic. Now, 
with a first order compensator 

(i-p) 

as connected as in Fig. 1, we say that C(j) 
robustly stabilizes ^ if the resulting family of 
closed loop polynomials described by q e Q, r e 
R and 

Peds, q. r) = Kis - z)N(j, q) + (j -p)Dis, r), 

is robustly stable. With the setup above, Hollot 
and Yang (1990) establish the following result: 
The first order compensator C(j) robustly 
stabilizes 9 if and only if it stabilizes each of the 
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plants P{s, q'', r‘^) associated with extreme point 
pairs iq‘',r*^)eQxR; i.e. for each such pair, 
the associated closed loop polynomial 
Pcl(s. r‘‘) is stable. Notice that if the plant 
numerator has degree m and the plant 
denominator has degree n, this result can 
involve as many as 2’”'^"*' extremes. We now 
describe a stronger result for the identical 
problem. 

The salient feature in the paper of Barmish et 
al. (1992) is a set of 16 Kharitonov plants instead 
of plants. These 16 plants are obtained 

by generating the four Kharitonov polynomials 
Ni(i), and Ni{s) for the numerator 

and the four Kharitonov polynomials D,(r), 
D 2 (r), and /) 4 (r) for the denominator. 

Subsequently, we obtain 16 distinguished plants 
defined by 


for I'l, *2 = 2, 3, 4. The importance of these 16 

distinguished plants was hrst pointed out by 
Mori and Barnett (1988) and Chapellat and 
Bhattacharagya (1990) in a rather different 
context involving unstructured perturbations; see 
Section 11 for further details. 


Theorem 8.1. (Barmish et al. (1992)) Consider 
the strictly proper interval plant family ^ with 
monic denominator and the first order compen¬ 
sator C(.y). Then C(.y) robustly stabilizes ?? if 
and only if it stabilizes each of the associated 16 
Kharitonov plants. 

To conclude this subsection, we note that the 
Theorem 8.1 can be refined when more a priori 
information is assumed about the signs and the 
magnitudes of K, z and p. For such cases, it can 
be shown that only eight or 12 plants are 
required. 

Higher order compensators 

To see that an extreme point result is not 
always guaranteed for higher order compen¬ 
sators, we consider the simple interval plant of 
Hollot and Yang (1990) described by 


+25,) ■ 

and 9 6 [1, 5000]. Now, with compensator 

=_(j + 3)(j + 4) 

(^ + 0.1)(j+0.2)(i + 75)’ 


connected as in Fig. 1, it is easy to prove that 
C(j) stabilizes the two extreme plants P{s, 1) 
and Pis, 5000) but the closed loop system is 
unstable for q =2. In view of examples of this 


sort, it is of interest to delineate new 
compensator classes which lead to extreme point 
solutions in the interval plant framework. 
Recalling that we can reduce the robust stability 
problem to 32 edge problems for compensators 
of arbitrary order, there is considerable motiva* 
tion to pursue extreme point results involving 
convex combination of only two polynomials. 
We now pursue this topic. 


9. CONVEX DIRECTIONS 
The results in this section are aimed at 
enlarging the class of polytopic robust stability 
problems for which an extreme point result 
holds. In view of the Edge Theorem, we 
concentrate on one parameter problems and 
begin the presentation with a definition due to 
Rantzer (1991). 


Definition (convex direction) 

A monic polynomial g(.r) is said to be a 
convex direction (for the space of stable nth 
order polynomials) if the following condition is 
satisfied: Given any stable nth order polynomial 
f(s) such that fis)+g(s) is also stable and 
deg if(s) + Ag(j)) = n for all A e [0, 1). it follows 
that f(s) + Ag(j) is stable for all A e [0, Ij. 

The convex direction concept is depicted 
graphically in Fig. 6. From the figure, it is 
apparent that g|(\) is a convex direction because 
f (s) + ygi(j) remains within the stable set for all 
y>0. On the other hand, g 2 (^) is not a convex 
direction. The important point to note is that 
convex direction results can readily be reinter¬ 
preted in an extreme point context; i.e. if g(.v) 
is a convex direction and 0 s y, < y^, then stability 
of f(s) + y,g(s) and f{s) -I- y 2 g(.v) implies stabi¬ 
lity of /(.v)yg(.s) for all yely,. yjj. We now 
use the convex direction verbiage for discussion 
of existing literature. 

As seen via a .simple example in Bialas and 
Garloff (1985), not all directions g(i) are 


lJn§table PolynDmiaU 



Fig. 6 . Convex directions in the space of polynomials. 
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convex; i.e. if /■(s) = 10s^ + x^ + 6i -l-0.57 and 
g(5) = + 2 j +1 , it follows that f(s) and 

f{s) + g(j) are stable, f{s) + kg{s) has constant 
degree for all Ae[0, IJ but f(s) + ig(s) is 
unstable. Said another way. Bialas and Garloff 
(1985) emphasize the well-known fact that a 
convex combination of stable polynomials need 
not be stable. This raises the following problem; 
Provide conditions under which g(s) is a convex 
direction. An important paper by Rantzer 
(1992a) provides an elegant solution to this 
problem. We refer to the condition in the 
theorem below as Rantzer’s Growth Condition. 


Theorem 9.1. (Rantzer (1992a)) A polynomial 
g(s) is a convex direction for the space of stable 
nth order polynomials if and only if 


do) 




sin 2ji^g(j(o) 
2 ( 1 ) 


for all w > 0 such that g{j(o) # 0. 


The theorem above is quite useful for 
explanation of existing results and delineating 
new classes of convex combination problems 
which admit extreme point solution. For 
example, if we consider the setup in Theorem 
9,1 but allow the compensator C(s) = 
Nf(s)/Dc(s) to be of arbitrary order, the 
satisfaction of Rantzer’s Growth Condition leads 
to the following result: Robust stability o/ 16 
interconnection pairs (P/^ /^(s), C(s)) correspond¬ 
ing to the Kharitonov plants implies robust 
stability for all interconnection pairs (P(s), C(j)) 
with P(s) e For example, the compensator 


we can interpret the Kharitonov analysis in the 
convex direction framework by taking g(s) to be 
a polynomial containing only either all even or 
all odd powers of s. Such a g(s) turns out to be a 
convex direction and the associated extreme 
point result for /(f) + Ag(f) can be exploited in a 
proof of Kharitonov’s Theorem. In a similar 
manner, one can interpret the extreme point 
results of Bialas and Garloff (1985); i.e. again 
take g{s) to be a polynomial containing only 
either all even or all odd powers of s. To recover 
the result of Petersen (1990), g(s) is taken to be 
antistable (all roots in the open right half plane) 
and Rantzer’s Growth Condition is verified by 
inspection. Basic to the paper by Hollot and 
Yang (1990) is a convex direction of the form 
g(5) = j*(f + tr) where a is real. A minor 
extension of this form is exploited by Barmish et 
al. (1992); they work with a convex direction of 
the form g(f) = (f + a)h(f) with a real and h{s) 
containing only all even or all odd powers of s. 

Motivation for the paper by Barmish and 
Kang (1992) is derived from the fact that a 
systematic transcription of the growth condition 
into a direct condition on the coefficients of g(f) 
appears to be very complicated. This motivates 
seeking a simpler convex direction criterion 
which involves the coefficients of g(f) in a more 
straightforward way. With this as the goal, the 
theory of Barmish and Kang begins with a monic 
polynomial 

g{s) = j"' + 5 a,s', 

and the associated Hurwitz matrix 


f’ - f” - f ^ - f ^+f 1 

~ s'- - O Sj" + ? - 1.5f’ - 4f^ - f - 4 ’ 

is readily verified to have numerator and 
denominator satisfying the growth condition. 
Hence, if C(5) is connected to an interval plant 
as in Fig. 1, robust stability of the closed loop is 
ascertained by simply checking the stability of 16 
closed loop systems. 

It is easy to verify that Rantzer’s Growth 
Condition captures earlier extreme point results 
such as those as given in Rantzer (1990) and Fu 
(1991). For example, if 4g(y(y) is nonincreasing, 
then the growth condition is trivially satisfied. Of 
course, since the growth condition is both 
necessary and sufficient for g{s) to be a convex 
direction, it also captures (in a less explicit way) 
other coefficient-based convex direction results 
in the literature. For the sake of completeness, 
we review these results because they provide a 
spectrum of examples of convex directions. 

Beginning at the level of interval polynomials, 
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Taking A,(g), Az(g), .... A„(g) to be the 
leading principal minors of ^(g), we say that 
g{s) satisfies the Alternating Hurwitz Minor 
Condition (AHMC) if at least one of the 
following two conditions hold. 

(i) The associated odd order Hurwitz minors of 
g{5) alternate according to the sign pattern 


A,(g)<0, A,(g)>0, 

A,(g)<0, A7(g)>0. 
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(ii) The associated even order Hurwitz minors 
of g(i) alternate according to the sign pattern 

A2(^)<0, A4(g)>0, 

A,(g)<0, A,(g)>0. 

For the degenerate case when m = 1, we adopt 
the convention that (ii) holds vacuously. Finally, 
if for some k^O, either condition (i) or 
condition (ii) holds except for the fact that 
A,(g) = 0 for i s k, then g(j) is said to satisfy the 
extended AHMC. A simple convex direction 
condition can now be given. 

Theorem 9.2. (Barmish and Kang (1992)) 
Suppose g{s) is a monic polynomial of degree m 
which satisfies the extended AHMC. Tlien g{s) 
is a convex direction for the space of 
polynomials of order n>m. 

The application of the AHMC to SISO 
feedback systems leads to a number of extreme 
point results. For example, it is easy to verify 
that numerator and denominator of the 
compensator 

^ 1 0(^+4)(^ +!)(.>-2) 

- 3a- + 4)(a + 2)(a - 3) ’ 

satisfy AHMC conditions (ii) and (i), respec¬ 
tively. Therefore, if the plant is strictly proper 
with monic denominator, it is easy to verify that 
C(5) robustly stabilizes an interval plant family 
if and only if it stabilizes the associated 16 
Kharitonov plants. 


Then, we obtain a simple matrix analogue of 
Kharitonov's Theorem; ^ is robustly stable if 
and only if four distinguished extreme matrices 
are stable. Of course, the four distinguished 
matrices are obtained from the Kharitonov 
polynomials associated with the interval polyno* 
mial family {p(*, q) :q € Q}. 

There are also some trivial matrix analogues 
of robust stability results for polytopes of 
polynomials. For example, if q enters afline 
linearly into only a single row or column of 
A{q), then the resulting characteristic polyno¬ 
mial p{s, q) turns out to have an afiine linear 
uncertainty structure and many results already 
surveyed can be restated in a matrix setting. 

In fact, there is a class of rank one matrix 
uncertainty structures which permit linkage with 
the theory for polytopes of polynomials. This 
class is nicely characterized in the dissertation by 
El Ghaoui (1990). Rewriting A{q) in the form 

e 

A(q) = A„+'Z 

with A, fixed for i = 0, 1, 2, , . . , the following 
result holds: The characteristic polynomial 
p(s, q) = (id {si ~ A{q)) has affine linear uncer^ 
tainty structure if and only if 

min {rank [A,, A 2 , . . - , A^], 

rank [A\, Az, . . . , A'^]} = 1, 

where A' denotes the transpose of A,. 


10 RESULTS AT THE MATRIX LEVEL 
The motivation for this section is derived from 
slate space systems which are described in terms 
of the vector of uncertain parameters q\ we 
consider 

x(t) = Aiq)xU), 


and call A{q) an uncertain matrix. In view of the 
discussion in the preceding sections, there are a 
number of “quick and dirty” results which can 
be obtained by noting that the polynomial 

pis, q) = dct isl-Aiq)), 


governs the lobust ^-stability for the family of 
matrices sd = {Aiq):q e Q}. For example, sup¬ 
pose Aiq) is a companion canonical form; e.g. 
say 

0 1 0 .] 

001 0 


Aiq) = 


0 


fn 


0 

9i 


0 

92 


1 

qn-\. 


Interval matrices 

In this subsection, we work with matrices 
having uncertain parameters each entering into 
one entry at most. For convenience, we use a 
double subscripting notation and denote the 
(i, y)th entry of A(^) by q,^. This enables us to 
use a compact notation to describe an interval 
matrix family, i.e. if [q^.q^j] is a given bound 
for q,j, we can use a notation similar to the one 
employed for interval polynomials. For example, 
we can describe a 2 x 2 interval matrix family by 
writing 


. [kn. 9n] [ 912 , 91 ^ 2 ]] 

1.192..92M 192-2.9?2lJ‘ 

For an interval matrix family M, we let A, 
denote the eth extreme matrix; e.g. if q‘ is the (th 
extreme point of Q, we can take A, = A(q'). The 
first point to mention is that for interval 
matrices, extreme point results for robust 
stability are the exception rather than the rule. 
For example, the 3x3 interval matrix of 
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Barmish and Hollot (1984) 


A{q) = 


[-1.5, -0.5] -12,06 -0.06' 
-0.25 0 1 

0.25 -4 -1 . 


has stable extremes and A 2 but is unstable 
when 911 - -1; see also Karl et al, (1984) for a 
similar example. 

It is interesting to note that a 3 x 3 matrix was 
used above. At the level of 2 x 2 matrices, if we 
use the fact that stability is equivalent to 
positivity of the coefficients of the associated 
second order characteristic polynomial, we can 
easily obtain an extreme point result; e.g. in 
Mansour (1989) it is seen that stability of the set 
of 2 X 2 extreme matrices {Ai} is equivalent to 
robust stability of si. As far as 3 x 3 interval 
matrices are concerned, the extreme point result 
of Zong (1990) can be applied provided the 
diagonal elements of si are fixed; i.e. if = q^, 
for / == 1 , 2 ,3, then robust stability of the 
extreme matrices {A,} is equivalent to robust 
stability of jz/. At the level of 4 x 4 matrices and 
above, extreme point results are available only 
under rather strict conditions. 


Poly topes of matrices and Lyapunov functions 

If the entries of A(q) depend affine linearly on 
q, then the matrix family si = {A{q):q e Q} is 
called a polytope of matrices or a polytopic 
matrix family. The fact that si is polytopic is 
explained by noting that if q^ is the ith extreme 
point of Q and again, we take A, = A(q^), then 

si = conv {Ay}. 

Since the interval matrix problem is a special 
case of the poly tope of matrices problem, we 
cannot expect extreme point results at the level 
of 4 X 4 and above without enforcing additional 
conditions. Since such conditions are quite 
restrictive, we only provide a sampling of the 
available results along these lines. 

The following result is given by Shi and Gao 
(1986): If si is a polytope of symmetric matrices, 
then stability of the set of extreme matrices {A,} is 
equivalent to robust stability of si. This is easily 
established by observing that if the extremes are 
stable, then with P — P' — l, we have AyP + 
PAy <0, That is, the identity serves as a common 
Lyapunov matrix for the extremes. Using the 
fact that positively weighted sums of negative- 
definite matrices are still negative-definite and 
noting that every A € is a convex combination 
of the A,, it follows that AT-l-PA<0 for all 
A e si. Hence P = / serves as a common 
Lyapunov matrix for all of si\ e.g. see Fu 
(1987). 


The usefulness of common Lyapunov func¬ 
tions in an extreme point context is not 
particular to the symmetric matrix analysis. 
Notice that we can easily modify the argument 
above so that it applies to a general polytope of 
matrices; e.g. in the analysis of Horisberger and 
Belanger (1976), we see that if a polytope of 
matrices si (not necessarily symmetric) admits 
common Lyapunov matrix P = P' > 0 for its 
extremes, this same matrix serves for all of si. 
Furthermore, it can also be shown that the 
problem of finding a common Lyapunov matrix 
for the extremes is a convex program; see Boyd 
and Barratt (1990) for further details. 

More generally, if si - {a(q) :q e Q} is a 
family of matrices with entries depending 
multilinearly on q, similar extreme point results 
are obtained. Along these lines, the result of 
Garofalo et al. (1992) is the most general; If each 
entry of A{q) is a ratio of multilinear functions of 
q, then once again, a common Lyapunov matrix 
for the extremes of Q serves as a common 
Lyapunov matrix for all of si. 

Function maximization on matrix polytopes 

In some applications, we begin with a 
polytope of matrices si = {A{q)\q eQ) and a 
convex function /: R which represents some 

aspect of performance. Using the basic fact that 
a convex function on a polytope is maximized at 
an extreme point, we conclude that 

max/(A) = max/(c/'). 

^4 I 

Using this simple idea, a number of interesting 
“small" results can be obtained. For example, if 
^ is a symmetric polytope of matrices, we can 
reproduce the extreme point results given in 
Section 10. These ideas also lead to a simple 
proof of a result given in Mori and Kokame 
(1987); i.e. let |H| be any matrix norm, take 
y >0 as given and consider an interval matrix 
family .si = {A{q):q e Q}. Then using /(A) = 
||A|| above, it follows that ||A|| < y for all A g ^ 
if and only if ||A(q')|| < y for all extreme points 
9 ' of Q. 

Robust nonsingularity 

Another interesting set of results at the 
matrix level exploits a certain relationship 
between robust stability and robust nonsin¬ 
gularity. To clearly explain this linkage, we take 
si = {A{q) :q e Q) be a family of n x n matrices 
with A{q) depending continuously on q. Under 
rather mild conditions, there is a family of linear 
transformations (for example, see Bialas (1985) 
and Fu and Barmish (1988)) mapping si into a 
new family M having the following property. The 
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family ^ is robustly stable if and only if the 
family ^ is robustly nonsingular; Le. all matrices 
are nonsingular. 

One well-known example of such a transfor¬ 
mation involves the Kronecker operations. 
Indeed, if A and B are square matrices of 
dimension n, and n 2 . respectively and is the 
(i, ;)th entry of A, the Kronecker product A® B 
is a square matrix of dimension with (i, / )th 
block OijB. Subsequently the Kronecker sum is 
defined by 

where and denote identity matrices of 
dimensions ni and respectively. With this 
notation, we define a linear transformation T on 
a polytope of matrices si as follows: If a 6 
then 

TA^A^A. 

The important point to note is that any linear 
transformations on si preserves the affine linear 
dependence of matrix entries on the uncertain 
parameters. 

For the reader interested in further details on 
Kronecker operations, the review paper by 
Brewer (1978) is recommended. We nov\ present 
a result which applies to nonsingularity of 
interval matrices. Using the basic fact that a 
multilinear function on a box achieves both its 
maximum and minimum at an extreme point, the 
following lemma is easy to prove by noting that 
det A depends multilinearly on the entries of A, 

Lemma 10.1. Suppose si = {A(q):q e Q} is an 
interval matrix family. Then .rf is robustly 
nonsingular if and only if each of the 
determinants det A(^') has the same sign. 

A weakness of this result is the “combinatoric 
explosion” associated with the solution. To 
illustrate what is meant by this, notice that if si 
is an n X Ai interval matrix family, there can be as 
many as 2"' extreme points. The results of Rohn 
(1989) enables us to reduce ^his number to 4". 

Rohn's framework 

We begin with a nonsingular n x n matrix A,, 
and given bounds > 0 for the entries q,^ of an 
interval matrix AA(^). We now define the family 
of matrices 

{Ao+ eAA(q):0< f <r, q eQ), 

with variable magnification factor r ^iO. The 
objective is to obtain the robustness margin 

Tmax “ sup [r \ sAr is robusily nonsingular}. 

Taking R to be the n x n matrix having (i, /)th 


entry r,y, we provide some definitions needed in 
order to describe Rohn*s result; see also 
Demmel (1988). Indeed, if M is a square matrix, 
let 

Pd(M) — max (|A|: A is a real eigenvalue of Af }, 

with po(Af) = 0 if no eigenvalues of Af are real, 
A square matrix S is said to be a signature matrix 
if it is diagonal with all diagonal entries equal to 
either -fl or -1. Now, we let ^ be the set n x n 
signature matrices and observe that y has 2” 
members. We are now prepared to present 
Rohn’s Theorem. 

Theorem 10.2. (Rohn (1989)) Given the 

interval matrix family i^ robustly nonsingular 
if and only if detAo and the 4” determinants 
del (A(, 4- rSxRSf), obtained with 5,, iz ^ have 
the same nonzero sign. Moreover, 

__ 1 _. 

^"'*“max po(5,Ao'M) 

Connection with p theory 
In this brief subsection, we mention an 
interesting connection between Rohn’s result 
and the computation of the real structured 
singular value (real ju) for the case when the ^ 
norm is u.sed for the uncertain parameter vector 
q. To this end, we begin with a real square rank 
one matrix M and the goal is to compute 

\ =inf{|| 9 lU:det (/+ MA(,/)) = 0}. 

where ||^||.., denotes the norm of q and 

A(^) = diag(^|',. 92 ,- q,}. 

In the numerical computation of /r.(A/), the 
following lower bound has traditionally been 
used: 

max 

S € ‘J 

In view of the extreme point theory of Rohn 
(1989), it can be shown that the lower bound is 
sharp. For further discussion of extreme point 
results in this framework, see also Holohan and 
Safonov (1989), El Ghaoui (1990) and Chen et 
al. (1992). 

Stability and Schur stability of non-negative 
interval matrices 

In this subsection, we survey some extreme 
point results on stability and Schur stability of 
interval matrices with non-negative entries. The 
motivation for this problem is derived from a 
number of different fields. For example, this 
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problem arises if we consider the transition 
matrix of a Markov chain with non-negative 
switching probabilities only known within given 
bounds. We begin the exposition by taking A to 
be an n x n square matrix with non-negative 
off-diagonal entries. Letting A,, denote the upper 
ky.k block of A for k = \,2,. .. ,n, in 
accordance with classical results from matrix 
algebra (for example, see Gantmacher (1959)), it 
follows that A is stable if and only if 

(-1)* det(4-/l*)>0, 

lot k = \, 2,n where /* denotes an identity 
matrix of dimension k. A similar result holds for 
Schur stability with the added restriction that the 
diagonal entries of A are non-negative and the 
determinant condition above is replaced by 
det (4 — A^) > 0, 

If we consider an n x n interval matrix family 
si = {A{q)\q eQ) with non-negative off- 
diagonal entries, we can exploit the result above 
in combination with the arguments immediately 
preceding Lemma 10.1 to arrive at the following 
extreme point result: si is robustly stable if and 
only if 

(-l)*det( 4 -A*( 9 ‘))> 0 . 

for all extreme points q' of Q and all 

ke{l,2 . n}. Furthermore, using ideas 

quite similar to those in Theorem 10.2, the 
number of critical extreme points can be 
dramatically reduced. For the case of robust 
Schur stability, minor modifications of the 
arguments above lead to a similar result given by 
Shafai et al. (1991). For further extensions 
involving irreducible interval matrices, see 
Mayer (1984). ' 

11. OTHER ASPECTS OF PERFORMANCE 
Over the last few years, we see the beginning 
of a collection of extreme point results involving 
other aspects of robust performance besides 
robust ^-stability. The first result which we 
mention involves W* analysis with structured 
real uncertainty. For completeness, recall that if 
P{s) is proper, stable and rational, then the H‘ 
norm is given by 

\\P\U = sup\P{j0})\. 

Hi 

Theorem 11.1. (See Mori and Barnett (1988) 
and Chapellat et al. (1990)) Consider a robustly 
stable proper interval plant family .9* = 
{P(s, q, r):q e Q, r e R} with monic de¬ 
nominator and 16 associated Kharitonov plants 
Py{s) for i, j = 1-4. Then it follows that 

max ||P(-, q, r)||. = max ||F1,|U. 

</6(?,reK i.j 


The ideas central to the proof of Theorem 
11.1 enter into the proof of a number of closely 
related results. For example, in Chapellat et al 
(1990), a real parameter version of the Small 
Gain Theorem is established and in Mori and 
Barnett (1988), a robust Popov-like criterion is 
provided in the interval plant context. These 
results are extended in Dahleh et al (1991), 
Vicino and Tesi (1991) and Rantzer (1992a). It is 
also interesting to point out that the results of 
the sort above are not readily extendable to a 
more general framework involving weighting 
functions or restricted frequency intervals; e.g. 
see the paper by Hara et al (1991) for further 
discussion. 

Positive‘realness 

The issue of positive-realness (addressed in 
the classical Popov theory) is studied in an 
extreme point context by Dasgupta and 
Bhagwat (1987). If P{s, q)- N(s, q)!D{s, q) is 
an uncertain plant and q is fixed, we say that this 
transfer function is strictly positive real (SPR) if 
both N(s, q) and D{s, q) are stable and 
Re P{jv)y <?) > 0 for all to e R. We say that a 
family of plants SP = {P{', q):q e Q} is robustly 
SPR if P{Sy q) is SPR for all qeQ, 

Theorem 11.2. (Dasgupta and Bhagwat (1987)) 
Consider the interval plant family with fixed 
numerator N{s, q) '^ N{s). Letting Di(j), D 2 (s), 
D^(J) and D^{s) denote the four Kharitonov 
denominator polynomials, it follows that ^ is 
robustly SPR it and only if N{s)/DXs) is SPR for 
/= 1-4. 

In their paper, Dasgupta and Bhagwat also 
point out the applicability of these results in an 
adaptive output error identification context. 
There are also a number of papers dealing with 
extensions and variations on the SPR theme. In 
papers by Dasgupta (1987), Bose and Delanski 
(1989), Chapellat et al (1991) and Shi (1991), 
the SPR problem is studied under the weaker 
hypothesis that the numerator N{s, q) is un¬ 
certain as well; in some cases, plants with 
complex coefficients are considered. It is also 
worth noting that extreme point results for SPR 
problems with multilinear uncertain structures 
are also available. However, instead of a four 
plant result as in Theorem 11.2, the plants 
associated with all of the extreme q* come into 
play; see the recent paper by Dasgupta et al 
(1991). Finally, we mention extreme point 
results for the case when the plant is not 
necessarily SPR but can be rendered SPR by 
suitable addition of a positive constant; see 
Chapellat et al (1991). 
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Steady state and overshoot 
In the work of Bartlett (1990), an uncertain 
plant P(j, q) with multilinear uncertainty struc¬ 
ture is considered and steady state error for a 
unit step input is the prime consideration. A 
robust stability assumption is imposed and it is 
shown that both the maximum and the minimum 
of the steady state error occur on one of the 
extreme plants P{s, q'). Bartlett then goes on to 
provide a counterexample to the tempting 
conjecture that the maximal peak overshoot is 
also attained at an extreme. To this end, he 
considers the family of plants with affine linear 
uncertainty structure described by 

^ i3Aq + 0.1)j" -I- (1.7^ + 0.8)s -I- 1 ’ 

and q e [0, 1]. It turns out that the peak 
overshoot corresponding to a step input is not 
maximized at the extremes q = 0 or ^ = 1; e.g. 
^=0.5 leads to a higher overshoot value. A 
similar example is also given for discrete time 
systems. 

The Nyquist envelope 

To complete this section, we briefly discuss an 
emerging line of work related to the frequency 
response of an interval plant. Indeed, suppose 
that {P{s, q, r)\q € Q. r € R] is a strictly 
proper interval plant family with monic de- 
nominatoi and, to keep the exposition simple, 
we also assume that for all r e R, the plant 
denominator D{s, r) has no roots on the 
imaginary axis. Then, the Nyquist set associated 
with this family of plants is defined by 

= {P(j(o, q, r): oj q € Q: r e Rj. 

In Hollot and Tempo (1991), the boundary 
of the Nyquist set, called the Nyquist envelope, 
is the focal point and the following question is 
addressed. What points z e on the Nyquist 
envelope are Kharitonov points? By this, wc 
mean points z e such that z = P,^ ^J^j(x)) for 
one of the 16 Kharitonov plants P,„,^{5) and 
some (0 G R. If ^ is robustly stabilized by unity 
feedback, then the distinguished points on the 
envelope include minimal gain margin points and 
minimal phase margin points. If P{s, q) is (open 
loop) stable for all q the points associated 
with the maximum W norm also lie on the 
Nyquist envelope. Such a point is identified with 
a 9 * G 0 and an o)* g R such that 

\PU(D*, q*)\ = max \P{j(JD, q)\. 

q. M 

Recalling Theorem ll.l and the discussion in 
Section 5, we already know that the gain margin 
point and the point are Kharitonov points. 



Fig. 7. Gam margin, phase margin anil H’* points are 
Kharitonov points. 

The paper by Hollot and Tempo (1991) 
provides a characterization of a class of 
Kharitonov points via a certain “path condi¬ 
tion'’. Their theory identifies some new Kharito¬ 
nov points and recovers the ones which wc 
already know. The fact that critical points 
associated with minimum phase margin, sen¬ 
sitivity and complementary sensitivity are iden¬ 
tified as Kharitonov points seems to be new. In 
this regard, see also the paper by Kimura and 
Hara (1991) where a more restrictive class of 
interval plants with fixed numerator is con¬ 
sidered. To further clarify the geometry 
associated with the discussion above, we refer to 
Fig. 7 where three distinguished points on the 
Nyquist envelope are depicted. 

12 RESULTS FOR OTHER UNCERTAINTY 
REPRESENTATIONS 

Throughout this paper, it has been assumed 
that the uncertainty bounding set ^ is a box. In 
this section, we consider two alternative 
descriptions of Q, In each case, new extreme 
point results are obtained. 

The rotated rectangle for robust Schur stability 

The motivation for the exposition in this 
subsection is derived from the fact that stringent 
assumptions are imposed in Section 6 in order to 
obtain extreme point results for robust Schur 
stability of an interval polynomial family. In the 
work of Kraus et al (1988b), a new uncertainty 
model is considered. These authors begin with 

n 

an uncertain polynomial p{s, <?) = E qts^ having 

independent uncertainty structure but dispose 
with the box bound Q for q. Instead, they 
consider a rotated rectangular bound of the sort 
shown in Fig. 8 ; other quadrants for this 
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rectangle are also permitted. For the case when 
n is even and i = nl2, an interval [q^n,qt,ii\ is 
taken as a bound for q„, 2 . When the uncertain 
parameters are bounded via a set Q in the 
manner above, we say that Q is a product of 45“ 
rotated rectangles. As usual, we let {q') denote 
the set of extreme points of Q. With this new 
setup, rather strong extreme point result is given 
below. 

Theorem 12.1. (Kraus et al. (1988b)) Consider 
the uncertain family of polynomials = 
{p(-,q):qeQ} with invariant degree and Q 
being a product of 45° rotated rectangles. Then, 
it follows that is robustly Schur stable if and 
only if p(s, q') is Schur stable for all extreme 
points q' oi Q. 

In a follow-up paper by Mansour et al. (1988) 
even stronger results are established. Under the 
hypotheses of the theorem above, they show that 
a “small” subset of the extreme polynomials 
need only to be considered. Furthermore, 
working with Chebyshev and Jacobi polyno¬ 
mials, they develop a recipe for selection of 
these distinguished extremes. More recently, a 
novel technical method (involving barycentric 
coordinates) of Perez et al. (1992) has resulted in 
extreme point results for classes rotated 
rectangles other than 45°. For angular rotations 
in the sector [jr/4, 3;r/4), results along the lines 
of Theorem 12.1 are given. 

Diamond families of polynomials 

In a paper by Tempo (1990), a robust stability 
problem is formulated which can be viewed as a 
dual to the problem of Kharitonov. Whereas the 
interval polynomial framework is associated with 
the norm for q, Tempo considers the norm 
and works with an uncertain polynomial 

n 

p(.^> ?) * £ qi^' with independent uncertainty 
structure and uncertainty bounding set Q to be 


the diamond described by 

I tO 

n 

We call p(s, q*)= E q*s' the center polynomial 
1 = 1 ) 

and refer to {/?(■, q).q eQ) as a diamond 
family of polynomials. Associated with this 
family ^ is a set of eight critical polynomials 
defined by 

c,is)=p{s,q*) + r- 
C 2 is) = pis, q*)-r-, 
c,(.r) = p(.v. ?*) + rr; 

Q(-v)-P(s. q*)-rs; 

<^’iis)=pis, q*) -I- rs"~'\ 

Ctis)=p{s, q*)- kk"-'-, 

£’7(-5)-pC'’. ?*) + «"; 
c»(s)=p{s, q*)-rs’‘. 

In the theorem below, we see that the critical 
polynomials c,{s) play the same role in the 
analysis as the Kharitonov polynomials /C,(5) 
play in the analysis. 

Theorem 12.2. (Barmish e( al. (1990)) A 
diamond family of polynomials having 
invariant degree is robustly stable if and only if 
the associated eight critical polynomials 
c,(j), C 2 {s), . . . , Cnis) are stable. 

Unlike Kharitonov’s Theorem, the extreme 
point results for the case above do not easily 
extend to weighted norms. To demonstrate this 
point, we consider the weighted diamond family 
of polynomials in the dissertation of Kang 
(1992): The center polynomial is given by 
p{s, q*) = -{■ 2.20hv -b 4 and admissible 

coefficients are described by 10 ~ ^ 3 1 T 

100 \q 2 - ql\ T 100 \q, -q*\-^ 2.5 l^o - qol ^ l- 
It turns out that all extreme polynomials p{s, q‘) 
are stable but the polynomial p(j:) = 1.05 a’^ H- 
2j^T2,201.y + 4.2 is an unstable member of this 
diamond family. 

We now describe another issue associated with 
the case. When dealing with a complex 
coefficient polynomials, there are different ways 
to describe the diamond bound Q. For example, 
do we formulate the problem with separate 
diamond bounds for real and imaginary parts or 
with a “joint” bound? For the case of separate 
diamond bounds for real and imaginary parts, 
Barmish et al. (1990) consider a second order 
complex coefficient diamond family described by 
p(j, q) = q 2 S^ + + ^0 complex uncertain 

parameters = m, + ;v/ for i = 0, 1,2 and center 
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polynomial 

p(s, u*. V*) = (1.2272 + /6.3118>^ 

+ (0.0111 +/15.1285)s + (-4.3176+;!. 8398). 

Using diamond bounds |uo - | + |u, - u*| + 

1 m 2-«?|^1 and |uo-uJI + Iv. - Wi*! + luj- 
uJl ^ 1 for M and v, it is straightforward to verify 
that all extremes are stable but the polynomial 
^(i)=p(s, u*. ir*) + 0.5j-(0.5+;) is an un¬ 
stable member of this family. On the other hand, 
if we describe a diamond family by the joint 
bound 

iluo-Wol + ilwo-Wol 

+ 2 (|u,-u,*| + |u, 

( * 1 

the extreme point result of Tempo (1989) applies 
for the case of robust Schur stability. 


a transformed domain often leads to extreme 
point results which are not possible In the original 
domain. Of course, a fundamental limitation of 
these results is that the relationship between the 
transformed parameters and the original para¬ 
meters may be quite complicated. There are a 
number of other papers involving the transfor¬ 
mation point of view. For example, in Petersen 
(1989), the Delta transform (for the discrete 
time systems) is implicit in his analysis of 2 
regions which are shifted circles. For similar 
extreme point results involving Delta transfor¬ 
mation, see the papers by Soh (1991) and Hirsh 
(1992). Finally, we mention the paper by 
Vaidyanathan (1990) which considers yet a 
different transformation for discrete time and the 
paper by Wei and Yedavalli (1987) which deals 
with the continuous time case. 


Markov parameters and other transformations 
Rather than using a box to bound the 
coefficients, the paper by Hollot (1989) considers 
an uncertainty representation in the space of 
Markov parameters. To describe the unique 
correspondence between a fixed polynomial 

n 

p(s)= E and its Markov parameters, we 
first break p{s) into even and odd parts as 

p(5)=/(i^) + 5g(i^). 

and for simplicity, we take /(•) and g( ) to be 
coprime. Now, it is easy to obtain the Markov 
parameters b, via the expansion 


fix) 


= 6 _ 




If n = 2m is even, we take ( 6 (,. h,, . . . , hjm i) 
as the Markov parameters and if n = 2m — 1 is 
odd, we take (b^t, bo, . . . , b 2 „-i). Note that 
Markov parameters can be calculated in a 
variety of ways; e.g. see Gantmacher (1959). 
Hollot’s paper demonstrates the usefulne.ss of 
Markov’s Theorem in a robust stability context. 
Namely, if polynomial uncertainty is represented 
by a box B in the space of Markov parameters, 
robust stability is guaranteed if and only if two 
distinguished polynomials are stable. For ex¬ 
ample , if n = 2m is even and B described by 
b; < 6 , < 6 ,^ for I = 0 , 1 , 2. . . , 2m - 1 , the first 
distinguished piolynomial has Markov parameters 
(bo, bt, b 2 , ■ ■ ■ , 62 m-i) and the second distin¬ 
guished polynomial has Markov parameters 
(h^, b\ , 6 ^, ... , 62 »i-i)- 

The results presented in this section address 
robust stability analysis with the following idea in 
mind. An alternative uncertainty representation in 


Moving away from univariate polynomials 
We briefly mention .some research aimed at 
the attainment of extreme point results for delay 
systems and di.stributed systems. The paper by 
Hu et al. (1991) raises the question of extreme 
point results for robust stability of sampled data 
control systems. They show that the road is 
rocky via the simple example involving an 
uncertain plant described by P(s, q) = \l(5 -t- q) 
and q e [-0.5, 1]. They close the loop with unity 
feedback and A/D and D/A converters on each 
side of P(s,q) and arrive at . the uncertain 
characteristic polynomial 


'z-y -qe "' 

h(z. q) = ' 

2 + 7-1 


if q =^0, 
if q = 0 , 


where 7 is the sampling period. It is then readily 
verified that for 7 = 2.1, the two extreme 
polynomials h(z. -0.5) and h(z, 1) are Schur 
stable but the intermediate polynomial h(z, 0 ) is 
not Schur stable. 

Counterexamples of the sort above can al.so be 
given for delay systems having open loop 
description Pr(s. q, r) = P(s, q, r)e~'^ with 7 > 
0 fixed and P(s, q, r) being an uncertain plant 
with independent uncertainty structure and 
interval bounds for the components of q and r; 
e.g. see Ackermann et al. (1990). In the same 
paper extreme point results are given under 
further restrictions on the plant. For example, if 
the plant denominator is fixed, the plant 
numerator is an interval polynomial and unity 
feedback is applied, closed loop stability can be 
ascertained by working with the extreme points 
q' of Q. The interested reader should also 
consult the paper by Kim and Bose (1990) 
where, in a similar vein, the issue of extreme 
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point results is addressed for classes of delay 
systems satisfying various technical assumptions. 

To Qonclude this section, we mention a body 
of work aimed at extension of Kharitonov’s 
Theorem to scattering Hurwitz polynomials. In 
the papers by Bose (1988), Kim and Bose (1988) 
and Basu (1989), the uncertain polynomial 
p(s, q) is replaced by a multivariate polynomial 
p(st,S 2 , ■ ■. , s„, q) and interval bounds on the 
coefficients are imposed. These papers go on to 
develop extreme point results which appear to 
be more relevant to signal processing than 
control. 

1.1, CONCLUSIONS 

The theory of robust stability for polytopes of 
polynomials seems to have reached a mature 
state. For polytopes of matrices, however, much 
research remains to be done. An interesting 
result illustrating the difficulty of this problem is 
given in the paper by Cobb and DeMarco 
(1989): If si is a polytope of n x n matrices with 
rt a 3, then stability of all faces of dimension 
2n — 4 is sufficient to guarantee robust stability of 
si. Moreover, if the dimension of sd {viewed as a 
subset of 9?") is 2n — 4 or greater, there are 
examples of matrix polytopes which are unstable 
but have the property that all faces of dimension 
2/1-5 are stable. Results of this type motivate 
further research involving the computational 
complexity of robust stability problems. Some 
initial results in this direction are given in the 
paper by Rohn and Poljak (1992) where a class 
of robust nonsingularity problems are shown to 
be NP-hard. Another early result is given in 
Coxson and DeMarco (1992) where it is shown 
that the problem of deciding if the real 
structured singular value of a matrix is bounded 
above by a given constant is NP-hard. 

Given the fact that extreme point theory 
applies to rather specialized situations, the 
following question is important; What role will 
extreme point results have (if any) in an 
“ultimate” robustness analysis theory which is 
yet to emerge? On one hand, researchers 
working towards extreme point solutions fully 
recognize that their “elegant” solutions apply to 
a very limited problem class. On the other hand, 
if one attacks a robustness analysis problem with 
“real world specifications”, one might quickly 
become involved in a whole host of issues 
ranging from problems involving distinction 
between local and global minima to high 
computational complexity. 

The perspective of the authors of this survey is 
that a good direction for future research is 
described by the words tailored algorithm. We 
elaborate on this point by first describing an 


"untailored” algorithm; Suppose one takes a 
robustness analysis problem, “blindly” massages 
it into a mathematical programming problem 
and then applies some software package. Then, 
we call such an algorithm untailored in the sense 
that the underlying control structure is being 
totally ignored. As an alternative, however, 
there is a possibility of tailoring the iterative 
steps of the algorithm by exploiting analytical 
results which apply to some approximate version 
of the problem at hand. This idea is well 
illustrated by the line of work beginning in Saeki 
(1986) and continued by authors such as de 
Gaston and Safonov (1988) and Sideris and 
Sanchez Pena (1989). In these papers the plant 
numerator and denominator have multilinear 
uncertainty structures but using the Mapping 
Theorem (see Zadeh and Desoer (1963)), it 
becomes possible to use results on affine linear 
uncertainty structures at each step in the 
iterative process. In other words, these authors 
provide tailored algorithms which exploit analyt¬ 
ical results which apply to an idealization of the 
problem at hand. Roughly speaking, the error 
between the solution to the idealized problem 
and the .solution to the original problem is 
corrected by the iterative process. 

To further illustrate the notion of tailored 
algorithms, we consider analytical results as¬ 
sociated with p theory. The underlying control 
system structure sometimes makes it possible to 
generate convex bounds for the solution which 
would not be ordinarily available for an arbitrary 
mathematical program. The systematic exploita¬ 
tion of such bounds leads to tailored algorithms. 
For example, when minimizing some function 
f{x), the clever exploitation of a convex lower 
bound fi.eix) might greatly facilitate numerical 
computation. 
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A controller design methodology based on linear programming is suggested 
to handle time-domain specifications in the presence of structured plant 

uncertainty. 
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Abitnct —Thi.s paper addresses the problem of designing 
feedback controllers to achieve good performance in the 
presence of structured plant uncertainty and bounded but 
unknown disturbances. A general formulation for the 
performance robustness problem is presented and exact 
computable conditions arc furnished. Thc.se conditions are 
then utilized for synthesizing robust controllers which 
involves solving optimization problems. These solutions 
are computed using the duality theory of Lagrange 
multipliers. Approximation.s and computational issues are 
discussed. 

1. INTRODUCTION 

The objective of robust control is to provide 
in a quantitative way the fundamental limitations 
and capabilities of controller design in order to 
achieve good performance requirements in the 
presence of uncertainty. Even though a real 
system is not uncertain, it is desirable to think of 
it as such to reflect our imprecise or partial 
knowledge of its dynamics. On the other hand, 
uncertainty in the noise and disturbances can be 
cast under “real uncertainties”, as it is 
practically impossible to provide exact models 
for such inputs. 

Many of the design specifications tend to be 
concerned with amplitudes of signals. For 
instance, tracking, disturbance rejection, ac¬ 
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tuator authority, all result in specifications 
concerning the maximum amplitudes of signals. 
On the other hand, disturbances and noise are 
usually persistent, bounded, otherwise unknown. 
This environment motivates a peak<to-peak kind 
of specifications, which is the theme of the 
theory. 

In this paper, a general framework for 
designing controllers that achieve robust peak< 
to-peak performance in the presence of plant 
perturbations is presented. First, computable 
necessary and sufficient conditions for perfor¬ 
mance robustness are presented. The connec¬ 
tions between these conditions and spectral 
properties of positive matrices are highlighted 
and utilized to simplify the computations. These 
conditions are in turn used for controller 
synthesis. The synthesis procedure followed will 
involve iterative solutions of minimization 
problems, the solution of which is obtained by 
using the duality theory of Lagrange multipliers. 

The problem, formulated in Vidyasagar 
(1986), was solved in Dahleh and Pearson (1987, 
1988a). The theory was further developed in 
Deodhare and Vidyasasgar, 1990, Diaz-Bobillo 
and Dahleh (1993, 1992), Mendlovitz (1989) and 
Staffans (1990, 1991). The robust stabilization 
problem in the presence of ^«,-stable perturba¬ 
tions was first analysed in Dahleh and Ohta 
(1988) in the case of unstructured perturbations. 
In Khammash and Pearson (1990) a performance 
objective was added to the robust stability 
requirement in the unstructured perturbations 
case and conditions were provided for robust 
performance and stability. This led the way to 
the development of exact necessary and 
sufficient conditions for robust performance in 
the presence of structured perturbations (Kham¬ 
mash and Pearson (1991a, b, 1992)). Most of the 
above results have continuous-time analogs. 
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This paper is intended to be of tutorial value. 
In addition to surveying some of the important 
relevant results, new results are also included. 
Presented in this paper is a unified framework 
for designing robust controllers in the presence 
of structured uncertainty. Non-conservative 
conditions to guarantee robust performance are 
developed directly in terms of the spectral 
radius of certain matrices capturing the structure 
of the perturbations. Exact relations between 
these conditions and linear matrix inequality 
conditions are then established. In addition, the 
use of linear programming in synthesizing robust 
controllers is highlighted through the application 
of the theory of Lagrange multipliers. Through 
this simple formulation, problems that admit a 
finite-dimensional equivalence become quite 
transparent. For the rest of the problems, the 
theory proves to be quite instrumental in 
providing upper and lower approximations of the 
exact problem. 

This paper puts together all of the above 
development in a way that makes the theory 
readily usable for design. In general, details that 
appeared elsewhere will not be presented, 
however, simple and intuitive proofs of the main 
ideas will be. Similarities and contrasts between 
this theory and the n formalism will also be 
highlighted. 


A nonlinear operator H C.r is causal if 

P,H = P,HP„ \/k = 0,1,2,..., 


strictly causal if 

P,H = P,HP,.„ \tk = 0,1,1,..., 


time-invariant if it commutes with the shift 
operator {HS = SH), and fp stable if 


||«II = sup sup 


The quantity ||//|| is called the induced 
operator norm over fp. 

Xtv denotes the set of all linear causal 
^„-stable operators. This space is characterized 
by infinite block lower triangular matrices of 
the form 


0 

«... «„ 


where is a p x q matrix. This infinite matrix 
representation of H acts on elements of €% by 
multiplication, i.e. if u e fi, then y := Hu € 

k 

where y{k)= E «*/«(/) e M'’. The induced 

/=" 

norm of such an operator is given by: 


||«IU, = sup|(//„---«„)|,. 


2. PRELIMINARIES 

First, some notation regarding .standard 
concepts for input/output systems. For more 
details, consult Desoer and Vidyasagar (1975) 
and Willems (1971) and the references therein. 

• denotes the set of all sequences / = 
{/n,/i,/ 2 . ■ ■ ■}. A e such that 

ll/llr. = sup |/(/c)U<*, 

k 

where \f{k)\^ is the standard norm on 
vectors. Also, denotes the extended space 
of all sequences in and denotes the 

set difference. 

• p € [1, oo), denotes the set of all sequences 
such that 

ll/ll (El/W) <”■ 

• Co denotes the subspace of of sequences 
converging to zero. 

■ S denotes the backward shift operator (unit 
time delay). 

• /* denotes the itth-truncation operator on A.,,: 


where |A|, = max |a,J. 

• / 

• iPjY denotes the set of all H e Sfrv which are 
time-invariant. It is well known that if TV is 
isomorphic to and the matrix representation 
of the operator has a Toeplitz structure. Every 
element in if 7 ^, is associated with a A-transform 
defined as 

□c 

Hik)= S H(k)k'‘. 

ft=() 

The collection of all such transforms is usually 
denoted by A, which will be equipped with the 
same norm as the norm. 

Throughout this paper, systems are thought of 
as operators. So the composition of two 
operators G, // is denoted as GH. If both are 
time-invariant then GH e (or ^ti)^ and the 
induced norm is denoted by \\GH\\i. When the 
A-transform is referred to specifically, we use the 
notation for the transform of H. Also, all 
operator spaces are matrix-valued functions 
whose dimenvsions will be suppressed in general 
whenever understood from the context. 

Let be a normed linear space. The space of 
all bounded linear functionals on X is denoted 
X*, equiped with the natural induced norm; X* 
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is always complete. It is convenient to put on X* 
a weaker topology which makes X** = X. This is 
the weak*-topology. 

Dual of (fp, lsp<oc. The dual of €p is 
where (1/p) + (1/^) = 1. The characterization is 
given by the following theorem. 

Theorem 1 . Every bounded linear functional / 
on 1 sp < oe, is representable uniquely in the 
foim 

f{x) = 'Zx,y„ 

1-0 

where y = (y,) is an element in Furthermore, 
every element of defines a member of €* in 
this way and 

WfW-WyWr 

The above definitions are extended for vector¬ 
valued sequences and matrix-valued sequences 
in the obvious way. 

In this paper, we will give a solution to the 
synthesis problem by using the theory of 
Lagrange multipliers. Many people are quite 
familiar with this theory for finite-dimensional 
optimization problems, and in the sequel, we 
will review the basic duality theorem for 
infinite-dimensional problems. For a more 
thorough treatment, see Luenberger (1%9). 

Let be a vector space. A convex cone P is a 
convex set such that if x e P then ox g P for all 
real nr > 0. Given such P, it is possible to define 
an ordering relation on X as follows; x^y if and 
only if X - y e P. Then it is natural to define a 
dual cone P* (with an abuse of notation) inside 
X* in the following way: 

P*^{x*eX* I (x,x*)>0VxeP}. 

This in turn defines an ordering relation on X*. 

Let / be a convex function from A" to IR and G 
a convex map from X to another normed space 
Z. Also, let Q be a convex subset of X. Assume 
that there exists XieX such that G(xi)<() (the 
inequality with respect to some cone in Z). This 
is generally known as the regularity assumption. 
Define the minimization problem; 

fio = inf/(ji) subject to x g Q, C(x) 0 . 

The Lagrange multiplier theory basically says 
that this constrained optimization problem can 
be transformed to an unconstrained problem 
over X 6 Q. Precisely, there exists an element 
Zfl ^0 in Z* (with respect to the dual cone) such 

(io= inf (/(jc)+ (C(jc), Zff)}. 

The element Zg is precisely the Lagrange 


under uncertainty 

multiplier. Equivalently, 

jUo= sup inf {f(x) + {Gix), z*)). 

In the case where the inhmization problem 
contains equality constraints, we will replace 
them by two inequality constraints. Care should 
be taken in this case since the assumption that 
the constraint set has an interior pjoint will be 
violated; however under mild assumptions, if the 
equality constraints are given in terms of linear 
operators, the result will still hold without the 
regularity conditions. 

3. WHY THE L SIGNAL NORM? 

In many real-world applications, output 
disturbance and/or noise is persistent, i.e. 
continues acting on the system as long as the 
system is in operation. This implies that such 
inputs have infinite energy, and thus one cannot 
model them as “bounded-energy signals". 
Nevertheless, one can get a good estimate on the 
maximum amplitude of such inputs. Examples 
where bounded disturbances arise in practical 
situations arc abundant. Wind gusts facing an 
aircraft in flight can be viewed as bounded 
disturbances. Without a correcting control 
action, such disturbances will cause the aircraft 
to deviate from its set path. An automobile 
driven over an unpaved road experiences 
disturbances due to the irregularity of the 
course. Such disturbances, althpugh persistent, 
are clearly bounded in magnitude. In process 
control, level measurements of a boiling liquid 
are corrupted by bounded disturbances due to 
the constant level fluctuations of the liquid. 
Because such disturbances arc so frequent, a 
mathematical model describing them is essential. 
The C norm is clearly the most natural choice 
for measuring the size of such disturbances. In 
general, we will assume that the disturbance is 
the output of a linear time-invariant (LTI) filter 
subjected to signals of magnitude less than or 
equal to one, i.e. 

d = Ww, llwll.sl. 

Not only is the C norm useful for measuring 
input signal size, but it can also be very useful as 
a measure for the size of output signals. For 
example, in many applications it is crucial that 
the tracking error never exceeds a certain level 
at any time. While this requirement cannot be 
captured by using the ^2 norm, it can be stated 
explicitly as a condition on the C norm of the 
error signal. Another situation when the C norm 
is useful is when the plant, or any other device in 
the control loop, has a maximum input rating 
which should not be exceeded. This translates 
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directly to 4 requirement on the C norm of that 
input. An example of such a requirement 
appears in the next section. In addition, the 4 
norm plays an important role in designing 
controllers for nonlinear systems. Since most of 
the nonlinear controller designs are based on 
linearization, the linear model gives a faithful 
representation of the system only if the states 
remain close to the equilibnum point, a 
requirement captured directly in terms of the 4 
norm. 

4. THE r, NORM 

While the 4 norm is used as a measure of 
signal size, the fi norm is used to measure a 
system’s amplification of 4 input signals. Let T 
be an LTI system given by 

z(f) = (rw-)(0=i T(kMt-k). 

k-O 

The inputs and outputs of the system are 
measured by their maximum amplitude over all 
time, otherwise known as the 4 norm, i.e. 

||H'||,„ = maxsup|w//r)|. 

/ * 

The norm of the system T is precisely equal to 
the maximum amplification the system exerts on 
bounded inputs. This measure defined on the 
system T is known as the induced operator norm 
and is mathematically defined as 

lir||= sup ||rw|u = ||r||,, 

where ||T||i is the norm of the pulse response 
and is given by 

||r||, = max2 XMk)|. 

' / * 

A system is said to be ^*-stable if it has a 
bounded norm, and the space of all such 
systems will be denoted by . From this 
definition, it is clear that the system attenuates 
inputs if its norm is strictly less than unity. 

In the case where the inputs and outputs of 
the linear system are measured by the (2 norm, 
then the gain of the system is given by the //«. 
norm and is given by Doyle et aL (1989), Francis 
(1987), Vidyasagar (1985) and Zames (1981) 

||t|U= sup 

0&e-s2n 

The two induced norms are related by Boyd and 
Doyle (1987) 

||t|U=sC, \\T\U^C 2 iN) IlflU, 

where Ci is a constant depending only on the 
dimension of the matrix T, and C 2 is a linear 
function of the McMillan degree N of f. In other 


words, every system inside /j is also inside 
but the converse is not true. This means that 
there exist 4 stable LTI systems that are not 
stable; an example is the function with the 
A-transform given by Boyd and Doyle (1987); 

Thus, for LTI systems, minimizing the norm 
guarantees that the norm is bounded. This 
means that this system will have good ^ 2 “ 
disturbance rejection properties as well as 
/^-disturbance rejection properties. Also, the 
norm is more closely allied with BIBO stability 
notions and hence is quite desirable to work 
with. The disadvantage in working with the /’j 
norm is the fact that it is a Banach space of 
operators operating on a Banach space, not a 
Hilbert space itself. Many of the standard tools 
are not usable; however, this paper will present 
new techniques for handling problems of this 
kind. 

5 . PROTOTYPE PROBLEMS 
In this section we demonstrate the advantages 
of using the if* signal norm by presenting a few 
prototyj>e problems. For each problem, certain 
control objectives related to the C norm are to 
be met. These problems demonstrate the 
advantages of using the signal norm as a 
means of capturing time-domain specifications in 
an uncertain environment. Later on, it will be 
shown how all such problems can be treated in a 
unified manner under a single framework. We 
shall then develop mathematical techniques for 
obtaining solutions for all problems which fit 
within that framework. 

5.1. Disturbance rejection problem 
Consider the system in Fig. 1. Here Fo is a 
plant and K a controller, both LTI. The system 
is subjected to bounded disturbances which are 
reflected at the plant output. As mentioned 
earlier, these disturbances are assumed to be the 
output of a time-invariant filter Wi reflecting the 
frequency content of such disturbances. The 



Fig. 1. Disturbance rejection problem 
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Pig 2 Command following with input saturation. 


control objective in this case will be to find a 
controller K which satisfies the following; 

(1) K internally stabilizes the feedback system. 

(2) The effect of the disturbances at the plant 
output is minimized, i.e. K minimizes 

sup ||z||.. 

11 tv 11-^1 


5.2. Command following in the presence of input 
saturation 

The coinnnand following problem is equivalent 
to the disturbance rejection problem. Consider 
the system in Fig. 2. The plant, P, suffers from 
saturation nonlinearities at its input. Therefore, 
It can be viewed as having two components: a 
saturation component, Sat(.), and an LTI 
component, Pq- The saturation component is 
defined as follows: 


Sat (u) = 


u 




As a result the plant is described as P = 
PoSat(.). Because of the presence of the 
saturation, the plant input, u, must not be 
allowed to exceed This requirement can be 
captured in a natural way using the C norm of u. 
In other words, u must satisfy ||m||*^ 

The command, r, is to be followed at the plant 
output. It is not fixed but rather can be any 
command in the set 


{r = lVw:||w|U-l}, 

where W reflects the frequency content of the 
desired commands and is typically a low pass 
filter. 

The control objectives can now be stated more 
precisely. It is desired to find a controller K such 
that: 


(1) K internally stabilizes the system, 

(2) ||u|U<t/^„. 

(3) y follows r uniformly in time to within a 
maximum error level of y > 0, i.e. ||y - r||* ^ y. 

Here, by internal stability we mean local internal 
stability. As long as the external signals do not 
cause u to exceed the system will be 

operating in the linear region and internally 
stability of the linear system implies that all 
signals in the loop will be bounded for bounded 
inputs. 



Fig 3 Robust disturbance rejection problem. 


5.3. Robust disturbance rejection 
In the previous two problems, the plant was 
assumed to be known exactly. This is rarely the 
case due to unmodelled dynamics, parameter 
variations, etc. When the controller designed for 
a nominal plant model is implemented on the 
real system, there are no guarantees on the 
resulting performance of the system. Even 
requirements as basic as stability may not be 
met. The deviation from the expected behavior 
of the system clearly depend on the accuracy of 
the model. Since modelling uncertainty is 
inevitable, it is imperative to include stability 
and performance robustness to model uncer¬ 
tainty as a design objective. We now take a 
second look at the disturbance rejection problem 
discussed earlier. Instead of considering a single 
nominal time-invariant plant, Pq, we shall instead 
consider a collection of plaatl^. The class of 
plants considered is taken be 

n-[p = {i + w,AW,)p^; 11 ^ 

; A is causal and sup 

" v„^ l|u|U 

where Wi and W 2 are timeHq^U^js W^tginmg 
functions (see Fig. 3), In thisSjcfinitfon, the 
plant perturbation, A, may be time-varying 
and/or nonlinear. Any plant belonging to this 
plant class is said to be admissible. Note that 
when A = 0, we recover the nominal LTI plant. 
Consequently, the collection of admissible 
plants, n, may be viewed as a ball of plants 
centered around the nominal time-invariant 
plant model. If a system property, such as 
stability, holds for all admissible plants it is said 
to be robust. We now add to our original 
disturbance rejection problem a new objective: 
robustness. In other words, the controller K is 
now required to perform the following tasks: 

(1) /C internally stabilizes all admissible plants, 
i.e. all plants in the class 11. 

(2) K minimizes the effect of the disturbance w 
on the magnitude of the output for the worst 
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possible admissible plant, i.e. K minimizes 
sup sup ll)-!!.. 

Pefl llwiu^sl 

5.4. Robustness in the presence of coprime factor 
perturbations 

Another approach to the representation of 
plant uncertainty is through coprime factor 
perturbations (Dahleh (1992); Glover and 
McFarlaile (1989)). Let Po = NM~^ be a coprime 
factorization of the nominal plant. The graph of 
the plant Po over the space 4. is defined as the 
image of the space 4 under the map where 


4'-> 4 X 4, 



The class of admissible plants can be defined 
as those plants whose graph is perturbed in the 
following way: 

n = = IIA.II^I, llA.II^l}. 

This plant class can be viewed as that obtained 
by perturbing the plant numerator and the plant 
denominator independently as shown in Fig. 4. 
The main objective in this case is to find a 
controller K which stabilizes all plants in the 
class n. 

5.5. A multiobjective control problem 
In almost all practical control problems, more 
than one objective must be met simultaneously. 
Perhaps one of the most attractive features of 
the present approach is its ability to handle 
multiple objectives in a natural way. As an 



Fio. 4. Coprime factor perturbations. 


example of a multiple objective problem 
consider the system in Fig. 5. In the figure the 
plant is subjected to multiplicative output 
perturbations. In addition it has a saturation 
nonlinearity at its input of the type discussed 
earlier. A command input, r, is applied while a 
bounded disturbance, d, is acting at the plant 
output. The objectives in this problem are a 
combination of those objectives in the first three 
problems discussed earlier. Aside from stabi¬ 
lizing all admissible plants, the controller must 
also ensure that the plant input, u, never 
exceeds its maximum, despite the presence 
of the output disturbance, the command input, 
and the plant uncertainty. Furthermore, the 
tracking error in this unfriendly environment 
must be maintained at a minimum level for all 
time. These requirements on the controller are 
summarized as follows: 

(1) K stabilizes all plants in 11. 

(2) K is chosen such that sup sup||u||»^ 


(3) K is chosen such that 
minimized. 


sup supllell 

IlH'JUil Pen 


is 


Comment. 

It is possible in this formulation to include 
time-varying weights with which one can 
emphasize certain periods of the time response. 
The general framework and solutions presented 
in the sequel generalize in the presence of such 
weights; however, we will restrict our discussion 
to the time-invariant case. 


6 . A GENERAL FORMULATION. THE ROBUST 
PERFORMANCE PROBLEM 

In the previous section, we have formulated 
sample control problems which reflect various 
practical control requirements. Two assumptions 
were embedded in the problem statement. The 
first of these is that the command signals and the 
disturbance signals do not necessarily decay in 
time but can instead persist over all time so long 
as they are bounded. This is a fairly realistic 
assumption and leads to the adoption of the 4 



Fig. 5. Multiobjective problem. 
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Fig. 6 . Tlic robust performance problem. 


norm to measure the signal size. The second 
consists of requiring the regulated signals of 
interest to have small maximum amplitudes. 
Thus, once more, the signal norm is used as a 
measure for signal size, but this time it is the 
regulated output signals which are being 
measured. When considering that quite often the 
output of interest is a tracking error, plant input 
and/or plant output, it becomes clear that 
limiting the maximum value these signals can 
achieve is desirable if not necessary. As a means 
for obtaining a unifying framework for formulat¬ 
ing and solving a wide variety of problems with 
4= signal norms and 4 induced-norm bounded 
perturbations, we set up the Robust Perfor¬ 
mance Problem. All the prototype problems 
discussed in the previous section are special 
cases of this general problem. So consider the 
system in Fig. 6: A models the system 
uncertainty, K is the controller, and Go contains 
the remaining part of the system. It is assumed 
that A belongs to the following class: 

S)(rt):= {A = diag(A,.A„) 

; A, is causal and |{ A,|| ^ 1). 


Here |1A,|| is the induced norm, i.e. 


||A,1| = sup 


l|A,u|| 

||u|U 


In the sequel, the A,s are assumed to be SISO 
for simplicity. There is no time-invariance 
restriction on the perturbations, and hence 
time-varying and/or nonlinear perturbations are 
allowed. The diagonal structure of the perturba¬ 
tions is essential for incorporating information 
about the location of the system uncertainty. For 
example actuator unmodelled dynamics are not 
related to sensor unmodelled dynamics or to the 
plant’s unmodelled high-frequency dynamics, 
and should not be modelled by the same 
perturbation block. By isolating the independent 
sources of uncertainty, a more realistic and less 
conservative system model is obtained. This is 
the main reason for considering structured 
perturbations. While A models the uncertain 
part of the system, G,, is the known part of the 


system with the exception of the controller, and 
it is a 3 X 3 block matrix. The actual system is an 
element in the upper linear fractional connection 
of Go and the admissible As. So, included in Go is 
the nominal plant/plants, any input and output 
weighting functions, and any wei^ting functions 
on the perturbation!). We shall restrict the 
weights and the nominal plant to be LTl. As a 
result. Go is LTI. Hie controller K is also 
assumed to be LTl. The signal w denotes all 
exogenous inputs, including the command inputs 
and the disturbance inputs which are assumed to 
be in 4, while z denotes the regulated outputs. 
Both w and z are allowed to be vector signals. 
From now on, we shall refer to the map taking w 
to z as 3'^,^,. The induced norm of 3^^ is 
defined as follows; 


II ^,,.,11 :=sup 


l|w|U 




We will say that the system in Fig. 6 is ^ stable 
(or just internally stable) if, when injecting 
external €“ signals at the inputs of A, Go, and K, 
the effect of these signals together with w e ^ is 
to produce signals at any point in the loop. 
Furthermore, the induced norm of this mapping 
between the injected signals together with w and 
any internal signal in the loop must be bounded 
(see Desoer and Vidyasagar, 1975). 

We are now ready to state the Robust 
Performance Problem. 


Robust Performance Problem. Find a controller 
K such that: 

(1) The system achieves robust stability, i.e. K 
internally stabilizes the system for all 
admissible perturbations, i.e. for all A in 
^(n). 

(2) The system achieves robust performance, 
i.e. K is chosen so that 

sup ||.‘Tz,.||<l. 

Ae r'(n) 

As mentioned earlier, the prototype problems 
discussed can all fit in this framework. As an 
example, for the Disturbance Rejection Problem 
since the number of perturbation blocks, n, is 
zero. Go has only two inputs w and u, and two 
outputs 2 and y. As a result Go has the form: 



This is referred to as the nominal performance 
problem. 

On the other hand, for the robust disturbance 
rejection problem n will be 1. Thus, Go has an 
additional input fed from the perturbation 
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output^ and an additional output feeding the 
perturbation input. It follows that Go has the 
following structure: 


/O 

0 


Go= W, 

Wr 

-Po 

\W3 

w, 

-P« / 


, etc. 



7. ROBUSTNESS CONDITIONS 

Having stated the Robust Performance Pr¬ 
oblem, we can now focus our attention on its 
solution. In particular, we shall develop 
necessary and sufficient conditions for achieving 
both performance robustness and stability 
robustness. These conditions will be used for the 
robustness analysis of the system at hand. In this 
case, the controller is assumed given and fixed 
and its effect on the robustness of the system is 
investigated. The same conditions developed for 
robustness analysis are used to develop tech¬ 
niques for the synthesis of robust controllers. 

We begin by discussing the robustness analysis 
issue. Suppose we are given a nominal system 
Go, a perturbation class 3i(n), and a controller K 
connected as shown in Fig. 6. We can 
incorporate Go and K together and view them as 
one system, Af, as shown in Fig. 7. Thus M will 
have two inputs and two outputs. We will 
assume that the controller K stabilizes the 
nominal system Go; otherwise robust stability 
and hence performance clearly will not be 
achieved. Consequently, M will be LTl and 
stable. We will say the system in Fig. 7 achieves 
robust stability if it is stable for all admissible 
perturbations. We will say that it achieves robust 
performance if, in addition, for all 

admissible perturbations. We can now state the 
following problem whose solution is provided in 
the next two sections. 

Robustness analysis problem. Under what con¬ 
ditions on M will the system in Fig. 7 achieve 
robust performance? 

7.1 . Stability robustness vs performance 
robustness 

It is an interesting fact that a robust 
performance problem can be transformed to a 
robust stability problem. This has been shown in 
Doyle and Stein (1981) and Doyle (1982) when 



Fig. 7. Robust performance problem m M-A form 


System I System II 

Fig 8 Stability robustness vs performance robustness. 

the perturbations are LTI with an ^2 induced- 
norm. This remains true in our case as well, 
although the perturbation class and the method 
of proof are quite different. To elaborate further 
on this relationship between stability robustness 
and performance robustness consider the two 
Systems shown in Fig. 8. System I corresponds 
to a performance robustness problem, while 
System II is formed from System I by feeding z 
back to w through a fictitious perturbation, A/., 
satisfying ||A/*||<1. As a result, System II has 
3}{n -b ]) as its perturbation class. We can now 
ask the following question: How does the 
performance robustness of System I relate to the 
stability robustness of System II? One aspect of 
the relationship between the two notions of 
stability is fairly obvious: performance robust¬ 
ness of System 1 implies stability robustness of 
System II. This is quite easy to see. Since robust 
performance is equivalent to the norm of the 
map between w and 2 being less than one, the 
Small Gain Theorem can be used to establish the 
stability of System II for all ||A,.||:<1, or 
equivalently to establish the robust stability of 
System II. Equally important, the relation 
between stability robustness and performance 
robustness holds the other way as well. In other 
words, stability robustness of System II implies 
performance robustness of System I. This 
direction is not as obvious as the first one. The 
proof follows from certain results on the 
robustness of time-varying systems. 

7.2. Stability robustness conditions 
Because performance robustness is equivalent 
to stability robustness in the sense discussed 
earlier, we need only discuss stability robustness. 
Specifically, we can consider the interconnection 
of a stable LTI system, M, with a structured 
perturbation ^e^{n) in Fig. 9, and seek 
necessary and sufficient conditions for the 
stability robustness of the system. Since M and A 
are both stable, the internal stability of the 
system is equivalent to the map 1 - Af A having a 
stable inverse, one which maps to itself with a 
finite gain. When the signal norm is the €2 norm 
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Fio. 9 Stability robustness problem. 


and the perturbations are time-invariant, the 
conditions are provided by the Structured 
Singular Value, fi (Doyle, 1982). In particular, 
robustness is achieved iff sup n(M(e^^))<l. 

In our formulation, it turns out that the 
conditions are much simpler and easier to 
compute than fi. Before we can present these 
conditions we need to define a certain 
nonnegative matrix, M, which depends solely on 
M. Recalling M has n inputs and n outputs, it 
can be partitioned as follows: 

Af, 

Af = 

Each M,j is LTI and stable, and thus 
Clearly ||A/,J|] can be computed with arbitrary 
accuracy by considering finite truncations of Af,, 
as approximations. We can now define M as 
follows: 


llAfnII. l|Af,J|, 

Af = 

LllMnlll, ••• lIMJlJ 

One of the most interesting aspects of the 
robustness problem formulated here is the role 
which M plays in the system robustness. This is 
presented in the next theorem due to Khammash 
and Pearson (1991a, b, 1992). 

Theorem 2. The system in Fig. 9 possesses 
robust stability iff any one of the following holds. 

(1) p(Af)<l, where p(.) denotes the spectral 
radius. 

(2) X ^ Mx and x > 0 imply that x = 0, where the 
vector inequalities are to be interpreted 
componentwise. 

(3) inf < 1, where 


9i:= {diag(ri,. . . , r„):r,>0}. 

One of the main contributions of this theorem 
is that it provides simple and exact conditions for 
testing the system’s stability robustness regard¬ 
less of the number of perturbation blocks, n. 
While the three conditions in the theorem are 


equivalent, each provides a different perspective 
and has certain advantages over the others. For 
example, the spectral radius condition is in 
general the easiest to compute. It is particularly 
useful when n is large since it can be computed 
efficiently using power methods. Specifically, 
g^ven an Af which is assumed primitive (i.e. 
Af * > 0 for some integer k) then it satisfies the 
following: 

"""IKT 

for any vector x>0. Furthermore, the upper 
and lower bounds both converge to p(M) as k 
goes to infinity. If M were not primitive, it can 
be perturbed slightly to become primitive. 

Whereas the spectral radius test provides a yes 
or no answer concerning system robustness, the 
second test involving the Linear Matrix Ine¬ 
quality (LMI) is most useful for providing 
information about the effect of the individual 
entries of M on the overall robustness of the 
system. This is achieved by translating the LMI 
condition into n algebraic conditions stated 
explicitly in terms of the entries of M. This is 
best demonstrated by an example. Suppose M is 
a 2x2 matrix corresponding to a certain 
robustness problem with n = 2. The LMI 
condition states that robust stability iff the 
system 

JCi^l|Af„||,Jr,-l-||Af,2j|,X2, 
Jr2^l|Af2,||,JC,+ ||Af22||, jr2, 

has no solution x = {x^.x^) e [0 , op) x (0 , oo)\(0}. 
Among other things, this implies that ||Af,,||, < 
1; otherwise jc = (l,0) would be a solution for 
the two inequalities. The first inequality can be 
rewritten as 

^ IIAf.zlIi ^ 

'''"l-||Af„|| 

When combined with the second inequality, we 
have that 


has no solution in (0, °o), which is equivalent to 


IIAf^ill, 


l-||Afn||, 


+ l|Af22ll,<l. 


This last condition, together with the condition 
that ||Af„||,<l, is therefore necessary for the 
inequality robustness conditions to hold. By 
retracing our steps backwards, it becomes clear 
that they are also sufficient. This procedure of 
constructing explicit norm conditions from the 
second robustness condition can be repeated in 
the same way for any n. 
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Finally, the third robustness condition is useful 
for robust controller synthesis. This will be 
discussed in more detail later on. 

7.2.1. Equivalence of the robustness condi¬ 
tions in Theorem 2. Before we shed more light 
on the proof of Theorem 2, we will show that the 
three, apparently unrelated, conditions in the 
statement of the theorem are indeed equivalent. 
We will show that 102 and that 103. For 
simplicity, we will do this for an irreducible M. 
So suppose that p(Af) < 1. It follows that 
(/ - M)”' exists. Since (/ - Af)“* = / + Af + 

+ ■ ■ ■, all of its entries will be positive. Now if 
jr 2 0 is such that x £ Afjr, or equivalently, 
(/-A#)jtsO, then multiplying both sides by 
(/-Af)“‘ implies that jt:£0. Thus x must be 
zero. This is what 2 states. To show that 2 
implies 1, suppose 1 does not hold, i.e., that 
p(M) a 1. The Perron-Frobenius theory for 
nonnegative matrices states that p(Af) is itself an 
eigenvalue of Af. Moreover, associated with 
pild) we can find an eigenvector j:'> 0. This 
implies that p{lii)x'— Mx', which in turns 
implies that 2 does not hold. Thus, we have 
demonstrated that 1<:>2. 

We now show by showing that 

p(A3f)= inf ||/?“'Af/?||i. By definition, 

Re» 


l|«-'Af/i||, = max2ll(R''MR)<,|l, 



The expression on the right is also equal to the 
induced norm of the matrix R~'MR as a map 
from (R", ||.||») to itself. Referring to this norm 
by l.li> we therefore have ||/?"'Afy?||, = 
|/?~'AfJi|i. Since any matrix norm bounds from 
above the spectral radius of that matrix we have: 

inf ||R-‘Af/?||, 

Rem 

= inf \R~'MRU>piR-^MR) = p{M). 
/? € 


But if we choose R = diag(rl. r'„), where 

(rl,.... r'„)‘ is the positive eigenvector cor¬ 
responding to the eigenvalue p(Af), the 
inequality becomes an equality and the equiv¬ 
alence between 1 and 3 is established. It is 
interesting to note that for the optimum scalings 
R = diag(ri,. .. , r^), all the rows of R~'MR 
have the same norm. As will be demonstrated 
shortly, this fact is used to show why Condition 3 
in the above theorem is necessary for system 
robustness. 



FtCi 10. Scaled system 


7.2.2. Proof of necessity and sufficiency. When 
n = l, the spectral radius condition in the 
theorem above reduces to the condition 
||Af||| < 1. A simple application of the Small 
Gain Theorem shows that this condition is 
sufficient for stability. Necessity has been shown 
by Dahleh and Ohta (1988). For n larger than 
one, we now show that inf ||R“'A/R||i < 1 is 

Rem 

sufficient for robust stability. We do this with the 
aid of Fig. 10 obtained via the addition of 
scalings R and /?', where Re. 91. Clearly, the 
robustness of this system and that in Fig. 9 are 
equivalent in the sense that if one is robustly 
stable, then so is the other one. Moreover, for 
the system in Fig. 10. RAR~^ belongs to S(n) 
whenever A belongs to 3i(n), and thus 
IIRAff '11^1. That being the case, the Small 
Gain Theorem can be invoked to conclude that 
ll«"‘AfR|| I < 1 is sufficient for robust stability. 
This holds for any R The least conservative 
sufficient condition obtainable in that manner is 

inf ||R''AfR||,<l. 

Rem 

It should be mentioned at this point that spectral 
radius type conditions have been used in the 
literature to give sufficient conditions for stability 
of large scale systems see Vidyasagar (1981) and 
the references therein. 

We now demonstrate that inf ||i?“'AfR||, < 1 

Rem 

is necessary for robust stability. For simplicity, 
we do this for the case n = 2. The approach will 
be to show how one can construct a destabilizing 
perturbation A e S(2) whenever inf ||R“'AfR||, 

Rest 

al. So suppose that mf ||R“‘A/R||i s 1. We 

Rest 

have previously shown that this infimum is in 
fact a minimum, and it is achieved by an 
optimum scaling, R, obtained from the eigen¬ 
vector corresponding to p{I^). It was also shown 
that the two rows of R~^MR will have equal 
norms. This can be expressed as follows: 

||(/?-'AfR,)ll. = ||(R-'A/R)2||, = \\R-'MR\U s 1. 
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Fig. 11. Scaled system with constructed input 


where (R~^MR), denotes the ith row of R ^MR. 
The system R^^MR appears in Fig. 11 and has as 
its input ^ = (^ 1 , £ 2 ) and output z = (z,, Zj). In 
the figure y = ( Vi, > 2 ) consists of the output 
z = (Z|, Z 2 ) after a bounded signal, the output ol 
a sign function, has been added to it. This 
bounded signal will be interpreted as an external 
signal injected for stability analysis. The strategy 
taken will be to construct | satisfying the two 
requirements; 

( 1 ) ^ is unbounded. 

( 2 ) ^ results in a signal y which satisfies 
II^*§,11 1 ^ ll^*y.lli for 1 = 1 . 2 , where F* is the 
truncation operator which acts on sequences 
by preserving the first /c 4 1 terms and 
setting the rest to zero. 

The first requirement on ^ guarantees that if 
an admissible perturbation A were to mapy to 
it would be a destabilizing one because the 
bounded external signal would have produced an 
unbounded internal signal The second 
requirement, guarantees that such an admissible 
perturbation exists. In other words, if | and y 
satisfy the second condition, then it is possible to 
find A,, for i = l,2, so that A, is causal, has 
induced norm less than or equal to one, and 
satisfies A,y, = If the first requirement is also 
met, this A will be a destabilizing perturbation. 

For simplicity we shall assume that all A/,^s 
have finite impulse response of length, say N. 
The construction of | proceeds as follows. While 
maintaining |^i(fc)| < 1 for = 0 , , . . , N - 1 , the 
first N components of ^ can be constructed so as 
to achieve Since j > 

1 , this implies that ||P/v i^lU— 1 , which in turn 

implies that ||/V/_iy||^^2. Next, while still 
maintaining we pick the next N 

components of ^ so as to achieve the second row 
norm, ||(/i'‘A/i?) 2 ||i. As a result we have 

1 which implies that ||/ 2 /v-iy,|U^ 

2, Note that the second requirement on ^ has 
been met for /c = 0, , . . , 2N - 1. In addition, 
because of the way the first 2N terms of ^ have 


been constructed, we have 

This allows us to relax the restriction on |^#(*:)| 
for ^ > 2JV “ 1 without violating the second 
requirement on Specifically, we now allow 
|?^(/c)| to be as large as 2 for /c = IN ,.. . , 4N - 
1. In the same way as before we can pick |(fc) 
for this range of k so that we satisfy 

which allows us to increase |^i(fc)| by 1 for the 
next 2N components of and repeat the whole 
procedure again. From this construction, it is 
clear that when ^ is completely specified it will 
be unbounded and hence meets the first 
requirement. The second requirement is also 
met since all along i^,(A:) was chosen carefully so 
as not to become too large too soon. 

It should be mentioned that the destabilizing 
perturbation can be taken to be linear 
time-varying (LTV), or it can instead be 
nonlinear time-invariant. So the spectral radius 
condition for robustness is also necessary and 
sufficient whenever the class of perturbation is 
restricted to include norm-bounded nonlinear 
time-invariant perturbations. 

7.2.3. Construction of the destabilizing per¬ 
turbation. In the previous section, we have 
claimed that given ^ ^ and 

such that 

IIA>^/IUVfc, and for / = 1 , 2 , then there exists 
A = diag (A|, A 2 ) such that Ay = ^ and ||A,||=^ 
1 . Such a A was shown to be a destabilizing 
perturbation. In this section, we prove this claim 
by explicitly constructing the perturbation A. It 
turns out that A, can be cither LTV or nonlinear 
and time-invariant. We shall construct Ai to be 
of the former type, while A 2 will be of the latter 
type. 

So .suppose wc are given ^ 1 = [$i(0}r=o e 
and yi = {yi( 0 }r-oe such that j|P*|,||.s 
||/’tyi||. V/c. The construction of A| is trivial if 
yi = 0: just pick A, itself to be zero. So assume 
y, ^0. We start the construction of A, by 
identifying a subsequence of y,, 
say (yi(ii),yi(« 2 ). • ■ ■) which, depending on y,, 
may or may not be finite. This subsequence may 
be defined recursively in the following manner: 
Let {| be the smalle.st integer such that yi(ii) #0. 
Given yi(in), let i„+i be the smallest integer 
greater that i„ such that |yi(i„+i)| ^ |yi(<„)|. 
Using the ^i(t)s and yi(i/)s we are now ready to 
construct A| through specifying its matrix kernel 
representation as follows: 
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/ 


g.Oi) 

yiO’i) 


0 


A,= 




0 ■■■ 0 


hM 

yiih) 

0 


0 




g.(h-i) 

>l(« 2 ) 


0 ••• 0 


g.(^) 


\ 



Notice that each row of the above matrix has at 
most one nonzero element, which, by the choice 
of the will have its absolute value less 

than or equal to one. This implies ||Ai||:^l. 
Moreover, A| is clearly causal and it can easily 
be checked that = which is what we 
wanted to show. 

We now construct a nonlinear, time-invariant, 
and causal perturbation A 2 . As before, A 2 must 
be such that || A 2 II ^ 1 and A 2 y 2 = Let A 2 be 
defined as follows: 


- 0 


(Az/)(*) = 


0 


if for some integer 
Pkf^PkSjz 
otherwise. 


Note that A 2 maps ^2 to ^2 and || A 2 II ^ 1. 


7.3. Comparisons 

It is worthwhile comparing the class of 
perturbations that have gain less than unity over 
i 2 (which arise in the standard |u) with the class 
of perturbations that have gain less than unity 
over C. If the perturbations are restricted to 
time-invariant ones, the ^«-stable perturbations 
with gain less than unity lie inside the unit ball of 
<i-stable perturbations (for the multivariable 
case, the unit ball will be scaled by a constant). 
This follows directly from the norm inequality 
between and //«. If the perturbations are 
allowed to be time-varying, then the two sets are 
not comparable. Earlier, an example was 
presented that shows that the ball is larger 
than the ball. On the other hand, the 


operator A defined by 

(A/)(fc)=/(0), 

is 4 stable but not 4 stable. 

A question which might arise is, how do the 
derived robustness conditions differ from the 
Structured Singular Value? The answer lies in 
the class of perturbations assumed. While the 
perturbations here may be nonlinear time- 
varying (NLTV), nonlinear time-invariant 
(NLTI), or LTV for the conditions to be 
necessary and sufficient, p theory gives necessary 
and sufficient conditions only for LTI perturba¬ 
tions. While the spectral radius test gives 
sufficient conditions for robustness for LTI 
perturbations it can be shown that in general 
these are not necessary, and thus it is potentially 
conservative when it is known that perturbations 
are LTI. In terms of computation, the robustness 
test proposed here is easy to compute and gives 
exact answers for any number of perturbation 
blocks, n. On the other hand, p is much harder 
to compute especially since for n>3 only an 
upper bound can be computed. One can use the 
Small Gain Theorem to get sufficient conditions 
for robust stability in the presence of NLTV 
induced norm-bounded perturbations in the 
same way it was done for the A norm. In this 
case, a sufficient condition would be 
infj?e« < L It has recently been 

shown that it is also necessary (Shamma, 1992). 
This condition, however, is not sufficient to 
guarantee robustness when perturbations of the 
type considered in this paper are present, i.e. for 
4 induced norm-bounded perturbations. In 
contrast, robustness in the presence of 4 
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Table 1. Comparisons between different robustness criteria 


Perturbation class 

M(W)<1 

inf ||/l-'M/t||„.<1 

R*« 

p(M)<l 

NLTV, bounded 

ncc 

nec and suff 

suff 

NLTV, bounded C-gain 

nec 

nec 

nec and suff 

NLTl, bounded 

nec 

suff 

suff 

NLTI, bounded C-gain 

nec 

nec 

nec and suff 

LTV, hounded ^^-mduced norm 

nec 

nec and suff 

suff 

LTV, bounded C^mduced norm 

nec 

nec 

ncc and suff 

LTI, bounded ^^-induced norm 

nec and suff 

suff 

suff 

LTI, bounded C induced norm 

ncc and suff 

suff 

suff 


induced-norm bounded perturbations does imply 
robustness to €2 induced-norm bounded pertur¬ 
bations. The relationship between the various 
robustness conditions is summarized in Table 1 
(In the table: nec, suff, respectively mean 
necessary and sufficient.) 

Most of the proofs concerning ju are found in 
the tutorial paper by Packard and Doyle (1992) 
and in Shamma (1992) for the time varying case. 
The proof that // is both necessary and sufficient 
in the case of LTI perturbations with bounded 
norm goes as follows: consider the C stability of 
for all ||A||,S1. Suppose that 
Af(^'^") = c with |r|^l. We need to show that 
there exists a LTI perturbation A with ||A||i^ 1 
such that A(e'®‘’) = 1/r. Let H be any system in 
satisfying = l/c and = 

The assertion can be verified by obtaining a 
solution to the model matching problem: 

inf ||//-(A-e^"^)(A-e-'^^«)G||p 

It can be shown that the solution of this problem 
has a value equal to l/|c|, and thus the existence 
of A is verified. The extension from the small 
gain theorem to ju is straightforward. It is also 
interesting to note that ju is both necessary and 
sufficient for general problems (Bamieh and 
Dahleh, 1993). 

In terms of robust controller synthesis, the 
controller must be chosen such that p{M) is 
minimized. The dependence of M on the 
controller is reflected through the Youla 
parameter, Q, since M can be expressed as 
(Francis (1987); Vidyasagar (1985); Youla et al. 
(1976)) 

M = M{Q) = T,-T^QT,. 

where the 7js depend only on Go- Because 
p(A/) = inf ||/? the robustness synthesis 

problem becomes one of finding 

inf inf ||«"’M(0)/?lli. 

Q Hlahle R eJV 


With Q stable and fixed, we have seen that 
picking the eigenvector associated with p(M(^) 
will yield the minimum value over all scalings in 
When R is fixed, we have an ^j-norm 
minimization problem. This problem and its 
solution will be discussed in the remaining part 
of the paper. So the approach which will be 
taken to solving the robustness synthesis 
problem is to start with an initial R e9t. For that 
R we find the optimal Q resulting from the norm 
minimization problem. We then fix that Q and 
solve for the optimal R associated with this new 
Q and so on. Since at each step the objective 
function gets smaller and smaller, and since it is 
bounded from below by zero it is guaranteed to 
converge to some value. Unfortunately, this 
value may not be the global minimum. If at that 
point, a satisfactory level of performance 
robustness has been reached, we can stop and 
use the final Q to construct the controller. 
Otherwise, the iteration process should be 
restarted with a different initial scaling matrix in 
9?. This scheme is similar to the so-called D-K 
iteration used in the p-synthesis technique 
(Doyle and Stein, 1981; Doyle, 1982). The main 
difference is that while the scales used for 
p-synthesis are frequency dependent and a 
convex optimization problem must be solved at 
each frequency, the scalings here are not 
frequency dependent and can be readily found 
by computing the eigenvector associated with 
p{M). Such a computation can be done very 
effectively using power methods, and no 
optimization problem need be solved to find the 
optimal scalings. 


8 SYNTHESIS OF THE CONTROLLER 
As stated earlier, the minimization problem 
is given by; 

n„= inf \\T,-T^QT,\U. (OPT) 

Q «»nblc 

In this section, we will show that this problem is 
equivalent to a linear programming problem in 
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infinite-diinensionai space. By utilizing the 
duality theory of Lagrange muJtjpIiers, it is 
shown that in some cases the linear programs are 
in fact hnite-dimensional and thus exact 
solutions for (OPT) can be obtained. For the 
rest of the cases, the duality theory provides 
upper and lower approximations of the optimal 
solution. The use of the Lagrange multiplier 
theory highlights the strong resemblance be¬ 
tween the /’i problem and standard linear 
programming problems. 

The admissible subspace ^ is defined as: 

Sf={Re €7^" I R = TOT, Q is stable}. 


The objective is to obtain a characterization of 
the feasible set y. Notice that can be uniquely 
determined from the equality A, ^ = As 

was shown in DahJeh and Pearson (1987) and 
McDonald and Pearson (1991) the choice of /?,| 

IS constrained by the zeros of that are 

inside the unit disc. There is only a finite number 
of such zeros, and each zero is interpreted as a 
bounded linear functional on Jin. In the sequel, 
we use the following terminology. 

Definition 1. A transfer function G interpolates 
T 21 , t,, if is stable. 


The €i problem can be interpreted as a distance 
problem: Find an element in the subspace Sf 
which is closest to the fixed element T,. where 
distance is measured in the /’i-norm. Previous 
work (Dahleh and Pearson, 1987, 1988a) used 
the duality theory for distance problems to arrive 
at a solution for (OPT). Here we take an 
alternate approach using Lagrange multiplier 
theory, which is in fact more intuitive and 
transparent, to arrive at similar conclusions. 

8 .1. Characterization of the subspace S 
In the discussion below, it is assumed that T 
has full column rank = n 2 , and has full row 
rank == n^. It is evident that this captures the 
most general situation since if either of these 
conditions docs not hold, we can perform 
inner-outer factorizations on and and 
absorb the extra degree of freedom in Q. Also, 
it is assumed that there exist n^ rows of T and n^ 
columns of which are linearly independent for 
all A on the unit circle. This assumption 
simplifies the exposition although it is not 
necessary. In general, it is enough to assume the 
above for one point on the unit circle (Staffans, 
1990). Under this assumption, T 2 and % can be 
written in the following form without loss of 
generality (possibly requiring the interchange of 
inputs and/or outputs): 



% = {%, tzl 

where has dimensions x rij and is 
invertible and T 31 has dimensions and is 

invertible. Moreover, and T,, have no 
transmission zeros on the unit circle. Thus 
can be written: 



The motivation for this terminology stems 
from the fact that for to be stable, (5 

must have zeros at the same locations and 
directions as the zeros of T^i and T,|. Each zero 
is in fact a bounded linear functional that 
annihilates the element G, and thus has a 
representation inside the dual space of with 
the appropriate dimension. If these functionals 
are inside Co, then we can view G as the 
annihilator in the dual of Co- For example, let 
G(fl) = 0, where G is SISO, and |tf|<l. By 
definition of (5, we have G(fl)= E g{k)a^-fl. 

k^i) 

Define z„ = (l, a", . . .) e Co, then the inter¬ 

polation condition can be expressed as 
(z„, G) =0. If fl is a complex number, then two 
functionals are defined, the real of z„ and the 
imaginary of z,,. The multivariable case carries 
more details, but the basic idea is the same (see 
Dahleh and Pearson, 1987; McDonald and 
Pearson, 1991). 

The choice of is constrained further so that 
the rest of the equations are still consistent, 
which in turn dictates a set of constraints on the 
rest of the elements of R. Define the following 
coprime polynomial factorizations: 


7^2^21' — D2 


(3.3) 


Using these definitions, we state the following 
result characterizing the feasible set y for this 
case (McDonald and Pearson, 1991). 


Theorem 3. Given T 2 , T, with the assumptions 
as above, and ^ e A, there exists ^ e A 
satisfying R = if and only if: 


(i) 

(-A 


(ii) 

(^M 

ti 

0 

(iii) 

interpolates and 
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The conditions shown in parts (i), (ii) are 
convolution constraints on the sequence R. 
The interpolation condition in the last part can 
be tightened, since only the common zeros of fzi 
and lAc^d to be interpolated. 

Tile discussion above shows that the charac¬ 
terization of y can be summarized by defining 
two operators, 

and 

where j, r are some integers. The first operator 
captures the interpolation constraints, and thus 
has a finite dimensional range, and the second 
captures the convolution constraints. These two 
operators can be constructed in a straightfon^'ard 
fashion, bookkeeping being the only difficulty. 
To overcome this problem, it is helpful to think 
of /? as a vector rather than a matrix. To 
illustrate this, let the operator IT be a map from 


to ^ defined as follows: 

(WR)(k)- 

r2i{k) 

.ik)l 

The operator is a one-to-one and onto 
operator, whose inverse is equal to its adjoint (a 
fact used later). It simply rearranges the 
variables in R. The conditions on R presented in 
the above theorem can be written explicitly in 
terms of each component of R. 

To construct the first operator V, recall that 
each interpolation condition is interpreted as a 
bounded linear functional on R. By stacking up 
these functionals, the operator V is constructed. 
For example, suppose i 2 i and are SISO and 
both have N zeros in the open unit disc. Then 
the matrix V is given by V = V^W where 



0 

0 

0 

Re (fl!) 

0 


0 

0 

0 

Im (al) 

0 


0 

0 

0 

Re (ai/) 

0 

\lm (a5(,) 

0 

0 

0 

Im {a's) 

0 


Re(fl',) 0 

Im (a\) 0 • ■ • 

Re(ag 0 

Im 0 


y = o, 1.2, 


For the second operator, recall that convolu¬ 
tion can be interpreted as multiplication by a 
block Toeplitz matrix, in this case with finite 
memory since Nj, £> 2 , A ^3 and D-s all have finite 
length (the corresponding A-transform is a 
polynomial). By simple rearrangement, the 
operator is constructed with its image inside 
Hence ^ is given by ^ = S’lV, where y is a 
block lower triangular matrix. For a detailed 
example (see Dahleh and Pearson, 1988a; 
McDonald and Pearson, 1991). 

To illustrate the construction of the operator 
y, consider as an example the coprime-factor 


perturbation problem considered earlier for a 
SISO. The condition for stability robustness is 
given by Dahleh (1992) 

\\[V-QN -f; + C?Aflllr^l. 

where V, 0 are left coprime factors of a 
nominal controller. In this case, 7^ = 1 and 7^ = 
(N -Af). Since = with N, M 

coprime, the conditions in the above theorem 
translate to 

The matrix y is then given by: 



/(m(0) 

n(0)) 

0 

0 

0 

0 


I (m(l) 

n(l)) 

n(0)) 

0 

0 

0 


, (m(2) 

n(2)) 

"(D) 

(m(0) 

n(0)) 

0 

0 


i (m(3) 

«(3)) 

«(2)) 

Ml) 

n(l)) 

(m(0) 

n(0)) 0 


It is interesting to note that in this example the 
operator ^ captures all the conditions and no 
inteipolation conditions are needed. The condi¬ 
tions presented in the theorem can be redun¬ 


dant, and can be significantly reduced (Staffans, 
1990). 

The subspace y is then the set of all elements 
R^e7^^ such that rR=0 and «R = 0. Let 


AUTO Z9;l-E 
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b, = rTu b 2 =<€Ti. and -/i. The 

optimization problem can be restated as: 

inf ||<I>||, subject to T<l) = b,, ^•^ = b 2 , 

«e rf*" 

(OPT). 


8.2. Relations to linear programming 
It is well-known that in finite-dimensional 
spaces ^|-norm minimization is equivalent to 
linear programming. This turns out to be true in 
general, and can be justified as follows; Let 
- <I>^, with if>lik), <p^j(k) ^ 0. The norm 

is then replaced by the function max, E <p'ii(k) + 

/.* 

0j(k). Define the operator by 

= E tPifik). The following problem is 
/.* 

easily seen to be equivalent to (OPT); 
inf Ai, 

subject to 

+ 4 >^) — fie^O, 

r((p'-ip^) = b,, 

«(<!)’-O^) = 

^!/k), <Pi(k)^0, 

where e 6 R" and = (1, 1, . .. , 1). It is 
interesting to notice that if O', where 
restricted to finite impulse response sequences, 
the above problem is readily a linear program¬ 
ming problem. This will turn out to be a crucial 
observation in obtaining approximate solutions, 
as will be described later on. 


8.3. Lagrange multiplier formulation 
Let A" = <7*"’ X ^7’"’ X R and Z = R” X R" X 
R^ X €\ X Let Px, Pz denote the positive 
cones inside X, Z consisting of elements with 
nonnegative pointwise components. Define the 
op>erator jrf; AT —» Z, decomposed conformally 
with X and Z, and the vector b e Z as follows: 

/X X -e\ / 0 \ 

/ r -r 0 \ b, 

I -r r 0 I b = I -b, 




Define the linear functional c* = ^0^ on X. 

With these definitions, (OPT) becomes: 
inf (x, c*), 

Ax S b, 

X eX, X £ 0, 


where x e AT has the form 



All the inequalities should be interpreted with 
respect to the positive cones. It is interesting that 
with the above definitions, (OPT) looks very 
much like a standard linear programming 
problem, with the exception that the number of 
variables and constraints is infinite. 

The Lagrange multiplier is an element inside 
Z*, the dual space of Z which can be identified 
as: Z* = R” X R'' X R* X Co x (Here we have 
assumed that Z is equipped with the weak* 
topology, not the norm topology.) The dual cone 
Pz again consists of the nonnegative elements in 
Z*. The Lagrangian can be defined as 

L(x, z*) = {(x, c*) + {siix - b, z*)) 

= {(x, f* + .si*z*) - (b, z*)}, 

where j^*:Z*—*X* is the adjoint operator of 
si. From the theory of Lagrange multipliers 
(Luenberger, 1969), the minimum solution can 
be obtained by performing an unconstrained 
minimization of L, i.e. 

/i(,= sup inf {(x, c* -t- si*z*) — (b, z*)}. 

Z * 2:0 

This result is true despite the fact that the 
constraints do not satisfy the regularity condi- 
tions (Dahleh et ai (1993)). Clearly for /i,) to be 
finite, i.e. -oo, c* + jtfz* ^0 and hence the 
above infimization is achieved for ac = 0. This 
gives a dual formulation of (OPT) summarized 
as; 

Po= sup (b, -z*) 

z *&0 

subject to c* + si*z*^0. (DOPT). 

To evaluate this explicitly, let si*, z* be given 
by: 



subject to 


By direct substitution, (DOPT) is converted to 
/io = sup (b„ or, - oTj) -I- (bz. /8, 
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subject to 

+ ^*( 0 , - 02 ) + - ^ 2 )^ 0 , 

i -1 

«!. ^2. ftl, p2, ri^O. 

Finally, substituting a = Oi - 02 and ^ = ^1 - jSj 
we get 

/io = sup (i>,, a) + ( 62 , fi), 

subject to 

-^*rj^Va+ «*/3s^*t/, 

1,^0, (DOPT) 
oreR', /3eco. 

This dual formulation sheds a new light on the 
optimization problem. In our context, it will 
provide two important results; the existence of 
finite-dimensional duals for specific classes of 
problems, and the ability to construct suboptimal 
solutions that are within a prescribed e from the 
actual minimum. 

Commen/. 

The computation of the adjoint operators is 
quite simple once the operators are already 
constructed. Recall that V = V^W-, hence the 
adjoint operator V* = W~Wl. Similarly, *51?* = 

Matrix representations of the operator 
and its adjoint are obtained in a similar 
fashion. 

8.4. Exact solutions for a class of problems 

Let the space 5^ be characterized solely by 
interpolation conditions. This is the situation 
when both and f, have full row rank and 
column rank, respectively. In this case = 0 and 
f >2 = 0- The dual problem (DOPT) involves only 
a finite number of variables and thus it is a 
finite-dimensional problem. The constraints 
however are infinite. Since the elements of Y* 
were constructed from zeros inside the unit disc, 
the entries will eventually decay and only a finite 
number of the constraints are active. A bound 
on the number of such constraints can be derived 
(Dahleh and Pearson, 1987). The problem is 
now a standard finite-dimensional linear pro¬ 
gram, which can be solved exactly. The solution 
to the primal problem (OPT) can be constructed 
either by the alignment conditions, or by 
observing that the dual of (DOPT) is exactly the 
primal problem. 


8.5. Approximation 

In the sequel, we will assume that *i^Tl - bi is 
a finite impulse response sequence. This 
condition is equivalent to saying that there exists 
a FIR feasible solution for (OPT). If this 
condition is not satisfied, then the problem can 
be modified so that the condition will hold 
(Dahleh and Pearson, 1988a; McDonald and 
Pearson, 1991). Upper approximations of po can 
be readily obtained from the primal problem. 
Define fis as follows: 

Pn = min p, 

subject to 

^(4)' + < 1 , 2 ) SO, 

r(«I»‘- 4 )^) = 6 ,. 

«(«!>*-4>^) = ^ 2 . 

m). 0 ?Xfc)^O. 

0 ,/(fc) = O, 0 ?Xfc) = OV*>N. 

Since ^ is constructed from FIR sequences, this 
optimization will involve a finite number of 
variables and a finite number of constraints. It is 
evident that p^ is a non-increasing sequence 
satisfying po^ Pn for all N. Also, since feasible 
FIR solution exists, then p^ is finite for N large 
enough. Since FIR solutions are dense, it follows 
that ps-*Pf, as N—kxi. For each p^ a solution 
for the primal problem can be constructed. The 
difliculty with this procedure is that it is not clear 
how far the solution is from optimal at any given 
N. This will be overcome by presenting lower 
approximations of the problem. 

It is interesting to notice that the dual of this 
problem is obtained through truncating the 
constraints of the dual problem (DOPT). 
Another approximation obtained from the dual 
problem can be obtained by truncating the 
variables ^ € Cq (Dahleh, 1992; Staffans, 1990). 
Define as follows: 

pN = TnM (61, nr) + (bj, p). 

subject to 

m 

X rj, ^ 1, Tj ^0. 

1-1 

oreR*, Pecft, /8(k) = 0Vit>A/. 

It is evident that ps-fto and that Po as 
The former assertion is due to the fact 
that the new problem has fewer degrees of 
freedom, and the latter is due to the fact that 
finite sequences are dense in Cg. The above 
problem is not immediately a finite-dimensional 
problem—the constraints due to the operator V* 
are still infinite; however, only a finite subset of 
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these are active as it was in the case where was 
equal to 0. A complete discussion of the 
computation of this problem is given in Staffans 
(1SI90). Clearly, there is no feasible solution for 
the primal problem for any of the ju^s. 

8 .6. Computations 

In the case where = 0, the minimization 
problem is solved exactly. In all other cases, only 
approximate solutions are obtained through 
obtaining upper and lower approximations of fi^. 
A third method of solution known as the Delay 
Augmentation method was proposed in Diaz- 
Bobillo and Dahleh (1992). The basic idea in 
there is to augment the matrices T 2 and Tj with 
delays so that they become square matrices. A 
square problem is then solved as the number of 
delays is increased. This method has the 
following definite advantages over the methods 
proposed earlier; (1) it involves solving one 
linear program at each iteration that provides 
upper and lower bounds and feasible solutions, 
(2) it only requires computing interpolation 
conditions, which can be done using matrix 
computation, (3) it captures in a precise way the 
exact order of the optimal controller, and does 
not cause order inflation due to approximations, 
(4) in many special cases, the exact optimal 
controller can be computed for the non-square 
problem. 

The general Lagrange Multiplier approach 
allows the above theory to be extended to 
handle arbitrary problems with linear con¬ 
straints, via solving linear programming prob¬ 
lems. Details on exact and approximate solutions 
of such problems including mixed and f, 
problems are reported in Dahleh et al. (1992). 

To obtain fast solutions that do not necessarily 
capture the structure of this problem, one can 
follow the approach in Boyd and Barratt (1991) 
in which one seeks direct FIR solutions for Q. 
This problem can be posed as a linear 
programming problem which can approximate 
the actual solution arbitrarily closely. However, 
unless one invokes duality, the difference 
between the approximate and actual value of jUo 
remains unknown. No information can be 
inferred about the structure of the optimal 
controller. 

It is interesting to note that exact solutions for 
special problems with # 0 have been con¬ 
structed in Diaz-Bobillo and Dahleh (1993,1992) 
and Staffans (1991). Although existence of 
^ 1 -optimal solutions is guaranteed (under mild 
conditions, namely no interpolations on the unit 
circle), it is not known whether these solutions 
are rational or not. If ^ = 0 optimal solutions 


are FIR, and hence rational. The general case is 
still an active area of research. 


9 CONCLUSIONS 

This paper gives an overview of the problem 
of synthesizing optimal controllers to deliver 
performance specifications in the time domain, 
in the presence of plant uncertainty and bounded 
but unknown exogenous inputs. A general 
framework for the robust performance problem 
is presented from which necessary and sufficient 
conditions are derived. These conditions were 
related to the spectral radius of a matrix 
constructed from the configuration of the closed- 
loop system. Alternate equivalent conditions are 
also discussed in terms of linear matrix 
inequalities. These conditions are in turn used in 
the synthesis problem, which requires the 
solution of an optimal control problem. A 
solution of this problem using the duality theory 
of Lagrange multipliers is used. This approach 
highlights in a non-trivial way the relations 
between optimization problems for infinite¬ 
dimensional systems and infinite linear program¬ 
ming problems. In fact, the solutions presented 
exploit the problem structure and do not rely on 
a general theory for solving infinite linear 
programming problems, since such a theory does 
not exist. 

This paper discusses only discrete-time prob¬ 
lems. The interest in discrete-time systems stems 
from the fact that most controllers these days are 
digital controllers and are interfaced with the 
continuous-time plant through A/D and D/A 
converters. A better formulation should have a 
hybrid system consisting of both continuous- and 
discrete-time dynamics. Such systems have 
recently received considerable attention from the 
control community and are known as sampled- 
data systems. A formulation of the sampled- 
data problem can be found in Bamieh et al. 
(1992), Dullerud and Francis (1992), Khammash 
(1992) and Sivashankar and Khargonekar (1991) 
in which it is shown that synthesizing a digital 
controller for a continuous-time plant can be 
done by solving a purely discrete-time problem. 
This motivates the earlier discussion. 

There are other related problems that are not 
discussed in this paper. The problem of 
designing controllers for tracking a specific 
trajectory is an important problem and was 
solved in Dahleh and Pearson (1988b). The 
problem of finding conditions for the robustness 
of time-varying system in the presence of 
structured uncertainty has been solved in 
Khammash (1992). The problem of determining 
the merits of time-varying compensation vs 
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time-invariant compensation was studied in 
Khammash and Dahleh (1993). The synthesis 
approach has been extended for periodic and 
multi-rate sampled plants (Dahleh et aL, 1992). 
Also, this theory was successfully incorporated 
as part of an adaptive control scheme, in which 
the stability of the closed loop system was 
guaranteed for a larger set of plant uncertainty 
(Dahleh and Dahleh, 1990; Voulgaris et ai, 
1993). Finally, a case-study for the applicability 
of this theory was reported in Dahleh and 
Richards (1989) in which a controller was 
designed for a model of the X -29 aircraft. 

A pressing research problem is the under¬ 
standing of the structure of the optimal €i 
controllers. Such an understanding will not only 
add insight into the problem, but will also offer 
simpler ways of computing the optimal solution. 
This has been the case for the and H 2 
problems. Some interesting results in that 
direction are reported in Staffans (1991) in which 
exact solutions for the infinite-dimensional linear 
programs arising in some special non-square 
problems have been computed. Also, it was 
shown in Diaz-Bobillo and Dahleh (1991) that 
optimal solutions may require a dynamic 
controller even though all the states are 
available. The existence of some separation 
structure on the problem (similar to that of 
the //* problem (Doyle et ai, 1989)) is still 
under investigation. 

Another important research direction is the 
synthesis problem by exactly minimizing the 
spectral radius function, rather than the iterative 
scheme suggested. The iterative scheme is 
guaranteed to converge only to local minima and 
hence there is a need for looking for another 
approach for minimizing this function. 

In this paper, a comparison between the 
spectral radius function and is sketched. At 
this point, it is not known whether there exist 
examples in which the two methods exhibit 
extreme behavior. Research in that direction is 
currently in progress. 

REFERENCES 

Bamieh, B. A. and M. Dahleh (1993). On robust stability 
with structured time-invariant perturbations. Syst. Conir. 
Lett. (Submitted.) 

Bamieh, B. A., M, A. Dahleh and J. B. Pearson (1992). 
Minimization of the L"-induccd norm for sampled-data 
systems. IEEE Trans. Aut. Control. (To appear.) 

Boyd, S. P. and C. H. Barratt (1991). Linear Controller 
Design: Limits of Performance. Prentice Hall, Englewood 
Cliffs, NJ. 

Boyd, S. P. and J. C. Doyle (1987). Comparison of peak and 
RMS gains for discrete-time systems. Syst. Contr. Lett., 9, 
1 - 6 . 

Dahleh, M. A. (1992). BIBO stability robustness for coprime 
factor perturbations. IEEE Trans. Aut. Control, 37, 
357-355. 


Dahleh, M. and M. A. Dahleh (1990). Optimal rejection of 
persistent and bounded disturbances: continuity properties 
and adaptation. IEEE Trans. Aut. Control, 35, 687-696. 

Dahleh, M. A. and J. B. Pearson (1987). optimal 

feedback controllers for MIMO discrete-time systems. 
IEEE Trans. Aut. Control, 3Z, 314-322. 

Dahleh, M. A. and J. B. Pearson (1988a). Optimal rejection 
of persistent disturbances, robust stability and mixed 
sensitivity minimization. IEEE Trans. Aut. Control, 33, 
722-731. 

Dahleh, M. A. and J. B. Pearson (I988b). Minimization of a 
regulated response to a fixed input. IEEE Trans. Aut. 
Control, 33, 924-930. 

Dahleh. M. A. and Y. Ohla (1988). A necessary and 

sufficient condition for robust BIBO stability Syst. Contr. 
Lett., 11, 271-275. 

Dahleh, M. A. and D. Richards (1989). Application of 
modern control theory on a model of the X-29 aircraft, 
LIDS. MIT, report No. LIDS-P-1932. 

Dahleh, M. A., P. Voulgaris and L. Valavani (1992). 

Optimal and robust controllers for periodic and multi-rate 
systems. IEEE Trans. Aut. Control, 37, 90-99. 

Dahleh, M. A., N. Elia and 1. Diaz-Bobillo (1993). 

Controller design via linear programming. (In 

preparation.) 

Dcodharc, G. and M. Vidya.sagar (1990). Some results on 
-optimality of feedback control sy.stems: the SISO 
discrete-time case. IEEE Trans. Aut. Control, 35, 
1082-1085. 

Desocr, C. A. and M. Vidyasagar (1975). Feedback Sy,stems: 
Input-Output Properties. Academic, NY. 

Doyle, J. C. (1982). Analysis of feedback systems with 
structured uncertainty. lEE Proceedings, 129, 242-2.30. 

Doyle, J. C and G. Slcin (1981). Multivariable feedback 
design: concepts for a classical/modern synthesis. IEEE- 
Trans. A C, 26, 4-16. 

Doyle, J. C., K. Glover, P. P, Khargonckar and B. A. 
Francis (1989). State space solutions to standard and 
control problems. IEEE Trans. Aut. Control, 34, 
831-847. 

Diaz-Bobillo, I. and M. A. Dahleh (1993).- Slate feedback 
optimal controllers can be dynamic, LIDS, MM, Report 
No. LrDS-P-2051. Systems and (ontrol Letters. (To 
appear.) 

Diaz-Bobillo, I. and M. A, Dahleh (1992). Minimization of 
the maximum pcak-to-peak gain: the general mulublock 
problem. IEEE Trans. Aut. Control. (To appear). 

Dullerud, G. and B. A. Francis (1992). J/'' analysis and 
design of sampled-dala systems. IEEE Trans. Aut. 
Control, 37, 436-446. 

Franci.*!, B. A. (1987). A Course in Control Theory. 
Springer-Verlag, Berlin. 

Glover, K. and D, McFarlanc (1989). Robust stabilization of 
normalized coprime factor plant description with 
bounded uncertainty. IEEE Trans. Aut. Control, 34, 
831-830. 

Horn, R. and C. Johnson (1985). Matrix Analysis. 
Cambridge University Press. 

Khammash, M. (1992). Necessary and sufficient conditions 
for the robustness of time-varying systems with applica¬ 
tions to sampled-dala systems. IEEE Trans. Aut. Control. 
(To appear.) 

Khammash, M. and J. B. Pearson (1990). Robust 
disturbance rejection in ^'-optimal control systems. Syst. 
Contr. Lett., 14, 93-101, 

Khammash, M. and M. Dahleh (1993). Time-varying control 
and the robust performance of .systems with structured 
norm-bounded uncertainty. Automatica (To appear.) 

Khammash, M. and J. B. Pearson (1991). Performance 
robustness of discrete-time systems with structured 
uncertainty. IEEE Trans. Aut. Control, 36, 398-412. 

Khammash, M. and J. B. Pearson (1991). Robustness 
synthesis for discrete-time systems with structured 
uncertainty. Proc. of 1991 ACC, 2120-2124 

Khammash, M. and J. B. Pearson (1992). Analysis and 
design of robust performance with structured uncertainty. 
Syst. Contr. Lett. (To appear.) 



56 


M. A. Dahleh and M. H. Khammash 


Luenberger, D. G. (1969). Optimization by Vector Space 
Methods. John Wiley, NY 

McDonald, J. S. and J. B. Pear$on (1991) ^,oOptimal control 
of multivariable systems with output norm constraints. 
Automatica, 27, 317-329. 

Mendlovitz. M. A. (1989). A simple solution to the 
optimization problem. SysL Contr. Lett., 12 , 461-463. 

Packard, A. and J. C, Doyle (1992) The complex structured 
singular value. Automatica, (To appear.) 

Shamma, J (1992). Robust stability with time-varying 
structured uncertainty. IEEE Trans. Aut. Control. 
(Submitted.) 

SiVBshankar, N. and P. Khargonekar (1991). i^-induced 
norm of sampled-data systems. Proc 1991 CDC. 

Staffans, O. J. (1990). On the four-block model matching 
problem in Helsinki University of Technology, Espoo, 
Report A289. 

Staffans, O. J. (1991). Mixed sensitivity minimization 
problems with rational -optimal solutions. J. of 
Optimization Theory and Applications, 70, 173-189. 

Vidyasagar, M. (1981). Input-output analysis of large-scale 


Vidyaaagar, M. (1981). Input-output analysis of large-scale 
interconnected systems. Lecture Notes in Control and 
Information Sciences. Springer-Verlag, Berlin. 

Vidyasagar, M. (1985). Control Systems Synthesis: A 
Factorization Approach. MIT press, Cambridge, MA. 

Vidyasagar, M. (1986). Optimal rejection of persistent 
bounded disturbances. IEEE Trans, Aut. Control, 31, 
527-534. 

Voulgaris, P., M. A. Dahleh and L. Valavani (1993). 
Slowly-varying systems: C to C performance implications 
to adaptive control. Automatica. (Submitted.) 

Willems, J. C. (1971). The Analysis of Feedback Systems. 
MIT Press, Cambridge, MA. 

Youla, D. C., H. A. Jabr and J. J. Bongiorno (1976). 
Modern Wiener-Hopf design of optimal controllers—part 
2: the multivanable case. IEEE Trans. Aut. Control, 21, 
319-338. 

Zames, G. (1981). Feedback and optimal sensitivity: model 
reference transformations, multiplicative seminorms, and 
approximate inverses. IEEE Trans. Aut. Control, 26, 
301-320. 



AuloiMtica, Vol. 29, No. I, pp. 57-70, 1993 
in Oreal Britain. 


0005-1090/93 M.QO fO.OO 
© 1992 Pergnmon Pran Lid 


Mixed ^2/ ^OD Control for Discrete-time Systems 
via Convex Optimization* * * § t 

ISAAC KAMINER 4 PRAMOD P. KHARGONEKAR* and 
MARIO A. ROTEA§ 

A mixed problem for discrete-time systems is solved by converting it 

into a convex optimization problem over a finite-dimensional space. 
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Ab§tnct —A mixed control problem for discrete-time 

systems is considered, where an upper bound on the ^ 
norm of a closed loop transfer matrix is minimized subject to 
an constraint on another closed loop transfer matrix. 
Both state-feedback and output-feedback cases arc con¬ 
sidered. It is shown that these problems are equivalent to 
hnite-dimensional convex programming problems. In the 
state-feedback case, nearly optimal controllers can he chosen 
to be static gains. In the output feedback case, nearly 
optimal controllers can be chosen to have a structure .similar 
to that of the central single objective controller. In 
particular, the state dimension of nearly optimal output- 
feedback controllers need not exceed the plant dimension. 

1. INTRODUCTION 

Design of control systems almost invariably 
involves tradeoffs among competing objectives. 
It is often the case that the controller is required 
to meet several different performance and 
robustness goals, and all of these cannot be met 
simultaneously. For example, it is intuitively 
clear that to obtain a greater robust stability 
margin, it is likely that the performance of the 
control system needs to be compromised. In 
classical single loop feedback design, these 
tradeoffs are performed in terms of the (open) 
loop transfer function. For instance, stability 
margins in terms of either the phase/gain 
margins or the distance of the Nyquist plot to the 
critical point are traded off against disturbance 
rejection at low frequencies. Clearly, it is 
important to develop analytical tools to help the 
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designer understand how the various competing 
objectives conflict with each other. From this 
point of view, one should postulate the 
controller synthesis problem as the problem of 
studying tradeoffs among competing objectives. 
For a more detailed discussion of multiobjective 
controller synthesis as well as additional 
references, see Boyd and Barratt (1990), Dorato 
(1991), Khargonekar and Rotea (1991b), and 
Rotea (1990). 

The subject of this paper is a certain 
constrained optimal controller synthesis 
problem—the so-called mixed synthesis 

problem. Mixed problems can be 

motivated in many different ways. As a matter of 
fact, there are many different mixed 
problems. These problems are one way of 
analytically formulating the issue of tradeoffs in 
control system synthesis. 

To give a brief description of the various 
mixed problems, consider the feedback 

system shown in Fig. 1. Let i = 0, 1 , denote 
the closed loop transfer matrix from the 
exogenous input w, to the controlled output z,. 
One mixed X 2 / problem is to find an 
internally stabilizing controller ^ which mini¬ 
mizes ||T^,k>J |2 subject to the constraint 
* < 7 . TTiis problem is equivalent to a 
problem of optimal nominal performance subject 
to a robust stability requirement. To be more 
specific, a controller that solves this mixed 
problem will ensure that the closed loop 
system is robustly stable to all finite-gain stable 
(possibly nonlinear time-varying) perturbations 
A, interconnected to the system by iVi = Azi, 
such that ||A||*^ 1 / 7 . On the other hand, 
\\T^Hr ,\\2 represents the steady-state variance of 
the output Zq when w, = 0 and is white noise 
with unit intensity. Currently no analytic 
solution to this mixed ^ problem is 
available. Rotea and Khargonekar (1991a) have 


SI 



58 


I. Kaminer et al. 



obtained some sufficient conditions for the 
solvability of this mixed problem in the 

state-feedback case. While the results in Rotea 
and Khargonekar (1991a) have been obtained 
for continuous-time systems, many of them can 
be extended quite easily to discrete-time 
systems. 

A somewhat different mixed ^1^^ problem 
formulation was introduced by Bernstein and 
Haddad (1989) to combine the LOG and 
controller design theories. This problem is 
restricted to the case Wo = 1 ^ 1 =:^. Instead of 
minimizing they considered the mini¬ 

mization of an “upper bound" for ||rs„,vll 2 f 
subject to the constraint l|7’i,»,||»< y. Recently, 
many papers have appeared that address this 
mixed controller design problem, see for 

example, Bambang et al. (1990), Doyle et al. 
(1989a), Khargonekar and Rotea (1991a, b), 
Mustafa and Bernstein (1991), Steinbuch and 
Bosgra (1991), Yeh et al. (1992), Zhou et al. 
(1990) and the references cited therein. 

The mixed 9CilK^ problem considered in this 
paper not only provides a more tractable 
approach to the problem of minimizing nominal 
performance subject to a robust stability 
constraint, but it can also be interpreted as an 
optimal performance problem. Indeed, as shown 
by Zhou et al. (1990) in the continuous-time 
case, the “dual" of the auxiliary cost or upper 
bound of Bernstein and Haddad (1989) is closely 
related to a system gain from a combination of 
power and white noise exogenous inputs to the 
power of the regulated output. 

In this paper, we focus on the discrete-time 
version of the mixed problem as 

formulated by Bernstein and Haddad (1989). 
While much work has been done on the 
coptinuous-time case, the discrete-time case has 
received much less attention. Indeed, at this 
time the discrete-time analog of the coupled 
Riccati equations obtained by Bernstein and 
Haddad (1989) for the continuous-time case are 
not available for output-feedback problems. 
Some results for discrete-time mixed 3 ^ 2 / 
problems have been obtained by Bambang et al. 
(19W), Haddad et al. (1991), and Mustafa and 


Bernstein (1991). Mustafa and Bernstein (1991) 
have considered the static state-feedback prob¬ 
lem and derived sufficient conditions for 
optimality of a state-feedback gain. Bambang et 
al. (1990) and Haddad et al. (1991) have 
considered the static output-feedback problem. 
It seems that at this time no solution to the 
mixed X 2 I problem is available in the general 
dynamic output-feedback case. This is the 
primary motivation for this paper. 

Our approach is as the recent paper by 
Khargonekar and Rotea (1991a) where a convex 
optimization approach to the continuous-time 
mixed XjlX^ problem introduced in Bernstein 
and Haddad (1989) has been developed. A 
similar approach has been applied earlier by 
Bernussou et al. (1989) to a quadratic stability 
problem. The starting point is to take a 
“sub-optimal approach”. More specifically, with 
J denoting the “mixed X^IXJ’ performance 
measure (a precise definition of J is given in 
Section 2) let 

vC^) ;= inf (/; internally 

stabilizing and ||Tj,«,|U< y}, 

denote the optimal mixed X^IX.^ performance 
measure. Then we consider the following 
problem: 

“Compute vCiS) and given a>v{^), find an 

internally stabilizing controller ^ such that 
y. and ihe mixed X^IX.^ 

performance measure satisfies J < a". 

The main results of this paper are contained in 
Sections 4 and 5. The full-information/state- 
feedback case is considered in Section 4, while 
the output-feedback case is considered in Section 
S. It is shown that if the plant state is available 
for feedback, one can come arbitrarily close to 
the optimal mixed X^IX.^ performance measure 
using constant gain (i.e. nondynamic) state- 
feedback controllers. In other words, in the 
state-feedback case, static gain controllers offer 
the best possible performance. 

In the full-information feedback case (i.e. 
both the exogenous input and the system state 
are available for feedback) there is a significant 
departure from the continuous-time case. It 
turns out that in the discrete-time case, one 
cannot come arbitrarily close to the infimum by 
taking static state-feedback controllers. The best 
that one can do is to use static full-information 
controllers. As a consequence, this result is of 
little practical interest except that it is critically 
useful in dealing with the output-feedback case. 
This situation is similar to that in the single 
objective standard X^ control problem for 
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discrete-time systems as in Basar and Bernhard 
(1991), Iglesias and Glover (1991), Limebeer «r 
al. (1989), Liu et al. (1991) and Stoorvogel 
(1990). 

It is shown that in the state-feedback as well as 
the full-information case, the mixed 31^/311 
optimal performance v(») and a static gain 
controller that satisfies J<a (for any a> v(^) 
can be obtained by solving a finite-dimensional 
convex programming problem over a bounded 
set of real matrices. 

In the output-feedback case, it is shown that 
the mixed 31^/ 3K. control problem can be 
reduced to a full-information feedback problem 
for an auxiliary plant, which is obtained from the 
given plant by solving an (3IL filtering) algebraic 
Riccati equation. Thus, the output-feedback 
problem can be reduced to a finite-dimensional 
convex programming problem over a set of real 
matrices. It is shown that the output-feedback 
controllers can always be chosen to have a 
structure similar to that of the standard 
central controller. This implies that the order of 
(nearly) optimal output-feedback controllers 
need not exceed that of the generalized plant. 

While the approach taken here is somewhat 
similar to the approach of Boyd and Barratt 
(1990), in that they also reduce such controller 
synthesis problems to convex optimization 
problems, there are significant differences 
between our results and those of Boyd and 
Barratt (1990). In particular, we reduce the 
mixed controller synthesis problem to a 

convex optimization problem over a bounded 
subset of qxn and n x n symmetric real 
matrices, where q and n are, respectively the 
control input and the state dimensions. We 
accomplish this reduction of the problem without 
finite-dimensional approximations of the set of 
stabilizing controllers or frequency discretiza¬ 
tions. Consequently, a solution to our convex 
programming problem is a global solution to the 
mixed 3 ^ 2 /X, synthesis problem. This is 
considered to be an important contribution of 
our work. By comparison, the results of Boyd 
and Barratt (1990) applied to the present 
problem would reduce it to a convex optimiza¬ 
tion problem over the infinite-dimensional space 
of stable transfer functions. 

Next, we briefly introduce notation used in 
this paper. The symbol 0 denotes the empty set. 
Given a real matrix A, ||i4|| denotes its 
maximum singular value, tr (i4) denotes its trace, 
and A' its transpose. We will say that a square 
matrix A is asymptotically stable if all its 
eigenvalues are inside the open unit disk. For A 
and B real symmetric matrices, A>B 
(respectively A^B) iff the difference - B is 


positive-definite (respectively, positive-semi- 
definite). Linear time-invariant systems de¬ 
scribed by state space equations and are denoted 
by the script symbols, whereas the corresponding 
transfer matrices denoted by italics. For 
example, $ denotes a system with transfer 
function G. The Hardy spaces X and X consist 
of matrix valued functions that are square 
integrable and essentially bounded, respectively, 
on the unit circle with analytic extension outside 
of the unit circle. The norms on these spaces are 
defined in the usual way. 

2 . THE MIXED PERFORMANCE MEASURE 

In this section, we will define the mixed 
performance measure. This will then be 
used in setting up the controller synthesis 
problem in the next section. 

Let us begin by considering a finite¬ 
dimensional linear time-invariant discrete-time 
system J as shown in Fig. 2. 

Suppose that S' is internally stable with the 
discrete-time state-space model: 

(crjc)(/c) \ = x(k -I-1) = Fx{k) -I- Gw{k), 
z„(/c) = Hox{k) -f J„w{k), 
lzi{k) = HiX{k) + JiW(k), 

( 1 ) 

where the matrices F, G, //, and J, are real and 
of compatible dimensions, and F has all 
eigenvalues in the open unit disk. (In the sequel, 
we will not show the time variable k explicitly in 
system equations.) Let 



denote the transfer matrix from w to z ^ 

Let denote the controllability gramian of 
the pair (F, G), i.e. is the unique solution of 
the Lyapunov equation 

FL,F'+ GC = L,. (2) 

Then, as is well known, 

IIVIlHtr(HoL.//; + Vo). 

Let y>0 be given, and consider the transfer 
matrix In this paper, we will be interested 
in the 3iC norm bound ||7'j|,„||.< y. The 



Fig. 2. Diagram for the definition of the mixed 
performance measure. 
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following theorem summarizes some useful 
results on characterizing this norm bound. 


Theorem 2.1. Consider the internally stable 
linear time^invariant discrete-time system given 
by (1). Then the following statements are 
equivalent: 

(1) lirz.JU<y. 

(2) There exists a nonsingular matrix P such that 


fF C 

nr F 

G IfF"' 01 

LO 1 

JL/f./y 

y,/rJL 0 yj 


(3) There exists a real symmetric Y>0 such that 




Moreover, Y can be chosen to be the same as 
the one in item 4 below. 

(4) There exists a real symmetric V > 0 such that 
M(Y) ;= y^l - /,y; - //, YH[ > 0, and 
R{Y) := FYF' -Y + {FYH'i + C/i)Af 

x(W,yF'-t-y,G') + GG’<0. (5) 


Moreover, Y can be chosen to be the same as 
the one in item 3 above. 

(5) There exists a real symmetrical V ^ 0 such 
that 


SHY) :=yH- y,y; - H, YH\ > 0, and 
R{Y)-FYF'-Y+{FYH\-^GJ\)M-' 

x(/y,yF'+y,G') + GG’ = o, (6) 

and F + {FYH[ +GJ\)M~^Hi is asymptotically 
stable. (In fact, Y satisfying the above conditions 
is unique.) Moreover, if denotes a solution to 
either (4) or (5), then Y^f. 


Proof. The equivalence of items 1 and 5 can be 
found in Molinari (1975). The equivalence of 
items 1 and 4 follows from the equivalence of 
items 1 and 5 and a standard small perturbation 
argument. The equivalence of items 3 and 4 
follows from simple algebraic manipulations and 
the Schur complement formula. Setting Y: = 
(F'F)“' yields 2=J>3, and setting P = Y~''^ gives 
3^2. Finally, in item 5, the inequality Y:sY 
follows from Ran and Vreugdenhil (1988). ■ 

Now suppose ||7 'i,h,||o,< y. Let y denote the 
unique real symmetric matrix that satisfies 
condition 5 in Theorem 2.1. Then, from the 
definition of the controllability gramian and 
Theorem 2.1, it follows that 

0^L.sy. (7) 

Note that this is the best possible upper bound 


for the controllability gramian that may be 
defined in terms of the solutions to the various 
quadratic matrix inequalities in Theorem 2.1. 
Thus, 

II Jll = tr (HoUH'o + yoy;) ^ tr (M, YHi + JJi). 

The above inequality motivates the following 
definition of the mixed performance 

measure (or cost) J{Tw) for fhc linear 
time-invariant system S': 

y(7;j:=tr(/foy//i+y„yi). (8) 

The performance measure defined in (8) is the 
same as the one considered by Mustafa and 
Bernstein (1991), Bambang et al. (1990), and 
Haddad et al. (1991). (More precisely, this 
performance measure is one of the costs 
considered in Mustafa and Bernstein (1991).) 

It is easily seen that JiT^^) is only a function 
of the transfer matrix and does not depend 
on the choice of realization, as long as such 
a realization is internally stable. This j ustifies 
our notation. Also ||F^h.|| 2 S V7(^, and 
lim Vy(7;„) = ||r^,„||j. The mixed 

Y — 

performance measure J{T^,) is also a function of 
the parameter y. In the sequel y will remain 
fixed. Therefore, without loss of generality, we 
set 


y = i- 

for the remainder of this paper. Any other 
constraint level can be accommodated by simple 
scaling. 

The following result provides an alternative 
characterization for the mixed 
performance measure J{T^^) that will be useful 
for establishing some of the results in this paper. 
The proof of this result is very similar to the 
proof of Lemma 1.1 in Khargonekar and Rotea 
(1991a). For the sake of brevity, details are 
omitted. 

Lemma 2.2. Consider the stable system 5" 
defined in (1) and let denote the transfer 
matrix from w to z. Suppose that ||r;t,H.|U < L 
Let M(.), R{.) be given by (5), with y = 1. Then 

JiTJ = inf {tr {H^YH'o + JJi):Y = Y'>0 

such that Af(y) >0 and R{Y) < 0). 

3. THE MIXED CONTROL PROBLEM 

In this section, we formulate the controller 
synthesis problem to be solved in this paper. 
Consider the finite-dimensional linear time- 
invariant discrete-time feedback system depicted 
in Fig. 3, where ^ is the generalized plant, 
including weighting functions, and is the 
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controller to be designed. The signal tv denotes 
an exogenous input, while Zq and 2 ] denote 
controlled (i.e. regulated) signals. The signals u 
and y denote the control input and the measured 
output, respectively. The transfer matrices of the 
plant and the controller are denoted by G and 
C, respectively. Let 



denote the closed loop transfer matrix, where 
and are the closed loop transfer 

matrices from w to Zn and w to Z], respectively. 

Definition 3.1. Let ^ and % be the given plant 
and controller. The controller ^ is called 
admissible (for the plant ^ if internally 

stabilizes the plant The set of all admissible 
controllers for the plant ‘S is denoted by sS{‘S). 
Furthermore, we define 

In the above notation, stands for 

“admissible”. As is well known, if 

and only if ^ is stabilizable from u and 

detectable from y. Also, the subscript in 
stands for the infinity norm constraint on 
T 

r|H” 

Consider the feedback system shown in Fig. 3. 
Given a plant 'S and an internally stabilizing 
controller the mixed cost of the 

closed loop system is a function of the transfer 
matrix only. We will denote this transfer 
matrix by T^^iG, C) and define 

/(G, C) -JiT,UG. C)). 

to emphasize on which plant and controller these 
closed loop quantities depend. 

Following Khargonekar and Rotea (1991a), 
the sub-optimal mixed controller synthe¬ 

sis problem considered in this paper is defined as 
follows. 

The mixed control problem. “Calculate 

the optimal mixed performance measure 

v(«):=inf{/(G, C):'«€,rf.(»)}, (10) 


and, given any af>v(^J), find a contrcdler 

6 Ji/.CS) such that /(G, C) < a". 

In some cases involving state-feedback, it is 
natural to also consider memoryless, i.e. static 
controllers. In such a case, the mixed 
synthesis problem is defined in the following 
way. 

Definition 3.2. The set of static admissible 
controllers satisfying the % constraint is 
denoted by 

where q = dim (u) and p = dim (y). 

The optimal performance over all admissible 
memoryless controllers is 

vJ»):=inf(;(G,C):«ej^,.„(»)}. (12) 

In (11) and (12), the subscript “m” stands for 
memoryless controllers. 

The main results of this paper show that the 
computation of the optimal mixed 
performance (10), and the construction of a 
sub-optimal compensator, can be reduced to the 
convex optimization problem over a bounded 
convex subset of a space of real matrices. 

4 STATE AND FULL-INFORMATION FEEDBACK 
PROBLEMS 

In this section, we give a solution to the 
controller synthesis problem formulated pre¬ 
viously, for the state/full-information feedback 
case. Here full-information feedback means that 
both the state and the exogenous inputs are 
available to the controller. Even though, such a 
feedback scheme is not realistic from a practical 
point of view, the full-information results are 
instrumental for addressing the more general 
case of output-feedback. 

We will first ask the question whether the 
infimum of the mixed X^t X.T, performance 
measure over all dynamic full-information 
feedback controllers equals the infimum over all 
static state-feedback controllers. In the 
continuous-time case, the answer to this question 
is in the affirmative (Khargonekar and Rotea, 
1991a). However, in the discrete-time, the 
answer, in general, turns out to be in the 
negative. In fact, we will show that the infimum 
of the mixed X^/X^ performance measure over 
all dynamic full-information feedback controllers 
equals the infimum over all static full- 
information feedback controllers. Thus, the 
analogy with the continuous-time case breaks 
down in this sense. As mentioned in the 
Introduction, a similar situation also occurs in 
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the single objective standard control problem 
in the discrete-time case. 

On the other hand, if only the state of the 
system is available for feedback, then we will 
diow that the infimum of the mixed 
performance measure over all dynamic state- 
f^dback controllers equals the infimum over all 
static state-feedback controllers. 

Finally, we will show that the static state/full- 
information controller synthesis problems may 
be effectively solved by means of finite¬ 
dimensional convex optimization. 

Let us begin by considering the mixed 
synthesis problem defined in Section 3 for the 
following plants (see also Fig. 3); 

(1) State-feedback plant; the plant 9 is given by 
the state-space model 

ax = Ax + ByW + B^u 
<g Zo = CqX + Doxw ^ DmU 

Z] = CiX + DiiW + D|2« 

Ly =x. 

(2) Full-information plant; the plant ^ is given 
by the state-space model 


ax = Ax -f fliW + B2U 

Zb = CoX + Doi w + D 02 M 
Zi = C|X + DiiW + Diju 

= w']'. 


(14) 


In the sequel, we let G,f and Gfl denote the 
transfer matrices of (13) and (14), respectively. 
The subscripts “s/” and “fi" denote “state- 
feedback” and “full-information” structure, 
respectively. The only difference between the 
state-feedback and full-information plants is in 
the measurement equation. Note also that no 
assumptions on problem data, i.e. the matrices 
introduced in (13)-(14), are imposed. 


4.1. Reduction to memory less feedback 

Theorem 4.1. Consider the full-information 
plant 'Sfi defined in (14). Then 


0, then and v(%). Let 

e > 0 be given. From (10) it follows that there 
exists e J^.(^/i) such that 

J{Gfl,C)^y{%) + d2. (15) 

Let ^ be given by 

1 u = + Dc.x 4-DezW. ^ ^ 

The closed loop system corresponding to the 
interconnection of and ^ is given by: 

ar] = Fti + Gw 
z„ = H„r\ + JnW 
Lz, = //,T| +/,*v, 

where 


p.- PiDci 

B,C ,1 

L 

>1. J 



L 8.2 J 


^0 • “ [C() + A>2^c1 

^02 Up], 

A) • ~ Al + A 2 A2» 


Hi := [Cl + A 2 A 1 

Dx2C^, 

Jy : = Dyy + £^12^2- 



Since ^ e sdJfS/i), it follows that F is stable and 
l|7'2,H.(C/i, C)||„< 1. Using Theorem 2.1 and 
Lemma 2.2 we may now conclude that 
3y= r>0such that: 


( 1 ) M 

(2) RiY) := FYF' -Y+ {FYH\ + GJ\) 

X M~\H, YF' + /, G') + GG' < 0, 

(3) tr (//oTWi + V,',) ^y(G,,. C) -h e/2. 

(17) 

Now combining (15), and item 3 in (17), we get 

tr (HoT/f;, 4- W ^ v(G/0 e. (18) 

Using the matrix Y introduced in (17), we will 
construct a memoryless controller 3f for Let 
n = dim (x) and = dim (^) and partition Y and 
/{(y) according to the plant and controller 
dimensions, i.e. 


where sdJl,%i) and are as in (9) and 

(11), respectively. In this case, 

v(«yi) = v„(»^), 

where and Vm($i)) denote the mixed 

X 2 /X., optimal costs in (10) and (12), respec¬ 
tively. Furthermore, given any a>v{^f,). there 
exists a static full-information controller X e 
such that J(Gfi, K) < a. 



where dim(yi) = nXn, dim(yj) = n xn^, 
dim (ya) = rtc X n^, and similarly for F(y). Note 
that y*! > 0 and Ri < 0. Define the memoryless 
full-information controller X by 

u = KiX 4- K 2 W, 

where 


Proof. We only need to show that if # 


Kx-.= D,y + CX2yv\ K2-D^. (19) 
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The closed loop system resulting from the 
interconnection of and X is given by 

ax = F„x + G„w 
Zo^HomX +Jo„,W 

where F„:=A + B 2 Ku G„:= Bi + 82 X 2 , 

ffcvm — Co + DfiiKi, Jo„ := Dqi + Hi„ : = 

Ci + D^Ku /i„ := Dll + £>12/^2, and the gains 
Ki and K2 are given by ( 19 ). 

Define Q Since, y>0 it 

follows that Q>0 by taking Schur complement. 
Simple algebraic manipulations show that the 
“1-1 block” of RiY) satisfies 

R, = - y, + G„G'„ + B 2 C,qc:b '2 

+ {F„YH[„ + G„J[„ + B 2 C,QC',D[ 2 )M-' 

X (//,„y,f: -I- /i„flL + DnCMCB'i) < 0, 

( 20 ) 

where 

M = -/f,„y,/fL 

-D,2C,ec:Di2>0. (21) 

Since y >0, (20), (21) and the implication 4=^3 
of Theorem 2.1 imply that T, satisfies 



since Q > 0. 

Now using (22) and y, > 0, a simple Lyapunov 
argument shows that F„ is a stable matrix 
Further, from implication 3^1 in Theorem 2.1, 
it follows that ||y,,„(G;s,/C)||.< 1. Conse¬ 
quently, the memoryless controller defined in 
(19) satisfies X e 
It is easy to verify that 

Ho„YiHL + JomJL 

= H„YHi + JoJi - D^CMCID^. 
Since Q>0, we may now conclude that 
tr iHo„ y,//L + ^ tr (HoYHi y„y,',), 

which, together with (22), implies 

tr ^ v(«^) e. 

Using the last inequality, y|>0, the implication 
34>4 in Theorem 2.1, Lemma 2.2, and (18) we 


obtain 

v„i%)^JiGfi, K) 

s tr Y,H'^ /o«/L) s y{%) + e. 
Since e is arbitrary, we conclude that 

The last part of the theorem now follows from 
definitions. ■ 

Theorem 4.1 applies in the case where both 
the state and the exogenous inputs are available 
for feedback. This is rarely, if every, true in 
practice. The principal motivation for this result 
comes from the output-feedback case. As will be 
seen in the next section. Theorem 4.1 is a key 
result in the derivation of the output feedback 
solution. 

However, in applications one often comes 
across problems where the state vector is 
available for feedback. From this point of view 
the following results on the state-feedback case 
is much more useful. 

Theorem 4.2. Consider the state-feedback plant 

defined in (13). Then 

jrf„(^?,^) # 0 O J!^..„(»„) ^ 0, 

where jrf-CS,/) and are as in (9) and 

(11), respectively. In this case 

v(%) = v„{%), 

where v{%f) and denote the mixed 

XilX» optimal costs defined in (10) and (12), 
respectively. Furthermore, given any a> 
there exists X e such that J(G,f, K) < 

a. 

A proof of this result can be constructed from 
the proof of Theorem 4.1 by setting B^j and 
equal to zero in the definition of the controller ^ 
given in (16). Details are omitted for the sake of 
brevity. 

4.2. A convex optimization approach to static 
feedback problem 

In this subsection we will show that the mixed 
XzlX^ problem with state/full-information feed¬ 
back can be reduced to a convex optimization 
problem over a convex bounded set of real 
matrices. That is, given a>v„(‘S^), real 
matrices /f, and K 2 such that J{Gfi, [AC| K 2 ]) < a 
can be found by solving a finite-dimensional 
convex programming problem. We will consider 
the full-information case. The state-feedback case 
follows by taking K 2=^0 in the analysis below. 

With reference to the full-information plant 
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defined in (14), let n = dim(jr), 9 = dini(u), 
p = dim (iv). Let 2 denote the set of all real 
nxn symmetric matrices, and define 

(23) 

Note that £2 is a strictly convex open subset of 
X 2 X R"’''’. Given (W, Y, K 2 ) e £2 define 

f(W. Y. K 2 ) 

: = tr ((CoV + D,cW)y-‘(C„y + D^wy 
+ iD^n + D^K2)iDoi + DM')- (24) 
Given any (IV, Y, K 2 ) e £2, define 
LiW, Y, K 2 ) 

.J AY + B2W1 [ AY+B 2 W 1 ' 

■“Lc,y + D,2vyJ Lc,y + z),2wJ 
r b,^B2K2 ir b,^B2K2 ]'_fy oi 
L D„ + D, 2/^2 JLd„ + D, 2^2 J Lo li 

(25) 

Now consider the set of real matrices: 

4>(»^) := {(IV, y, #: 2 ) e £2; L(W, Y, K 2 ) < 0), 

(26) 

and the constrained optimization problem 
:= inf {/(W. y, K 2 ); (IV, y, K 2 ) e 

(27) 

Note that ^ 0 since / ^ 0 on £2, as can be 
seen from (24). We now state the main result of 
this subsection. 

Theorem 4.3. Consider the system defined in 
(14) with transfer matrix G^. Let be 

the set of static controllers defined in (11). Then, 

<„(»/i)^0»4»(«/i)^0. 

where 4>(9/i) is given by (26). In this case 

v„i%) = 

where v„(‘§fl) and V'(®/)) are as in (12) and (27), 
respectively. Furthermore, given any cr> 
there exists a triple (IV, Y, K 2 ) e <!>(%) 
such that the static full-information controller 

ji:;=[ivy-‘ a' 2 ]. 

satisfies 

Xesa^.„(%) and 

This result is a direct and straightforward 
generalization of Theorem 4.2 of Khargonekar 
and Rotea (1991a). Proof is omitted. 

In the remainder of this section we will show 
that the optimization problem defined in (27) is 
convex. 


Lemma 4.4. Let £2 denote the set defined in 
(23). The mapping/;£2-^/?*^ defined in (24) is a 
real-analytic convex function on £2. 

Proof. Using (24) / may be rewritten as 
/(W, y, K 2 ) = tr (C„yci) -I- 2tr 

-htr(Do2lVy-'lV'Di2) 

tr (Dfli ^ 02 ^ 2 )(^oi + R-iD'm)- 

In Khargonekar and Rotea (1991a) it was shown 
that the first three terms in this expression are 
convex in £2. Clearly, the term tr (Doi -i- 
Oq2K2)(/2qi ■+■ ^ 2 ^ 0 ^ convex in K 2 . The 
convexity of /(.) now follows. The fact that /(.) 
is real analytic follows from its definition. ■ 

Lemma 4.5. Let L:£2—denote the matrix¬ 
valued mapping defined in (25). Then L is a 
convex mapping. Consequently, the constraint 
set 4>(^;j) defined in (26) is convex. 

Proof. Let (W, y, K 2 ) e £2. Using (25), it is easy 
to see that L(1V, Y, K 2 ) can be rewritten as 



= fWY-^W'F' + GKK'G' 



where 


A' 

c\- 

(5:4 «■ 


.B '2 

D'J’ 

Lo„ 

D, 2 - 


lV:=[y' W']', K:=[I K^]'. 

By Proposition E.7.f in Marshall and Olkin 
(1979) the mappings (IV, y)-» Avy'W'F', 
and K—^GKK'G’, are convex on their domains 
(here y=y'>0). Since the maps (IV, y)—► 
[y' IV']' and K 2 —►[/ Ki]' are afihne linear, the 
convexity of L follows. Finally, the convexity of 
follows from the convexity of L. ■ 

Lemma 4.6. Consider the set 0($^) defined in 
(26). Assume that 0^2 has full column rank. 
Suppose that for all z inside the open unit disc, 
the system matrix 

p/-A fi2] 

L -C, D,J’ 
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has full column rank. Then the set is 

bounded. 


This contradicts (29), and therefore 

i: = I-CiYC\>0. 


(32) 


Proof. We need to show that there exist jrasitive 
constants mj, m 2 , m 3 <«> such that 

(W, y,/C2)g4»(»^)=>||W||sm„ 

m^rrn, 

Let (W, Y, K 2 ) € 4>(^/i). From dehnitions of 
4>(51^) and L{W, Y, K 2 ), it follows that 
0 , 3 /^ 2/^2052 <L which implies that || 0 , 2 /f 3 ||< 
1. Since 0,2 is full column rank there exists a 
constant m 3 < such that 

11/^211 sm3<« 

Now define the matrices 

/ -b2^o:2 

■ .0 i-d,2XD'J’ 

c,:=(/-o,3Jroi2)c„ 

A: = A-B2XD\2Cu 

where X := (OijOjz)''- Premultiply L by T and 
postmultiply it by 7'. After simple algebraic 
manipulations we obtain 

fii- 

To prove the boundedness of Y, we are going 
to use Theorem 2.1 of Section 2 and Theorem 
3.1 in Ran and Vreugdenhil (1988). To be able 
to use these theorems, wc first need to show that 
/-CiTCi> 0 . It follows from (28) that 
C, YC\ + DnXD Iz - / ^ 0, and .so I - C, YC\ > 
0. Now suppose there exists a vector y 0 e /?'' 
such that 

(/-C,TCl)y=0. (29) 

Multiplying the inequality CiYC\ + Dt 2 XD\ 2 - 
/ ^ 0 by y on the right and y' on the left, we get 
y'DijA'/./Jzy = 0- Since X is invertible, it follows 
that 

Dl 2 y = 0 and C',y = C\y. (30) 

Also by taking the (2, 2 ) sub-block of the 
inequality L 0, we get 


' u 
LO I-D„XD\, 


0 . (28) 


Using (32) and taking Schur complement of 
the (2,2) block in (28), it follows that 

R . = AYA'-Y-AyC[X-'C^YA'sO. (33) 

Now the rank assumption on the system matrix 
ensures that the pair (Ci.A) has no unobserv¬ 
able modes inside the unit disc. Using a simple 
extension of Theorem 3.1 in Ran and Vreugden¬ 
hil (1988) and the observability of the stable 
modes of (C,,y4), it follows that there exists a 
real symmetric matrix K_ (depending only on A, 
B and C,) such that -Ta YL, or Vs - Y. . Let 
m, = II Y_||, then since Y>0, ||y|| s m, <«>. 
Finally, from equation (31) it follows that 

(c, y -I- D,2W')y-‘(Ci y + d.zW)' < i 

4>|i(c,y + D,3W)y-'l<i 

=>y"‘'^(c,y-i-D,2W)(c,y 

-t-D,2W)'y-''^</ 

^llc.y-iD,2W||<ViryI- 

Since ||y||^mi and D 12 has full column rank, 
we conclude that there exists m 2 < <“ such that 

||lV||<m2<® ■ 

5 OUTPUT-FEEDBACK CASE 
In this section we will solve the synthesis 
problem defined in Section 3 for the output- 
feedback case. We will show that the problem 
can be reduced to solving one algebraic Riccati 
equation, and a convex optimization problem 
similar to the one in Section 4. In the following 
subsection we introduce a technical result that 
will be needed in order to prove the main 
theorem of this section. 

5.1, Preliminaries 

The next result is an extension of Redheffer's 
lemma (see, for example, Iglesias and Glover 
(1991) and Stoorvogel (1990)) to the mixed 
performance measure for discrete-time 

systems. 

Consider the feedback interconnection of Fig. 


/ - (C,y + DnW)Y- '(C, y + D.ziy)' 

-D.z/fz^CiDlzX). (31) 

Premultiplying (31) by y' and postmultiplying it 
by y yields 

y'(/-c,yc;)y>o. 

Since CJy = C[y, it now follows that 

0 = y'(/ - C, yei)y = y'(/ - C, YC\)y > 0. 



Fio. 4. System interconnection in Lemma 5.1. 




66 


1. Kaminer et al. 


4, where 


and 


arf ^Arf + B,»v + BjUj 
Caff 

9‘:-- z„-Sf,+ Vo 


= CiTj + DiiW' + D,2 Ui 

r = C 2 fj + £>21 w + I^Vi, 
f a* = i4jc + 


Q: = 


j Uo ~ + -A)^ 

[u, = CtX + Dtr. 


(34) 


(35) 


The matrices in the state-space equations (34) 
and (35) are real and of compatible dimensions. 
Let P and Q denote the corresponding transfer 
matrices, and partition them as 



Lemma S.l. Consider the feedback system 
shown in Fig. 4, where ^ and Q are given by (34) 
and (35), respectively. Let denote the closed 
loop transfer matrix from w to z = (zo, z,). 
Suppose that ^ is internally stable, and let 
denote the controllability gramian of the pair 
(A. [B, flal). i.e. 

AL^A' + BiB'i +8282 = 1 ,. (37) 

Suppose also that Du is square and nonsingular, 
that A — B 2 Di 2 Ci is a stable matrix, and 


[B, 

fldj 

Du 

.D21 

^'"1 +AL,\C\ Ci] = 0, (38) 

U22J 

Du 

Di 2 

][Du 

O. 2 I' 

i>2. 

D22 


D22J 


-^[^‘iLeLc; Ci] = /. (39) 

LC 2 - 


Then the following statements are equivalent: 

(i) The feedback system in Fig. 4 is 
well-posed, internally stable, and ||r„,„||.< 1. 

(ii) Pis internally stable and ||Qi|U< 1. 

If the above conditions hold, the mixed 

costs JiT,„) (with respect to Zo) and J{Q) (with 
respect to Uo)> ^re defined and they satisfy 

J(ZJ = UiC„L,Ci) 

+ 2tr(A„C2L,Ci)-hJ(P). (40) 

A continuous-time version of the above result 
is in Khargonekar and Rotea (1991a). The 
discrete-time case differs from the continuous¬ 
time case in one important aspect. In the 
continuous-time case the mixed ^ cost 
y(r,H,) of the feedback interconnection of ^ and 
Q, is the sum of the mixed cost of Q and 


the 51^ norm of the “1-1” block of the inner 
plant This decomposition fails in the 
discrete-time case; there is an additional cross 
term (which depends on Q) in (40). 

Proof. The proof of the equivalence of the 
statements (i) and (ii) can be found in Doyle et 
al. (1989b); Iglesias and Glover (1991) and 
Stoorvogel (1990). 

SuppMise now that either one of these 
statements is true. For simplicity, the rest of the 
proof will be done under the assumption that P 22 
is strictly proper, i.e. ^22 = 0 . Let ^p = [x' r}']' 
denote the state of the composite system. It is 
easy to check that the system resulting from the 
interconnection of ^ and P is given by 

aip = F^jf -t- Gw 

Zo = H„\lf+J„w (41) 

Z| = //| ^ + 7, w, 

where 

F =\ ^ 1 

A + B2D^C2y 

G=l 1 

■ Lb,+ 62^,02,]’ 

//o •” [Co, C(1 + D 0 C 2 ], 

[^17^1 C, -I-D 12 D, C 2 ], 

^11 + ^12^1^21- 

Note that because of internal stability, all 
eigenvalues of F are inside the open unit disc. 
Moreover, ||7 ;,k.|U < 1- 

In order to establish formula (40), we need to 
determine the stabilizing solution of the ARE 
corresponding to the condition ||Tj,k'IU < 1- That 
is, the real symmetric matrix Y such that 

M(y) :=/-/, 7; -w,y/f;>o. 

B(y) ;= FYF' -Y + {FYH\ + G7i)Af 

x(//,yF'-l-7,G')-l-GG' = 0, (42) 

and F -I- {FYH[ GJ\)M~^Hx is asymptotically 
stable. Since A is asymptotically stable ||Q||U< 
1, there exists a real symmetric matrix ^ such 
that 

A(y) :=AyA' -y + (Ay&i + 

x(^,yA'-l-A^') + B^' = 0, (43) 

and A -I- {AfC'i + stable. 

Let L, be given by (37) and set 



After some algebra and using equations (43) and 
(37)-(39), we obtain M(y) = D,2M(l^)D;2>0 
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and /i(y) = 0. Moreover, 
F + (FFf/; + G/;)W-*//, 

“L 0 


-flzorz'cj’ 


where the “• block” is not relevant. Since 
A -I- {AfC[ + 66 [)M-^Ci and A - fl^Drz'C, are 
both asymptotically stable, we conclude that the 
real symmetric matrix Y defined in (44) is the 
unique stabilizing solution of the ARE (42). 

Next we compute the mixed ^ 2 ! cost 
From equations (39), (41) and (44), it 
follows that 


R:=/-C,ec;-l-C,gCiK-‘C20C;>0. (48) 

and Q satisfies the following discrete-time 
algebraic Riccati equation: 

AQA'-Q^B^B\-[AQC '2 + B,D ’2 AQCl] 

(49) 

where 

Moreover, the matrix 


7(r, J = tr (HoYHi + V,',) = tr (C„L,C;,) 

+tr(C„?Ci+ 6 o/yi) 

+ tr (^()C 2 LcCo + CoLcC 2 ^o) 

= U(Q,LXi)+JiQ) 

+ 2tt0„C2L,Ci). m (45) 

5.2. Main result 

We now consider the output-feedback mixed 
controller synthesis problem. Suppose 
the plant ^ in Fig. 3 is given by the state-space 
model; 


( ax = Ax •¥ B^w B^u 
2,, = C(>x -h D()M 
Z| = CxX -f DiU 
Ly = C 2 X + D 2 W, 


(46) 


where all the matrices in (46) are constant real 
matrices of compatible dimensions. We will also 
make the following assumptions; 

(Al) The triple (C 21 A, B^) is stabilizable and 
detectable. 

(A2) For each complex number z, such that 
|z| = 1, the matrix 


rz/-A -Bx 
[ C 2 D, J' 


has full row rank. 

Note that we have also assumed that there are 
no feedthrough terms from w to z,, or u to y. 
Even though it is possible to include these terms, 
we have chosen not to do so, to keep the 
presentation as simple as possible. The results 
given below may be combined with those in 
Stoorvogel (1990) to obtain formulae for the 
most general case. 

Suppose there exists admissible controller ^ 
such that the closed loop system is internally 
stable and ||T^,h-|U< 1, i e. Then it 

follows (Stoorvogel, 1990) that there exists a 
(unique) real symmetric matrix Q^O such that 

V := C 2 QC 2 + D 2 D 2 > 0, (47) 


A-[AQC 2 + B,D :2 AQC[]GiQr (50) 

is asymptotically stable. 

Given a real symmetric matrix Q, define the 
auxiliary full-information system: 

fax^= (A + ZR-^Ci)x^ 

+ {AQC 2 + BiD 2 
+ ZR-'C,QC'2)V ''^r 

+ (82 + ZR-'Dt)u 
= :A^g + Bi,r + B 2 ^u, 

%{Q) = C,^* + C„gC5V-'^r-(-A.M 

i;, = R-^^C,Xg 

+ R"’'2C,gC2V“''^r 

. GigXg 4 “ DiigK 4 “ Di2gU 

y=K (51) 

where Z : = AQC\ - {AQC '2 + BtD' 2 )V-^C 2 QC[, 
and A, C„ D„ B 2 are given in (46). Let Gfl{Q) 
denote the transfer matrix from (w, u) to 
(Oo, U|,u) in (51). The next result gives the 
solution to the mixed problem for the 

case of output feedback. 

Theorem 5.2. Consider the plant ^ defined in 
(46). Suppose that Assumptions A1-A2 hold. 
Let be as defined in (9), and suppo.se that 

Then there exists a unique (real 
symmetric) matrix g ^ 0 that satisfies the 
conditions (47)-(50). Let %{Q) denote the 
auxiliary system defined in (51). Then the 
following statements hold: 

(1) The set of admissible controllers 
is nonempty, and the optimal mixed 
performance measure is given by 

v(«») = tr(C„gCi) 

- tr (c„gciv-‘c 2 gc;,) 4 - v(%(q)). ( 52 ) 


AUTO Z 9 :l-F 
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where v(‘Sfi{Q)) is the optimal mixed 
performance for the auxiliary full-information 
plant %{Q). 

(2) Given any a > there exists a static full 
information controller 

X:=K^x, + K2 W, (53) 

such that Xe and 

J(Gfi(Q), K)<a-tTiCM) 

+ tr(c„ociv-'c,ec;). 

(3) For any full-information controller X = 

[^,^ 2 ] ^ the dynamic output- 

feedback controller 


■ \u = H^ + Jy. 

where 


(54) 


F: = A+ZR-^C, + (B 2 + ZR-'Di) 

X (K, - - (AQC^ + B, D '2 

+ ZR-^CiQC 2 )V-'^C 2 , 

G := {B 2 + ZR''Di)K2 V -+ (AQC^i 
+ B^D 2 + ZR-'C^QC 2 )V-\ 
H:=Ky-K2 V-^^C2, 

J:=K2 V-'^, (55) 

satisfies 

and /(C, d") = tr (CoeQ 

-tr(CneCiK-*C20Q+./(G;r(e), K). 

To solve the output-feedback mixed ^ 2 !^^ 
control problem using Theorem 5.2 the following 
steps should be followed. 

(1) Check if is not empty. This can be 

done by applying the standard theory for 
discrete-time systems and solving two discrete- 
time Riccati equations as in Basar and 
Bernhard (1991), Iglesias and Glover (1991), 
Limebeer et al. (1989), Liu et al. (1991), and 
Stoorvogel (1990). If is not empty, let Q 

denote the unique solution to (47)-(50). 

(2) Construct the auxiliary full-information plant 

^/i(Q) defined in (51). Let e>0 be given. 
Solve the convex program (27), correspond¬ 
ing to to compute a full-information 

gain X = ^^ 2 ] G jrf. „,(^9js(Q)) such that 

+ With this full- 
information gain, construct the output-feedback 
controller “t in (5'^. Then ^ belongs to jif„(‘S), 
and satisfies J(G, C) s v(S) -I- e. 


Proof. It follows from Stoorvogel (1990), that 
the plant 'S given by (46) can be represented as 
the feedback interconnection shown in Fig. 5, 
where Q^O satisfies equations (47)-(50), and 



Fig 5 Equivalent representation of the transfer matrix G. 


the plants SP{Q) and ^(G) are defined by 


[ax, = iA- {AQC 2 + fl,Di)V-'C 2 )xp 
+ {B,-{AQC'2+B,D'2)V-^D2)w 

-ZR-'^^Vi^-.A^Xp 


^(Q) = 


+ BtpW + B 2 pVi 
Xq C|)Xp . Gf^pXp 

z, = (C,-C\QC;,V-'C 2 )Xp 
-C,QC 2 V-'D 2 W + R''^v, 


= :C,pXp + D,pW + D2pV, 

r=V-‘'^C 2 Xp + V-^^^D 2 W 
= :C2pXp + D2,pii’, (56) 


m)- 


Vo = + D„2gU 

Vi = CigXp + Dugr + DugU 

y = C 2 jr, V''\ 


(57) 


where the matrices in (57) are those introduced 
in the definition of %iQ) given by (51). In 
(Stoorvogel, 1990), it has also been shown that 
*t^ e Jif(^ if and only if internally stabilizes the 
interconnection of 3P{Q) and ^(Q). Routine 
algebra also shows that Q is the controllability 
gramian of the pair (Ap, [Bjp B 2 p\), and that 
^{Q) satisfies the hypotheses of Lemma 5.1. 

First, we show that 


v(«^(0))<v(»)-tr(C„eQ 

+ tr(C«eCiV-‘C2GCo), (58) 

and that part 2 of the theorem holds. Suppose 
ae> vC^. From the definition of v(^^) it follows 
that there exists a controller ^ 6 sij^ ^ such that 
J{G. C) < a. Apply the controller ^ to the 
interconnection of Fig. 5. Now using Lemma 
5.1, since e jtf„=(^?) we get that g 
Moreover, 

/(G. C) = J{H{Q), C)-htr(CoQCi) 

+ 2tT(D„D,C2QCo), (59) 

where Dc = C('») denotes the direct feedthrough 
term of the controller 
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Define the full-information controller 
C* = C[C2 v^% 

and apply the controller to the full- 
information plant %(C) defined in (51). Clearly, 
“if* e j/a,('Syj(Q)). Further, an easy calculation 
shows that 

J{Gfl{Q), C*) = JiHiQ), C) 2tr {D^,D,C^QC^) 
-l-tr(CoGCiV-‘C2eci). (60) 
From (59)-(60), we obtain 

C*)=J(G, C) - tr (CoQCi) 

-htr(CnGCiV^-'C20Co). 

Now we can apply Theorem 4.1 to the 
full-information plant % to conclude that there 
exists a static controller 3if e „(^S^(G)) such 
that 

v(C^(G))^i(G^(e), K)^J{G^{Q), C*) 

< a - tr (CqQC;,) -H tr (CoQCiV-'C.QC,,). 

This shows that part 2 of the theorem holds. 
Moreover, taking limit as or—»v(^S), we may 
also conclude that (58) holds. 

We now show that part 3 of the theorem 
holds, and that 

v(G/G))^v(C)-tr(CnGC,;) 

+ ix{CoQC.,V-'C^QCn). (61) 

Here the key is to synthe.size the given 
full-information control law u - + K 2 r 

using a dynamic feedback controller which uses 
only the output y for the system '^(Q). In this 
construction, we essentially invert the transfer 
function from r to y in 3^(Q). 

Define the dynamic controller 

^ (‘Wc = -t- Bi^V-''\y - C2X,) 

• 1 u = K,x, + K2V-''^{y-C2xX 

(62) 

where If, and K 2 are such that yc-.= 
[5ir, e j!l?„(^§yi(G)), and Ag, Bg and V are 
defined in (51) and (47), respectively. Note that 
the controller (62) is an “observer based 
controller” for the auxiliary plant ^(Q). Using 
this fact, and the stabilizing property of Q, it is 
easy to see that ^ e j!/(5(f(G)), and 

T,„rm(2), O= r,.,,(G^m K). 

Thus, 6 si43€(Q)). 

Applying the controller to the interconnec¬ 
tion of Fig. 5, it follows from Lemma 5.1 that 
Furthermore, calculations as in first 
part of the proof imply that 

v(‘^)s/(G, C) = ti{CoQCo) 

- tr (CoGCiV-'QGQ + J{G4Q). K). 


This shows that part 3 holds. Moreover, by 
taking infimum over all full-information control¬ 
lers 3ir 6 may also conclude that 

(61) is satisfi^. Now equations (58) and (61) 
together prove part 1 of the theorem. ■ 

6. CONCLUSIONS 

In this paper we have considered a (sub- 
optimal) mixed control problem for 

discrete-time systems. This synthesis problem is 
well motivated since it represents a problem of 
(LQG) disturbance attenuation, as measured by 
the mixed ^13^ performance measure, subject 
to a robust stability constraint. 

We have shown that when the state of the 
plant, or the state and the exogenous input, is 
available for feedback, memoryless feedback 
gains offer the best possible performance. The 
optimal mixed performance, with state or 

full information feedback, was shown to be given 
by the value of a finite-dimensional convex 
program. This means that there are efficient 
numerical methods to compute the optimal 
performance, and a nearly optimal feedback 
gain. The reader may hnd an excellent 
description of some of these algorithms in Boyd 
and Barratt (1990). An ellipsoid algorithm has 
been developed in Rotea (1991) for a problem 
similar to the one considered in this paper. 

In the case of output-feedback., it is shown 
that mixed controllers can be chosen to 

be a combination of an filter, and a 
full-information gain for the mixed 9i^/ 
synthesis problem of a suitably constructed 
auxiliary plant. Thus, the output-feedback 
problem is no more difficult than the full- 
information problem. This appears to be the first 
complete solution to the mixed 3(^2 /problem 
with dynamic output-feedback for discrete-time 
system. 

Finally, the results in this paper may be 
combined with those of Rotea and Khargonekar 
(1991b) to solve mixed problems that 

involve time domain C constraints. 
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A. PACKARDt and J. DOYLE* 

A tutorial introduction to the complex structured singular value (^) is 
presented, with an emphasis on computable bounds and robust stability and 
performance tests for transfer functions and their state space realizations. 
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Abalnct —A tutorial introduction to the complex structured 
singular value (/i) is presented, with an emphasis on the 
mathematical aspects of ji. The p-based methods discussed 
here have been useful for analysing the performance and 
robustness properties of linear feedback systems. Several 
tests for robust stability and performance with computable 
bounds for transfer functions and their state space 
realizations are compared, and a simple synthesis problem is 
studied. Uncertain systems are represented using Linear 
Fractional Transformations (LFT's) which naturally unify the 
frequency-domain and state space methods 

1. INTRODUCTION 

This paper gives a fairly complete introduction to 
the Structured Singular Value (//) for complex 
perturbations. This paper is intended to be of 
tutorial value on the mathematical aspects of jU, 
and it is assumed that the reader is familiar with 
the control engineering motivation. The ju-based 
methods discussed here have been useful for 
analysing the performance and robustness 
properties of linear feedback systems. The more 
elementary methods are now available in 
commercial software products and the manual 
(Balas et al., 1991) for one such product would 
serve as a tutorial introduction to the engineer¬ 
ing motivation. The interested reader might also 
consult the tutorial in Stein and Doyle (1991) or 
other application-oriented papers, such as 
Skogestad et ai (1988). We present very few 
new results in this paper, although many of the 
results have appeared only in reports and 
conference proceedings. The paper is reasonably 
self-contained, skipping only those proofs which 
are readily available in the literature. 

Section 3 begins with the definition of and 
some of its elementary properties, including 

• Received 13 February 1992; received in final form 23 July 
1992. This paper was not presented at any IFAC meeting. 
This paper was recommended for publication m revised form 
by Guest Editor H. Kimura. 

t Mechanical Engineering, University of California, 
Berkeley, CA 94720, U.S.A. 

^Electrical Engineering, 116-81, Caltech, Pasadena, CA 
91125, U.S.A. 


simple bounds that form the basis for computa¬ 
tional schemes. This section also introduces the 
relationship between the upper bound for ju and 
Linear Matrix Inequalities (LMls) which results 
in a simple characterization of the convexity 
properties of the upper bound. The connections 
between // and Linear Fractional Transforma¬ 
tions are introduced in Section 4. These 
connections, especially the Main Loop Theorem, 
form the basis for most of the applications of 
to linear systems. In Section 5, using the 
definition of fx, and the Main Loop theorem, 
robust stability and robust performance theor¬ 
ems are derived for linear systems with 
structured linear fractional uncertainty. 

Sei^tion 6 covers a maximum-modulus 
theorem for linear fractional transformations. 
Section 7 presents a generalization of the 
standard power algorithms for computing the 
spectral radius or maximum singular value of a 
matrix to the computation of fi. This power 
algorithm provides an attractive method for 
computing lower bounds for Sections 8 and 9 
consider issues associated with the upper 
bound, focusing particularly on conditions under 
which the upper bound is equal to fx. For certain 
simple block structures, this equality is 
guaranteed. 

The remainder of the paper discusses applica¬ 
tions of to problems motivated by control 
systems. Section 10 considers how various ^ 
problems can be viewed in transfer function and 
state-space formulations. This leads to a variety 
of tests for robust performance, each with an 
interesting and useful interpretation. In Section 
11, many (computable) necessary and sufficient 
conditions for quadratic stability of uncertain 
systems are given, for a wide variety of 
uncertainty structures. The proof techniques 
used in each different case are identical, giving a 
unifying treatment of many known and new 
results. 
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Section 12 considers a simple case of 
^i-synthesis, the problem of minimizing /i as a 
function of some free parameter, such as a 
controller. Not surprisingly, //-synthesis is a 
much harder problem than //-analysis. For 
example, unlike //-analysis problems, no method 
for minimizing the upper bound for // in the 
synthesis problem using convex optimization has 
been found. 

Finally, the paper outlines some related work 
in Section 13, beginning with a brief history of 
the early development of the // theory. This 
outline is not intended to be exhaustive or 
complete, but simply to touch on a few of the 
topics nearest to this paper that were not 
considered in detail. LMls are presented as 
potentially unifying theoretical and computa¬ 
tional tools. The relationship between // and 
quadratic vs L\ notions of robust performance 
and robust stability is then discussed, followed 
by // with mixed real and complex perturbations, 
llie section ends with a discussion of model 
validation and generalizations of //. 

2. NOTATION 

The notation is standard. R denotes the set of 
real numbers; C denotes the set of complex 
numbers; H is the absolute value of elements in 
R or C; R" is the set of real n vectors; C" is the 
set of complex n vectors; ||i>|| is the Euclidean 

norm for v eC", ||i»||^:= E denotes the 

(-1 

set of square summable sequences in C"; l|e|U is 

(X 

the I 2 norm of sequence e e ||e ||2 := E lk*||^; 

*=i 

R"*'" is the set of nx m real matrices; C"^"' is 
the set of n X m complex matrices; H" is the set 
of Hermitian n x n complex matrices; /„ is a 
nxn identity matrix; and 0„ or 0„xm is an 
entirely zero matrix of obvious dimensions. For 
^ g Qnxm. jjjg transpose of Af; M* is the 

complex conjugate transpose of M; q{M) is the 
minimum singular value of M; a^M) is a 
singular value of M; a{M) is the maximum 
singular value of Af. For M eC'”'": A,(Af) is an 
eigenvalue of Af; p(Af) is the spectral radius of 
A/, p(Af) := max |A,(Af)|; tr(Af) is the trace of 

I 

Af, tr(Af):= E Af^,. If Af eC"’'" satisfies M = 

/-I 

Af * then Af > 0 denotes that Af is positive 
definite, and denotes the unique positive 
definite Hermitian square root. 

3. STRUCTURED SINGULAR VALUE 

This section is devoted to defining the 
structured singular value, a matrix function 


denoted by //(-). We consider matrices Me 
C"*". In the definition of //(Af), there is an 
underlying structure A, (a prescribed set of 
block diagonal matrices) on which everything in 
the sequel depends. This structure may be 
defined differently for each problem depending 
on the uncertainty and performance objectives 
of the problem. Defining the structure involves 
specifying three things: the total number of 
blocks, the type of each block, and their 
dimensions. 

In this paper, we consider two types of 
blocks—repeated scalar and full blocks. Two 
nonnegative integers, 5 and F, denote the 
number of repeated scalar blocks and the 
number of full blocks, respectively. To book- 
keep the block dimensions, we introduce 
positive integers /“i, . . . , rj-; m,, . . . , rrif. The 
fth repeated scalar block is r, x r,, while the _/th 
full block is nil Xm^. With those integers given, 
define AcC"’*" as 

A= {diag[6, A,. • • • > Aj+i,..., Aj+f]: 
b,eC,^s^,eC"|^'”^, 1 <;<F}. (3.1) 

For consistency among all the dimensions, we 
s f 

must have E r, + E ni, = n. Often, we will need 

(-1 /-I 

norm bounded subsets of A, and we introduce 
the notation 

BA={AeA:a(A)<l}. (3.2) 

Note that in (3.1) all of the repeated scalar 
blocks appear first and the full blocks are square. 
This is done to keep the notation as simple as 
possible and can easily be relaxed. 

Definition 3.1. For Af e fi^(M) is defined 


min{6 r(A):A6A, det(;-AfA) = 0}’ 

(3.3) 

unless no A e A makes I - Af A singular, in 
which case //^(Af) ;= 0. 

Remark 3.2. The set A defines a multi-index of 
integers and vice versa, so it makes sense to 
identify as one object the set, the block 
structure, and the associated multi-index of 
integers and refer simply to a block structure A. 
Clearly, //^(Af) depends on the block structure 
A as well as the matrix Af. 

Remark 3.3. Without loss in generality, the full 
blocks in the minimal norm A can each be 
chosen to be dyads (rank = 1). To see this, first 
consider the case of only one full block, 
A = Suppose that /-A/A is singular. 
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Fio. 1. A#-A feedback connection. 


Then for some unit-norm vector jteC”, 
AfAjt =x. Define y := Ax. It follows that =^0, 
and ll^ll ^ ^(A). Hence, define a new perturba¬ 
tion, A e C as 

A :—yx*. 

Note that a(A) - ||_v|| a(A), and y = ^, so 

that I — Af A is also singular. Repeating this on a 
block-by-block basis allows for each full block to 
be a dyad. 

Remark 3.4. It is instructive to consider a 
“feedback” interpretation of at this 

point. Let AfeC”^'* be given, and consider the 
loop shown in Fig. 1. This picture is meant to 
represent the loop equations u = Mv, v = Au. 
As long as 7 - A/A is nonsingular, the only 
solutions u, V to the loop equations are 
M = u = 0. However, if 7 - Af A is singular, then 
there are infinitely many solutions to the 
equations, and norms ||u||, ||v|| of the solutions 
can be arbitrarily large. Motivated by connec¬ 
tions with stability of systems, which will be 
explored in detail in the sequel, we call this 
constant matrix feedback system “unstable”. 
Likewise, the term “stable” will describe the 
situation when the only solutions are identically 
zero. In this context then, provides a 

measure of the smallest structured A that causes 
“instability” of the constant matrix feedback 
loop shown above. The norm of this “destabiliz¬ 
ing” A is exactly 

Remark 3.5. It is immediate from the definition 
that for any or e C, //(orAf) = |ar|//(Af). 
However, for all nontrivial block structures, the 
function //;► R is not a norm, since it does 
not satisfy the triangle inequa/ity. 

Remark 3.6. 4 natural question is why we work 
with ju and not 1///, especially in view of 
equation (3.3) in the definition of //. While it is 
clearly a matter of taste, there are important 
reasons. Mathematically, // is continuous and 
bounded and scales as indicated above. Perhaps 
more importantly, it connects more naturally 
with LFTs and generalizes the spectral radius 
and maximum singular value, as will be seen 
below. 

An alternative expression for follows 

easily from the definition. 


Lemma 3.7, jU^CAf) = = max p(AAf). 

Proof. Since for any a e C, = 

|ar| we need only consider two cases: 

= 1 iff max p(AAf) = 1 and » 0 iff 

maxp(AAf) = 0. These facts can be verified 

directly from the definition. 

This lemma applies continuity of the function 
p:C"*‘"-»R based on continuity of the spectral 
radius and max functions, and the compactness 
ofB^. 

We can relate to familiar linear algebra 

quantities when A is one of two extreme sets: 

• If A= {6/:6eC} (5 = 1. F = 0, r, = n). 
then p^CAf) = p(Af), the spectral radius of Af. 

Proof. This follows immediately from Lemma 
3.7. 

• If A = C"'''’ (5 = 0. f=l, m, = n), then 
p JAf) = oiM). 

Proof If a(A)< l/a(Af), then a(AfA)<l, so 
I-ML is nonsingular. Applying equation (3.3) 
implies PA(Af) ^ a(A/). On the other hand, let u 
and V be unit vectors satisfying Mv = a(Af)u, 
and define A := (l/a(A/))i;M*. Then o(A) = 
l/a(Af) and / — Af A is obviously singular. 
Hence, Pa(A/) 2 CT(Af). 

Obviously, for a general A as in (3.1) we must 
have {A/„: A e C} c A c C"’*". Hence directly 
from the definition of p, and the two special 
cases above, we conclude that 

p(Af) s p^(Af) ^ a(Af). (3.4) 

These bounds by themselves may prove little 
information on the value of p, because the gap 
between p and a can be arbitrarily large. They 
are refined with transformations on Af that do 
not affect PA(Af), but do affect p and a. To do 
this, define two subsets of 

Q = {G6A:e*0 = /„}, (3.5) 

D= {diag [£> 1 ,..., D,, .... ds+f-tm.]- 

e D, = d: > 0, ds^, G R, d,^j > 0). (3.6) 

The reasons for taking D positive will be clear 
shortly. Note that for any A e A, Q e Q, and 
DeD, 

Q*6Q, QAgA, AQcA, 
a(QA) = a(Ae) = a(A), ^ ‘ ^ 

D‘'^L = AD''^. (3.8) 
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Theorem 3.8. For all Q e Q and £) e D 

= (3.9) 

Proof. For all D € D and A € A, 

det (7 - A/A) = det (7 - A7D-''^AD''^) 

= det (7-D''^A/D-'^A), 

since D commutes with A. Therefore 
Also, for each QeQ, det (7 - 
MA) = 0 if and only if det(7-Afgg*A) = 0. 
Since Q*A = A and d(0*A) = o(A), we get 
H^(MQ) = as desired. The argument for 

QM is the same. 

Therefore, the bounds in (3.4) can be tightened 
to 

max piQM) ^ max p{AM) = PiJiM) 

: inf (3.10) 

DeD 

where the equality comes from Lemma 3.7. Note 
that in computing the iniimum, and one of the 
diagonal entries of the elements of D can be 
assumed to be equal to 1. This is without loss in 
generality, since for any nonzero scalar y, 
D‘'"A/D-"" = (yD)‘'"Af(yD)-''l Hence, from 
this point on, we assume that ds+F^\. Also, 
using the polar decomposition theorem for 
invertible matrices, it is easy to see that 
restricting the elements of D to be Hermitian, 
positive definite, as opposed to just invertible, 
does not affect the infimum. Certain convexity 
properties make the upper bound computation¬ 
ally attractive. For block structures with 5 = 0, it 
is shown by Safonov and Doyle (1984) that by 
using an exponential parametrization of D, the 
function d(D''^AfD"‘'^) is convex in log (D*'^). 
In Sezginer and Overton (1990) a very elegant 
and simple proof shows that the function 
a(e^Me~^) is convex on any convex set of 
commuting matrices X. This generalizes the 
results in Safonov and Doyle (1984) and relies 
only on elementary linear algebra. The simplest 
convexity property is given in the following 
theorem, which shows that the function 
a(D‘^A7D“‘^) has convex level sets. 

Theorem 3.9. Let M e be given, along with 
a scaling set D, and ^ > 0. Then 

{DeD:(D>'^A7D-‘'^)</8}, 

is convex. 

Proof. The following chain of equivalences 


comprises the proof: 

- / 9^7 <0 

OM*DM-p^D<0. (3.11) 

The latter is clearly a convex condition in D. 

Remark 3.10. The final condition in equation 
(3.11) is called a Linear Matrix Inequality 
(LMI). Note that although it is equivalent to the 
condition in Theorem 3.9, the functional 
dependence on D is much simpler and makes the 
convexity property clearer. For these reasons, 
LMls appear to be attractive for computation. 
General linear matrix inequalities are discussed 
in greater detail in Section 13.2. 

4 LINEAR FRACTIONAL TRANSFORMATIONS 
AND ft 

The use of p in control theory depends to a 
great extent on its intimate relationship with a 
class of general linear feedback loops called 
Linear Fractional Transformations (LFTs). This 
section explores this relationship with some 
simple theorems that can be obtained almost 
immediately from the definition of p. To 
introduce these, consider a complex matrix M 
partitioned as 



and suppose there is a defined block structure A 2 
which is compatible in size with M 22 (for any 
Az 6 A 2 , M 22^2 is square). For A 2 6 A 2 , consider 
the loop equations 

e = Mud + Af, 2 »', 
z = M 2 id + AI 22 W, (4.2) 

w = A22, 

which correspond to the block diagram in Fig. 2. 

Definition 4.1. Given complex matrices M and 
A 2 as described above. This set of equations 
(4.2) is called well posed if for any vector d, 
there exist unique vectors w, z, and e satisfying 
the loop equations. 



Fio. 2. Linear fractional transformation 
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It is easy to see that the set of equations is well 
posed if and only if the inverse of / - JW 22 A 2 
exists. If this inverse does not exist, then 
depending on d and Af, there is either no 
solution to the loop equations, or there are an 
infinite number of solutions. When the inverse 
does exist, the vectors e and d satisfy 
e = Sf{M, A 2 )i/, where 

5 ^(A/, A2) := A/ji + ~ ^21^7) 'A/211 

(4.3) 

A 2 ) is called a Linear Fractional Transfor¬ 
mation (LFT). If Ai is a block structure 
compatible in dimension with then for 

A] E Ai an analogous formula describes 

^A,, M), 

S^(Ai, M) := M22 + Ai 2 ,A,(y - Mn A,)- ‘A/,,, 

where the upper loop of M is closed with a 
matrix Ai. 

The y{M, A 2 ) and .S^(A), Af) notation can be 
somewhat confusing on first encounter. It comes 
from the “star-product” of Redheffer. Suppose 
that Q and Af are complex matrices, partitioned 
as 


G.i 

G.2 

Af = 

Afii 

Af|2 

021 

022 - 


.^2, 

Af22- 


with the matrix product ^ 72^^11 defined and 
square. If I-Q 22 MU is invertible, then the 
block diagram in Fig. 3 is well defined. We can 
extend the definition of y so that it equals the 
result of this interconnection, 

Simple manipulation gives 

y(e, Af) := 

^iQ,Mu) e,2(y-Af„c?22)'Af.2i 

Af2 ,(/-G22 Af„)-*e2I y(G22 .Af) J' 

where S^{Q, Afn) and ^( 622 . are defined as 
above. Note that this definition is dependent on 
the partitioning of the matrices Q and Af above; 
it may be well defined for one partition and not 
well defined for another. 

For clarity, the notation y(-, •) should have 
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two additional arguments, specifying dimensions 
of the partitions. However, in this paper, the 
only situations using y that will arise are LFTs, 
namely: 

(1) the number of rows of Q is less than the 
number of columns of Af. and the number of 
columns of Q is smaller than the number of 
rows of Af, or; 

(2) the number of columns of Af is less than the 
number of rows of Q, and the number of 
rows of Af is smaller than the number of 
columns of Q, 

and all inputs/outputs into the (dimensionally) 
smaller matrix are closed in the interconnecting 
transformation. Hence, we do not need to 
specify the dimensions of the interconnecting 
channels, since they are equal to the dimension 
of the smaller matrix. 

Alternative notation for LFTs has been used 
in previous papers, most notably 

y(Af, A2) = y,(Af. A2). 

y(A,, Af)=»L(Af. A,). 

where / and u indicate that the lower and upper 
loop, respectively, are closed. We believe that 
the y notation is more natural, easier to work 
with, and generalizes smoothly to y(Af, Q) in 
Fig. 3. 

4.1. Examples of LFTs 

Given the state space realization of a discrete 
time system 

then its transfer matrix is 

O(z) = D-fC(z/-A)-'fl = y 0 /, Af). 

Systems with uncertainty can also be easily 
represented using LFTs. One natural type of 
uncertainty is unknown coefficients in a state 
space model. As a simple example, we will begin 
with a familiar idealized mass-spring-damper 
systems shown in Fig. 4. Suppose m, c, and k are 
fixed but uncertain, with m = m(l + 
r = c(l4 - ^(1-f-Then defining 
Xi=y and JC 2 = y we can write the differential 



c k F 

y -I- —3/ 4- —y = — 

771 m m 




Fig. 4 Mass-spnng-damper system. 


Fig. 3. General star product. 
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equation in state space form as 


[;] 

= y(M 

.A) ■ 

.i 

;]■ ■ 

A = diag((5„,, ( 



0 

1 

0 

0 

0 

0 


-iC 

-c 

1 


zan 

-w* 



—— 

— 

-Wm 




m 

m 

m 


m 

m 

M = 

1 

0 

0 

0 

0 

0 


-k 

-c 

1 


-Wc 










m 

m 

m 


m 

m 


0 

c 

0 

0 

0 

0 


k 

0 

0 

0 

0 

0 


More generally, the perturbed state space system 

^*+1 + B{6)dii, 

e*=C(6)jc*: + f5(<5)d*, 

where 6 is a vector of parameters that enter 
rationally can be written as an LFT on a 
diagonal matrix A made up of the elements of 6, 
possibly repeated. The form of the LFT is 
Morton and McAfoos (1985) 


Xk-^l 


M\x Af,2 Af ,3 


Xk 

ek 

= 

A /21 ^22 


dk 



_A/3i Myi A/33__^ 




with perturbation w* = Az* yielding 

In general, for problems of this type it is easy to 
obtain realizations, but it is , difficult to insure 
that they are minimal, except in the case where 
the parameters enier linearly. 

A fundamental property of LFTs that 
contributes to their importance in linear systems 
theory is that interconnections of LFTs are again 
LFTs. For example, consider a situation with 
three components, each with a LFT uncertainty 
model. The interconnection is shown in Fig. 5. 
By simply reorganizing the diagram, collecting 
all of the known systems together, and collecting 
all of the perturbations (the A|S) together, we 
end up with the diagram in Fig. 6, where P 



Fio. 5. Example interconnection of LFTs 



depends only on Gi, C 2 , and the diagram 
layout. Note how general uncertainty at the 
component level becomes structured uncertainty 
at the system level. Additional information on 
LFTs and how they arise in engineering 
problems is found in Doyle et al. (1991). 

4.2. The Main Loop Theorem 

For notational ease, let Bi:= (A, e \ \ o(A,) ^ 
1). In this formulation, the matrix Afi, = 
ST{M, 0) may be thought of as the nominal map 
and A 2 6 B 2 viewed as a norm bounded 
perturbation from an allowable perturbation 
class, Aj. The matrices A/, 2 , A/ 2 i, and M 22 and 
the formula y(Af, ■) reflect prior knowledge on 
how the unknown perturbation affects the 
nominal map, A/,,. This type of uncertainty, 
called linear fractional, is natural for many 
control problems, and encompasses many other 
special cases considered by researchers in robust 
control and matrix perturbation theory. 

The constant matrix problem to solve is; 
determine whether the LFT is well posed for all 
A 2 in B 2 and, if so, then determine how “large” 
A 2 ) can get for A 2 e B 2 . 

Define a third structure A as 

( 4 - 5 ) 

Now there are three structures with respect to 
which we may compute p. The notation we use 
to keep track is as follows: p,( ) is with respect 
to Aj, ^ 2(0 is with respect to A 2 , Pa(') is with 
respect to A. In view of this, p,(Af,|), fi 2 {M 22 ) 
and are all defined, though for instance, 

iU|(M) is not defined. The first theorem follows 
immediately from the definition of fi. 

Theorem 4.2. The linear fractional transforma¬ 
tion A 2 ) is well posed for all e B 2 if and 
only if 112 (^ 22 ) < 1- 

As the perturbation A 2 deviates from zero, the 
matrix A 2 ) deviates from Mu. The range 
of values that A 2 )) takes on is 
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Fio. 7. Scaling for main loop theorem 


intimately related to as follows: 

Theorem 4.3. (Main Loop theorem) 
^2(^22) 1 

< 1 O max Aj)) < 1. 


Proof. First note that jUA(Af)<l implies that 
^* 2 (^ 22 )< 1 . so we may assume the latter and 
prove the equivalence of the two remaining 
conditions. Let A, e A| be given, with a(A,)£ 1, 
and define A = diag [A], A 2 ] so that A e A. Now 


det (/— A/A) = det ^ ^ 11^1 

L —A/21A1 


— Mi2^2 

/-A/22A2. 


(4.6) 


By hypothesis I - A/ 22 A 2 is invertible, hence 
det (/ - Af A) = det (/ - Af 22 A 2 ) 

• det (/ - A/„ A, - “ ^ 22 ^ 2 ) ■'A/ 2 , A,). 


Collecting the A, terms leaves 
det (/ - A/A) 

= det (/ - A/ 22 A 2 ) det (/ - ^(A/, A 2 )A,). (4.7) 

By the definition of fi, the left-hand side of (4.7) 
is nonzero for all A e iff ^a(^) 1- 

Similarly, the right hand side is nonzero for all 
A = diag [A,, A 2 ] E Ba iff ^Xi(5^(A/, A 2 )) < 1 for 
all A 2 E B 2 . This completes the proof. 


Remark 4.4. This theorem forms the basis for 
most uses of n in linear system robustness 
analysis, whether from a state space, frequency 
domain, or Lyapunov approach. 


Remark 4.5. This theorem is stated in terms of 
feasibility conditions, testing whether some 
quantity is less than one. This allows for an 
elegant statement and proof, but other versions 
are possible, with some complication in notation. 
Scaled versions of the Main Loop Theorem 
appear later in this section. 


Remark 4.6. The importance of the theorem can 
be highlighted by a slight restatement. Suppose a 
property of a matrix W can be related to a ^ 


test on the matrix. That is, there exists some 
block structure Ai^ such that 


matrix W satisfies property ^O^a»(W')< 1 - 

Then the perturbed matrix y(M, A) is well 
defined, and has the property ^ for every 
A E Ba if and only if Ma(A/) < 1, where 
A:= (diagfA^, A]: Ai^e A^, Ae A}. In other 
words, whenever a property of a matrix can be 
related to a ^ test, then there will be a ^ test of 
greater complexity to determine if the property 
is robust to structured perturbations in the form 
of LFTs. 

The role of the block structure A 2 is clear in 
this theorem—it is the structure for the original 
perturbation. However the role of the perturba¬ 
tion structure A, is often misunderstood. Note 
that ^|(■) appears on the right hand side of the 
theorem, so that the set A, defines what function 
of the matrix ^(A/, A 2 ) is to be computed. 

This theorem can be illustrated by a 
system-theoretic example with a transfer func¬ 
tion and its state space realization. This example 
involves two of the simplest cases of LFTs. 
Suppose that A, ;= {6,/„: A, e C) and A 2 — 
which are the special cases considered in 
Section 3. Recall that for AeC"*", ^,(j4) = 
p{A), and for D e fi 2 (^) = 6(D). Now, 

let A be the diagonal augmentation A, and A 2 , 
namely namely 


A 2 J J 


:C<" 


Let AeC'”'", BeC"’"", CeC"”*", and De 
Cmxm^ be given, and interpret them as the state 
space model of a discrete time system 

x»+,=Ax*-t-Bu*, 
yk = Cx* + Du^, 

and let a/e be block state state 

matrix of the system, 



Applying the theorem with these data implies 
that the following are equivalent. 
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• The spectral radius of A satisfies p(A) < 1, 
and 

max (D-hCS,(I-Ad,)-‘B)<l. 

0,eC 

• The maximum singular value of D satisfies 
a(D) < 1, and 

max p(y4 + flA 2 (/- DA 2 )"‘C) < 1. 

d(A2)al 

• The structures singular value of A/ satisfies 
|Ua(A0<1. 

The first condition implies two things; the system 
is stable, and the |HU norm on the transfer 
function from u to y (obtained by setting 
A, = 1/z) is less than one. That is 

||C|U := max d{D + C(zl - Ay'B) 

z eC 
|r|al 

= max &(D + C(5|(/ - A(5i)“'fl) < 1. 

d|cC 

l^lsl 

The second condition implies that (/ - DA 2 )~‘ is 
well defined for all a(A 2 ) ^ 1, and that the 
uncertain difference equation 

= (A + BAjil - DA2)"‘C)j:*, 

is stable for all such A 2 . 

The equivalence between the small gain 
condition, ||G||.< 1, and the stability robustness 
of the uncertain difference equation is well 
known as the small gain theorem, in its 
necessary and sufficient form for linear time 
invariant systems. What is important to see is 
that both of these conditions are in fact 
equivalent to one condition on the structured 
singular value. Already we have seen that the 
spectral radius and maximum singular value are 
special cases of p. Here we see that additional 
important linear system properties, namely 
robust stability and input-output gain are also 
related to a particular case of the structured 
singular value. 

Returning to the main loop theorem, note that 
the bound on the performance is the same as the 
bound on the perturbation, namely one. Scaling 
the matrix M by 1 /p, for some positive scalar p, 
and then applying the theorem gives the 
following. 


Corollary 4.7. Let ^>0 be given. Then 




^ 2 (^ 22 ) < P 

max p,(y(Af, A2))</8. 


The bound on performance and the bound on 
the perturbation are related, they are recipro¬ 


cals. For nonreciprocal values, certain blocks of 
M must be scaled and p recomputed. Specifi¬ 
cally, for or ^ 0, define M,, as 


M„ 


aMu 

M 2 [ 


VaMa' 

M 22 - 


(4.8) 


Some simple facts about M„; 

(1) if or = 0 then Pa(M„) = ^ 2 (^ 22 )', 

(2) for any A 2 e A 2 , with I - M 22^2 nonsingular, 

y(M„ A2) = ay(Af, A 2 ); 

(3) max {arp,(Afn), ^ 2 (^ 22 )} ^ Pa(M„) s 
max{l, ar}p 4 (Af); 

(4) Pa(M„) is a continuous, nondecreasing 
function of or. 


Theorem 4.8. Let /3>f42(W22) be given, and 
:= max = P)- Then 

max |Ui(y(Af, A2)) = — - (4.9) 

A2e(l/^)B2 0^0 


4,3. Upper bound LFT results 

Theorem 4.3 gives necessary and sufficient 
conditions for performance/robustness charac¬ 
teristics in terms of a /i evaluation. The p test 
always takes on the form '‘is p{M) < 1?”. Hence, 
upper and lower bounds on p can be used in the 
following manner; an upper bound gives a 
sufficient condition for the robustness/ 
performance characteristic of the theorem; a 
lower bound gives a sufficient condition for when 
the robustness/performance will not be met. 
Clearly, both are important. The upper bound 
guarantees robustness of a property of a linear 
fractional transformation for perturbations up to 
a certain size, and a lower bound exhibits 
p)erturbations which cause a degree of degrada¬ 
tion in the LFT’s properties. 

The above comments apply for any upper and 
lower bound. Of specific interest is the 
additional information that is obtained in using 
the upper bound. Generally 

implies a great deal more 
than p^(M) < 1. As usual, let A, and A 2 be two 
given structures, and let A = {diag [A^, A 2 ]: A, e 
A|}. Similarly, let D* be the appropriate D 
scaling sets for the two structures, (equation 
(3.6)) and denote D as the diagonal augmenta¬ 
tion of these two sets, D;={diag(Di, D 2 ]:D^e 
D.) 


Lemma 4.9 (Redheffer, 1959, 1960). Let M be 
given as in equation (4.1). Suppose there is a 
£) € D such that < 1. Then there 

exists a Di e Di such that 

max a{D\^y^{M, A 2 )Df^'^)< 1. 

A2eB2 
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Fig. 8 . Equivalent LFTs. 


Proof. The easiest method of proof is just to 
track the norms of the various vectors in the 
loop equations for the linear fractional transfor¬ 
mations shown in Fig. 8. Let D, and Dj be the 
separate parts of the DeD which achieves 
Obviously, fi 2 (Af 22 )<l, so 
for any AjeBz the two LFTs are well posed, 
and from d to e are the same. Let d#0 be any 
given complex vector of appropriate dimension, 
and let e, w, and z be the unique solutions to the 
loop equations for the linear fractional transfor¬ 
mation on the right in Fig. 8. By hypothesis, we 
have 

Pf+lk||"<||H'l|H||d||^ (4.10) 

and since a(A 2 ) ^ 1 

||w||^-||z||^ (4.11) 

Combining these gives 

\\e\\^<\\d\\\ (4.12) 

Equation (4.12) also holds for the linear 
fractional transformation on the left, since the 
matrix relating d to e is the same for both linear 
fractional transformations. This implies that 

as desired. 

Consider the problem determining the value 
of 


inf max ^ 2)0 ‘^^), (4.13) 

D|€D| A2EB2 

and also finding a Di e Di that achieves a cost 
arbitrarily close to the infimum. Suppose the 
dimension of Afn is /i x n. Define an additional 
structure 

:= {diag [A, A 2 ]: A e C"*", A 2 e A 2 }. 

(4.14) 


Then 

inf max d(D\'^y{M, A 2 )T>r‘^) = 4 (4.16) 

D|eD] AjeB^ (X 

In this section, all of the results were stated for 
y(Af, A 2 ). Analogous results hold for Af). 


5 ROBUSTNESS TESTS WITH fi 
The structured singular value can be used to 
quantify robustness margins for a linear system 
with linear fractional uncertainty. Specifically, 
suppose that P{s) is a rational, proper matrix of 
size (nj -H n 2 ) X (fii + ^ 2 ) and block structures 
AicC”’*”‘ and A 2 c:C"^^”^ are given. Partition 
P(s) as 



/’u(*)' 

P22{s)y 


For Ai 6 Aj, consider the interconnection shown 
in Fig. 9. For any AieBi, i5^(Ai,P(j)) is the 
transfer function from d,—The closed-loop 
system is said to be; 

• well-posed if det (/ - Pn(oo)A,) ^ 0. This is the 
necessary and sufficient condition that all 
closed-loop transfer in Fig. 5.1 be proper; 

• stable if all closed-loop transfer functions in 
Fig. 5.1 are analytic in the closed right-half¬ 
plane. 


Theorem 5.1. Suppose that P{s) has all of its 
poles in the open left-half plane. Let ^ > 0. Then: 

(1) For all A, gA, with a(Ai)<;/5, the per¬ 
turbed closed-loop system is well-posed and 
stable if and only if 

sup /Ui(/',,(j))< 4 . 

Rc(.i)^0 P 


Theorem 4.10. Let M, Aj. D,, and A^v be given 
as above. Suppose that ^< 22 (^ 22 ) Define a 
by 


f / 

\Df 01 

jar: inf 

. 0 l\ 

1 D,6D, \ 


r crMi, 


*1) <l| 


Afzi -II 0 ^ 

J/ i 


(4.15) 


(2) For all A, e A, with a(Ai) ^ P, the per- 
turbed closed-loop system is well-posed, 



Ffo. 9. Uncertain system for robustness tests. 
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stable and 

sup P(s))]<^, 

Re (J)3e0 P 


and stable if and only if 


sup 

Re (j)2f0 




if and only if 

sup iua(P(s))<^. 

Re (f)eO P 

Proof. The proof follows from the definitions of 
p, well-posedness, stability, and invertibility of 
matrices with elements in a commutative ring. 

Remark 5.2. Although the structured singular 
value is not necessarily a norm, we introduce the 
following notation: for a proper, rational matrix 
P, analytic in the closed-right-half-plane, and 
block structure A of appropriate dimension, 
define 

\\P\U-= sup fi^iPis)). 

Re (.r) 2 n 

Remark 5.3. In Section 6, techniques that allow 
the right-half plane supremums to be replaced 
(equivalently) by imaginary-axis supremums will 
be developed. 


(2) For all A,€.if(A,) with ||A,|U£P, the 
perturbed closed-loop system is well-posed, 
stable and 

sup p2[5^(A,(i), P(s))]<^, 

Re(j)arO P 

if and only if 

sup fl^iP{s))<^. 

Re(j)20 P 

In summary, the peak value on the p plot of the 
frequency response, that the perturbation 
“sees”, determines the size of perturbations that 
the loop is robustly stable (and/or performing) 
against. 

Other, more sophisticated assumptions about 
the perturbations may be formulated, and solved 
with p. These include gap/graph topology 
uncertainty (Foo and Postlethwaite, 1988; 
Khargonekar and Kaminer, 1991), and different 
induced norms to measure the size of the 
uncertainty, (Bamieh and Dahleh, 1992). 


It is possible to easily generalize these 
robustness theorems to the case where A is a 
block-diagonal, finite dimensional, stable, linear 
time-invariant system (as opposed to a constant, 
complex matrix). Let A be a block structure, as 
in equation (3.1). We want to consider feedback 
perturbations to P which are themselves 
dynamical systems, with the block-diagonal 
structure of the set A. To do so, first let .>M(S) 
denote the set of rational, proper, stable, 
transfer matrices. Associated with any block 
structure A, let M{A) denote the set of all block 
diagonal, stable rational transfer functions, with 
block structure like A. 

MiA) := {A(-) 6 ^(S); A(jo) e A for all j„ e C+}. 

For any Ai6./M(Ai), the closed-loop system is 
said to be: 

• well-posed if det(/-P,i(<»)Ai(«))=^0. This is 
the necessary and sufficient condition that all 
closed-loop transfer functions in Fig. 5.1 be 
proper; 

• stable if all closed-loop transfer functions in 
Fig. 5.1 are analytic in the closed right-half¬ 
plane. 

Theorem 5.4. Suppose that P(s) has all of its 
poles in the open left-half plane. Let /3 > 0. Then: 

(1) For all Ai€,4<(Ai) with ||Ai||,£/3, the 
perturbed closed-loop system is well-posed 


6 MAXIMUM-MODULUS THEOREM FOR LFTS 
WITH fi 

This section describes a maximum-modulus 
theorem that satisfies: (i of an LFT on a 
norm-bounded structured set achieves its maxi¬ 
mum on the unitary elements of this set. This is 
a generalization of the ordinary maximum- 
modulus theorem for rational functions of a 
complex variable. We begin by stating a 
well-known result from complex analysis namely 
that the roots of a polynomial are continuous 
functions of the coefficients of the polynomial. 

Lemma 6.1. Let /(z)= E a,z' be an nth order 

1-0 

polynomial, a„ #0. Let zi, Z 2 ,... , be the 
roots of /. For any e > 0 and any integer m > 0, 
there exists a <5^,^ > 0 such that if g(z), defined 
by 

g(z) = X biZ', 

i-O 

has coefficients b,eC which satisfy 1^,1 
then there are n roots of f + g, labeled 
Zi, Z 2 ,..., z„ that satisfy [z, - z<| < e. 

Next, we shift our attention to polymomials in 
several dimensions, that is, polynomials taking 
C*-»C. If zeC*, we let ||z||. := max |z,|. For 
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p:C*-»C, a polynomial, define as 

^p = min{l|z|U:p(z) = 0}. (5.1) 

In other words, Pp is the norm of the minimum 
norm roots of the polynomial. The next two 
lemmas are from Doyle (1982). The first 
concerns minimum norm roots and is a direct 
consequence of Lemma 6.1. The second 
provides the essential argunient of the 
maximum-modulus theorem for |u, 

Lenuna 6.2. Let p be a polynomial from 
C*—»C. Define fip via (6.1). Then there exists a 
2 € C* such that |z,| = for each i, and p( 2 ) = 0. 


Z|,... , Define a variable £ by 

2 = diag[zi/r,,. • • I > Xj+wl’ 

Then det (/ + MUIV*) is a polynomial on 
since the determinant involves only multiplica¬ 
tions and additions of its argument. Since 
P&(A/) = 1, a minimum norm (using IHU on 
as above) root cf this polynomial has 
norm equal to 1. Let 2 be the particular 
minimizing root with all components of equal 
magnitude, namely one. Then we can write 
2 = 4) for some 4> e Q. This gives 

det (/+ Af i;4iV*) = 0. 

Defining Q := U^V* completes the proof. 


Sketch of proof. A proof is given in Doyle 
(1982). The main idea is as follows: let z e C* be 
a root of p with = ||z|U. If all of the 
coordinates of z satisfy |z,| =/3p, then stop. 
Otherwise, take one of the coordinates of z, say 
Z] whose magnitude is less than /3,,. Consider the 
polynomial q(z,) :=p(z,, Z 2 , . .. , 2 *). This has a 
root at z,, and |z,|</3p. If this is a nontrivial 
polynomial, then, by slightly reducing (in 
magnitude) all of the z,, i ^2, the coefficients of 
the polynomial change slightly, and it has a root 
very close to Z\. This implies that p has a root 
z 6 C* such that ||z||„ < /3p, which contradicts the 
definition. On the other hand, if the polynomial 
q =0, then the variable z, does not matter, and 
we can repeat the argument on a different 
coordinate, say Zj, that satisfies IZ 2 I < /3,,. 

Lemma 6.3. Let AcC"’"" be a given block 
structure, and let Q be defined as in Section 3. If 
M 6 C"'*" has - 1, then there is a 0 e Q 

such that det (/ - MQ) = 0. 

Proof Since //a(^)= 1> there is a A e A with 
d(A)=l and det(/-MA) = 0. Also, for any 
AeA with a(A)<l, the matrix l-Mh is 
nonsingular. Do a singular value decomposition 
on each block that makes up A. This gives 
(/, V e Q, and a diagonal 2 e A, such that 

det iI + MUtV*) = 0. 

Since 2 e A is diagonal, it appears as 

= diag [hjr . a^,, a^], 

for some nonnegative real numbers h, and dcj, 
and w = E m, (recall that the /th full block is 

mj 'X mj, hence each full block contributes rrL of 
the or’s). With a(A) = 1, at least one of the 6, or 
fiy is 1. 

Consider 5 -l- w complex variables. 


The next theorem from Doyle (1982) follows 
immediately from Lemma 6.3 and the facts that 
Q 6 Q implies p(QM) ^ Pa(A/) and det (/ — 
MQ) = 0 implies that p{QM) s 1. 

Theorem 6.4. 

max p(QM) = max p(AAf) = p^(M). 

(?eO AeBa 


Hence the lower bound given for p in equation 
(3.10) is actually not just a bound, but an 
equality. 

To motivate the main result of the section, 
recall the general setup for the linear fractional 
transformation ^{M, A). Suppose M e 
f<(n,+n 2 )x(n,+nj) jj, gjygn partition it in the 
obvious way 


M 


Mu 

M 2 , 


Mn 

A/zzJ' 


( 6 . 2 ) 


with M„ e C'’ ""'. Let A, c C"”"" and Az c 
be two block structures and define Bj, Bj, Qi, 
and Qz correspondingly. 

The maximum-modulus theorem from Packard 
and Balsamo (1988) is as follows. 


Theorem 6.5. Let M be given as in (6.2), along 
with two block structures Aj and Az. Suppose 
that P 2 ^M 22 ) 1- Then 

max Q 2 )) = max /i,(y(Af, Az)). (6.3) 

C?2f02 A26B2 


Proof. A detailed proof can be found in the 
reference. The main idea is as follows; suppose 
(by an appropriate scaling) that the maximum on 
the right hand side of equation (6.3) is one. 
Then, since fiz(Mzz)<l, it is possible to show 
that p^{M) = 1. Using Lemma 6.3, construct 
matrices Qi and Qz such that /- 
M diag[C,, Qz] is singular. Again, use the fact 
that |U 2 (Mz 2 ) <1 to conclude that / - 
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Qz)Qi is singular. This shows that 
Qz))^ completing the argument. 

Remark 6.6. This is similar to a result in Boyd 
and Desoer (1985): that for functions W(z) 
analytic on the disk, the function n(H(z)) 
achieves its maximum on the boundary: 
maxi«(^f(z))-max;i(//( 2 )). It is possible to 

iilatl |z|-l 

use their result to derive Theorem 6.5 and vice 
versa. 

It is instructive to see how Theorem 6.4 can be 
obtained as a special case of Theorem 6.5. Let 
AcC"**" be a given block structure, with 
associated sets and Q. Define A| := 
{d/„: d e C), and for each M e C"**", dehne M as 



By Theorem 6.5, and noting that = p( ), 
we have 

max p{MQ) = max Q)) 

QeQ GeQ 

= max A)) 

AcBa 

= max p(MA) 

AcBa 

= /i^(Af). (6.4) 

This is exactly Theorem 6.4. 

7. LOWER BOUND POWER ALGORITHM 
This section presents an iterative algorithm 
to compute lower bounds for the structured 
singular value. The algorithm resembles a 
mixture of power methods for eigenvalues and 
singular values, which is not surprising, since the 
structured singular value can be viewed as a 
generalization of both. If the algorithm con¬ 
verges, a lower bound for p results. We prove 
that p is always an equilibrium point of the 
algorithm. 

In Fan and Tits (1986) the calculation of p is 
reformulated as a smooth optimization problem. 
As with all of the known exact expressions for p, 
the function to be maximized has local 
maximums which are not global, so in general 
the method yields only lower bounds for p. 
Similar comments can be made for the ideas in 
Doyle (1982) and Helton (1988), as well as the 
algorithm in this section. The contribution here 
is yet another lower bound algorithm to aid in 
the analysis of robustness of systems with 
structured uncertainty, along with a deeper 
conceptual understanding of the structured 
singular value. 


We begin by noting that both the functions 
defined by r(A):=p(AM) and 
f:Q-»R, defined by r(Q) :=p(QM) have local 
maximums which are not global. Note, though, 
that the function f is a restriction of r, and it is 
possible to construct examples where a point 
^ e Q is a local maximum of r, but not a local 
maximum of r. Such a point definitely does not 
correspond to the maximizer that gives p^iM), 
and so we will not consider the corresponding 
lower bound from such points as acceptable. 
Rather, acceptable lower bounds will correspond 
to points Q E Q which are local maximums of the 
function r. 

Roughly speaking, this section develops an 
iterative algorithm which ultimately generates a 
point Q E Q that is a local maximum of the 
function r. In general, these are a proper subset 
of the local maximums of the function f, though 
the global maximums over the two sets are the 
same. Some of the preliminary results are 
generalizations of those found in Fan and Tits 
(1986) and Daniel et al. (1986). 

We will be interested in local maximums of 
the function r(A) = p(AA/), therefore we begin 
with some facts from perturbation theory, which 
assist in characterizing local maximums. 

7.1. Matrix facts 

In this section, we collect a few useful facts. 

Suppose W;R->C”’‘" is an analytic function 
of the real parameter t. If is an eigenvalue of 
Wp := 1V(0) of multiplicity one, then for some 
open interval containing 0, this eigenvalue is an 
analytic function of t, as are the eigenvectors 
associated with it. That is, suppose there are 
x„, y„£C", satisfying y^Xo = 1, = and 

y^. Then there is an e >0 and analytic 
functions jr:(-c. e)— »C”, y :(—e, e)— >C", and 
A:(-6 , e)^C, such that x(0) = x„.l'(0) = >'n. 
A(0) = A|, and for all f e (-e, e) 

y*x = l, 

Wx = ^, (7.1) 

y*W = ky*. 

This follows from Kato (1982). We can then 
differentiate and obtain A(0) = >'5IV(0)X(). 

The next two lemmas follow from elementary 
linear algebra. They will be used in the main 
theorem of the next section. 

Lemma 1. 1. Let y, x e C" with y # 0 and x¥^0. 
There exists d e R, d>0, such that (l/Vd)y = 
y/dx if and only if Re(y*Gx)^0 for every 
G e C'""' satisfying G + G* s 0. 

Lemma 7.2. Let x and y be two nonzero vectors 
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in C". Then there exists a Hermitian, positive 
definite DeC"’‘", such that = if 

and only if Re(gy*x)sO for every geC with 

g + i^o. 

The condition in Lemma 7.2 on y*x involving 
geC is equivalent to y*x being real and 
positive. We have chosen to write it in the form 
above so that it is a natural analog to Lemma 7.1 
and is stated exactly as it will be applied in 
Theorem 7.3 of the next section. 


have been foregone in favor of a simpler 
development. 

Theorem 7.3. Let MeC"'*" be given, and 
suppose A„>0 is a distinct eigenvalue of M, with 
nondegenerate right and left eigenvectors x and 
y. Suppose that p{M) = Ao. If the function 
R defined by r(A) = p(AM) has a Iwal 
maximum (with respect to the set B^) at A — I, 
then there exists a DeD such that D’^'^y = 
D'^x. 


7.2. Eigenvector characterization of local 
maximums 

Consider the function r:B^—»R, defined by 
r(A) = p{^M). Recall that = max r(A), 

and that the global maximum occurs on the 
subset Q c= Ba- In this section, we characterize 
the occurrence of a local maximum of r at 
A = / € Q c Ba, in terms of the eigenvectors of 
M. We begin with some notation. 

Let X and y be nonzero right and left 
eigenvectors of Af, associated with an eigenvalue 
A: Mx = kx and y*M-ky*. Partition x and y 
compatibly with the block structure A, 

Xr^ yr\ 

Xr, yr, 


X “ 



ynt2 


(7.2) 






mi 


where x,, and x„,, y„,eC'”' for each i 

and j. We call these the “block components of 
X and y, and for technical reasons, we define a 
nondegeneracy condition: x and y are nonde¬ 
generate if for every i, y*Xr,*0, and for each 
I I :jtO V ¥=0. We will also assume that 
p(M) = Ao is a distinct eigenvalue ot M. 

The condition that p(Af) = Ao>0 is without 
loss of generality, because A can always be used 
to enforce this (for any <t>, c’ A — A). ^ 

conditions of nondegeneracy and A<, distinct are 
not so easily dispensed with and there ^e 
basically two approaches to deal with them. The 
first would be to argue that there are generic 
conditions and thus unlikely to cause problems 
in practice. A far more satisfactory solution is to 
generalize the theorems and proofs in this 
section to remove them. In fact, this can e 
done, but not without substantial additional 
technical complication. Since the results in t is 
subsection are presented primarily to 
into the power algorithms to be presented in the 
next subsection, these additional technicalities 


Proof. Let G e A with G + G'* ^ 0 so that G has 
the form 

diag[g|/r.- gsffs- G,, . . . , Gf], (/.3) 

where Re (g,) s 0, and G, + Gf^0 for all i and j 
and e"'eBA for all taO with e‘"' = 7 for f = 0. 
Define a matrix function W;R-»C"’‘" by 
W(r) := Note that at l = 0, A„ is a simple 

eigenvalue of W(0), with x and y the right and 
left eigenvectors. For some nonempty interval 
containing 0, this eigenvalue is always simple, 
and hence there is an analytic function of the 
real variable t, A(0. defined on that interval, 
such that A(l) is an eigenvalue of VV(f) for all t 
and A(0) = i,. It is easy to calculate MO), namely 

A(0) = y*lV(0)x = A.oy*Gx. (7.4) 

By hypothesis, Ao>0, p(M) = An and the 
function p(AM) has a local maximum (with 
respect to B^) at A = /. Therefore 



(7.5) 


which says that the magnitude of A must be 
nonincreasing at f = 0. Using the block 
notation” of (7.2) and substituting (7.3) and 
(7.4) into (7.5) yields 


Re 


\,^i 


E y* 

1=1 


G,x, 

I f ‘ 




(7.6) 


This must hold for arbitrary G e A satisfying 
Q + G* ^0. Applying Lemmas 7.1 and 7.2 we 
conclude that for each i, there is a A = 
D,>0 such that D~'^yr-D, Jtr,, 
and for each L there is a d, 6 R, dy > 0 such that 
\l\fd,y„ =yld,x„^. Arranging all of these D,s 
and dyS into one block diagonal D completes the 
proof. 


Remark 7.4. Note that assuming Ao is distinct 
assures differentiability (Kato, 1982). Since Ao is 
a solution of max max |A,(AAf)|, it is likely that 

at the maximum it will be distinct. In any case, if 
Ao is not distinct, it can still be shown to be 
differentiable at a local maximum, and the rest 


MITO Z9:I-6 
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of the proof remains. Unfortunately, this proof 
of differentiability is tedious and technical and 
for this reason has been omitted. 

Theorem 7.5. Let 6 »eQ achieve the global 

optimum for the problem max p{QM). Suppose 

OeO 

that the eigenvalue associated with p(QoAf) is 
distinct, real and positive, and hence equal to 
H = If X and y are nondegenerate right 

and left eigenvectors of the eigenvalue p, then 
there exists a D e D, and | e C", ||^|| = 1 such 
that 


These two constraint equations can be rewritten 
as 

For a given D, Q, and define vectors a, b, z, 
and w by 

b:=D-^% a-D-^^Q*l 

w:=D'% ^ 

With this definition, we have Mb = fia and 
M*z = Pw. We can eliminate ^ from (7.9) to get 




(7.7) 


b = Qa = D V, 
z = Da = Q*n>. 


(7.10) 


Froof. By Theorem 6.4, any global maximizer 
of max p{QM), is also a maximizer of 

QeQ 

max p{AM). Define M := QnM, then A = / is a 
local (in fact global) maximizer for max p{AM). 

A 

Apply Theorem 7.3 to the matrix M and define 
^ = D^'^x = D~^'^y to prove the theorem. 

Remark 7.6. This result was first shown in Fan 
and Tits (1986) for the case of S = 0. It is also 
similar to the “principal direction alignment” 
ideas in Daniel et al. (1986). Theorem 7.5 is 
more general, though, since it handles repeated 
scalar blocks as well as full blocks. 


Remark 7.7. This theorem is not true if we 
consider local maximums that are not global of 
the function f;Q-+R defined as r(e);= 
p{QM). 


Remark 1.8 . Any real number /3 > 0 satisfying 


QD''^MD-'^ = PI 


(7.8) 


for some Q 6 Q, D e D and nonzero | e C” is a 
lower bound for p^{M). This follows because 
I — (IIP)QM is a singular matrix. 


7.3. Lower bound power algorithm 
In this section, we propose an iterative 
algorithm (reminiscent of the power algorithm 
for spectral radius) to find solutions to equations 
(7.7) and therefore get lower bounds for p. 

Rewriting (7.7), and changing notation a bit, 
we want to find Q e Q, D e D, ^ > 0, and ^ e C" 
with II ^11 = 1 such that 


Since the unknowns Q and D generally may 
have high dimension, we would like to write the 
four relationships from equation (7.10) in a 
manner that does not involve the matrices Q and 
D. With a few technical conditions, this can be 
done. In order to simplify the upcoming 
formulas, we will consider a block structure with 
S = 1 , f = 1 (by duplicating the appropriate 
formulas for additional blocks, whether they are 
repeated scalar blocks or full blocks, it is 
straightforward to extend the algorithm to more 
general .structures). Hence the sets D and Q are 

D = {diag[D„d,/„,]:D,6C'"^ 

D, = Dr>0,d2>0}, (7.11) 

Q = {diag [ 9 , 4 ,, QzY-qxq^ = 1, 

02 = /„,}. (7.12) 

With respect to this, we will partition the vectors 
accordingly, so ^ where r,eC'^' and 

Z 2 6 C"', and likewise for the other vectors. 


Lemma 7.9. Let r, and be positive integers. 
Let z,, w,, 6 ,, o,eC''' and Zj, W 2 , bz, ajeC'”' 
be nonzero vectors with a^Wj^O. Then, there 
exists a D G D and 0 g Q such that 


b = Qa, z = Q*w, 
z = Da, b = D~^w, 


if and only if 


w*,a, 

= I « I Wl, 

kfa,| 

fri=r%—Tfli. 

I n ^ «Ai I 


.Ikzii 
^2 “ ,, ,, 
llflzil 

. Ilflzil 


Proof. 

The relations for Z 2 and bz follow by direct 
substitution. For Z] and bj, it is easiest to 
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define an auxiliary variable and 

then verify via substitutions. 

♦-Let q\ = aXwJ\a*WT\, since this is well 
defined. Likewise, choose d^=\\wMa 2 \\. 
By assumption, is well defined and 
nonzero. Since Ww^W = ||z,||, let be any 
unitary matnx that takes into z-^. The 
matrix 02 also rotates 62 into 02 , 

‘ 1 1 

02^2 — ~T‘ 02*^2 = — 2j = fl, 

“2 dz 

Next, we calculate o;z, = |a>,|, which is 
nonzero by assumption: hence Lemma 7.2 yields 
a Hermitian, positive definite D, such that 
Difl] = Z]. As we hope, Dj takes 6 , into w, too, 

Djbi = = q^Zi = w,. 

Defining D and Q in the obvious manner 
completes the proof. 

We are now prepared for the main theorem. 

Theorem 7.10. Let M e C"*'" be given, and let A 
be the two block (5 = 1, F = 1) structure defined 
above, with block sizes r, and m,, where 
r, + m, = n. Suppose > 0 is given. Then there 

exists 0eQ, DeD, ^ = [^']eC", ||||| = 1, 

^ 2 with 

(1 H) 

if and only if there exists nonzero vectors a^, W], 
bi, fl| e C''i and Z 2 , W 2 , ^> 2 , flz ^ C""' with a*w, ^0 
and 


= Mb, 


Zi = ■ 




Wi 


krflii 

Pw = M*z, 

. OlWi 

bi = -T—,Ou 

|fli w,| 




. Ikzll 

= M II '^2- 

Ikzll 


(7.14) 


Remark. In order to find decompositions using 
the representation that this theorem allows 
(equation (7.14)—free of Qs and Ds), we can 
restrict ourselves to unit vectors a, h, z, w. 
Why? Suppose there are nonzero vectors 
satisfying (7.14). Examining the equations, it is 
clear that scaling z and w by some a^O and 
scaling b and a by some y 0 does not affect any 
of the equalities in (7.14). Moreover, the 
equalities in (7.14) always imply that ||z|| = ||»v||, 
and ||fl|| = ||fc||, so by proper scaling, all the 
vectors would be unit norm. 


In the above theorem, we have purposefully 


written the conditions (7.14) in a manner that 
suggests attempting to find a solution in an 
iterative fashion. In particular, for 2, let 

vectors and and positive scalars 

$k evolve as 


\KauJ 






“ 2 *..- 


(7.15) 


11^2.., II , 

IIW2.JI 

where the values of and are chosen 
>0, so that = ||w*^.i|| = 1. 

Note also that if the initial b and w vectors 
that start the iteration are unit vectors, then at 
every step, all vectors, a, ft, z, and w will be unit 
length. 


Remarks. 

(a) Potential problems within the iteration are: 

• Afft^ = 0 or Af*z^ = 0, then a,^^^ or is 
not well defined. 

• = then the vectors and/or 
fti^^, are not well defined. 

• Either \\w 2 j\ =0 or HuzJI =0, making bz, 
and/or Zz^ not well defined. 

If any of these conditions occur, then one 
possibility is to restart the algorithm at a 
different initial condition (i.e. a new fti„, b^^^, 
H'l,, and W 2 ^). A more sophisticated approach 
is to examine the above conditions and 
recognize that a sensible iteration can still be 
defined even if these conditions occur. 
Algorithms have been developed along these 
lines and will be discussed elsewhere. 

(b) If the iteration does converge to an 
equilibrium point, then the p values must 
be equal, that is /5 = ^. This is easy to 
see: suppose the equations in (7.14) are 
satisfied (convergence of the algorithm in 
(7.15)) but the p associated with ft and a is 
^ and the P associated with z and tv is $. 
The converged equations imply that 
there exists a Q eQ and D eD such 
that QD^^MD-^^ {D^%) ^ $(D^^b) and 
{QD^'^MD-^^y(D^^b) = ${D^'^b). Since 
the ps are real, they must be equal. Hence, 
when verifying convergence of the algo¬ 
rithm, it is necessary to begin checking the 
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convergence of the vectors only after the 
and values are nearly equal. This saves 
some computations early in the iteration. 

(c) • If there were only the first block, which is 
a repeated scalar block, the iteration 
would be a power iteration for the largest 
(in magnitude) eigenvalue of the matrix 
M. Since n for one repeated scalar block is 
the spectral radius, the algorithm we have 
proposed reduces to a valid algorithm in 
the special case of one repeated scalar 
block. 

• If there were only the second block, which 
is a full block, the iteration becomes a 
eigenvalue power algorithm for M*M, 
hence it will give the largest singular value 
of M. Again, with respect to this specific 
block structure, this is what we want. 
Hence, the iteration we have proposed is a 
mix of two separate, well understood 
iterations, both of which converge to the 
largest eigenvalue/singular value. We might 
hope that this algorithm will converge to the 
largest p for which the equations in (7.8) are 
solved, which by Theorem 7.5 is equal to 
Unfortunately, this is not always the 

case. 

Extensive computational experience (Balas et 
al, 1991; Packard et al., 1988) has led to the 
following conclusions. 

(1) The algorithm works well in practice, and 
versions of it have been used very 
extensively in universities and industry. It 
appears to have roughly order growth 
rates for computation as a function of 
problem size. The main difiiculty is that it 
occasionally does not converge or converges 
to a value of P which is not (i. 

(2) The difficulties described above do not seem 
to occur in practice, however there are 
matrices whose optimally scaled eigenvector 
block components (equation (7.2)) do not 
satisfy the “nonzero” block conditions 
described at the beginning of Section 7.2. 
This typ)e of situation will lead to the 
difficulties mentioned. In any event, while it 
is easy to construct matrices where these 
problems happen, running the algorithm on 
frequency responses of actual closed loop 
systems has not been a problem. 

(3) Limit cycles can occur, and seem to occur 
more often when there are large repeated 
scalar blocks. Unlike an equilibrium point, 
the presence of a stable limit cycle does not 
immediately give rise to a lower bound 
for fi. 

(4) In general, there are several stable equi¬ 


librium points, with different values of p. 
This is in contrast with the conventional 
power algorithms for p and a, where only 
the largest ones are stable. It is even possible 
that the algorithm converges to a value of p 
which is .smaller than p(M). 

(5) It is possible to refine the power algorithm to 
guarantee convergence to some local maxi¬ 
mum, but at the exponse of greater 
computation time. We are currently re¬ 
searching algorithms that give favorable 
tradeoffs between convergence properties 
and running times. 


8. RELATING ft AND inf 

DeD 

The purpose of this section is to study the 
relationship between and the upper 

bound. The two-step strategy we take first 
involves characterizing the optimality conditions 
for the upper bound, and then determining 
under what situations these optimality conditions 
imply anything about the existence of a block 
structured perturbation matrix A satisfying 
det (/ - Af A) = 0. 


8.1. Optimality conditions for inf d{D''^MD '^) 

OeD 

We want to characterize when a(Af) = 
inf o(D'^MD“'^) that is, when D := I is 

OeD 

Optimal. Begin with M e C""", and let its 
singular value decomposition be 

M = o^UV* + U 2 Z 2 VI (8.1) 

where O] > 0 is the maximum singular value of 
M and has multiplicity r; U, V U 2 , 14 e 

U*U 2 = 0-, V*14 = 0; and Sz 6 is 

nonnegative and diagonal with a^l„^r - 22 > 0 . 

We need some additional notation, in 
particular 

Z:={D-D;D, DeD). (8.2) 

Note that the elements of Z are not invertible, 
and in general are of the form (since ds+F= 1) 

diag [Zi,.... Zs, Zs+il„,, .... Zs+F 

where for each is5, Z, = Z* e and for 
/■ s F - 1, Zs+, e R. Later, we will use the fact 
that Z is a real inner product space, with inner 
product defined by P, T eZ 

s F-t 

(P, T) ;= 2 trace (PJ.) + ^ Ps+fs+r 

i-l y-1 

For Rotational purposes, partition U and V 
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compatibly with A as 




"fl. ■ 

El 

, v = 

Bs 

Hi 



.Hr_ 


where With this 

notation, and a little manipulation, for any 
ZeZ, we can write X„,„(U*ZU - V*ZV) in 
terms of inner products in Z, 

-V*ZV)= min (Z,/*’’), (8.4) 

rieC'^ 

iinii-1 

where for each tj e C", e Z is defined by its 
block components 

FP := A,riri*A; - 


matrix ZeZ such that 

A„.„(t/*Zl/-VZV)>0. 

Equivalently, 

Am«(V'*ZK-f/*Z(/)<0. 

Now, note that for every a > 0 

[J^j][A#*(/ - aZ)M - a\(I- aZ)\{V H], 

is equal to 

(7?a'(V'*ZV - U*ZU) 
aaj{a,ViZV -Z2U^ZU) 

ara,(a,V*ZV2-{/’Zt/zlz) 1 

Zl-oil + aiaiV^ZV^ - 2, f/JZf/jSz) J ’ 

Call T:=a,(a^VZV 2 -U*ZU 2 Z 2 ), and L:= 
((rfV'|ZV2-2if/*Zt/222). Using this notation, 
the matrix in question becomes 


Let V*, c= Z be the set of all such P’’. That is 

V„:={diag P3,p5.i/.,...., 

P^+F-\^rnf-i> OmJ' F?>P3+i 
as in (8.5), t; eC', Ihll = 1}. (8.6) 

Although the matrices U and V are not unique, 
the set does not depend on their particular 
choice. For a given Z e Z, we have 

A„..„(t'*ZC/-F*ZK)= min (Z, P). (8.7) 

Hence, it is the set Vy^, that determines whether 
or not there is a Z such that 

;i^.„(u*zu-v*zv)>o. 

Let the convex hull of a set T" c Z be denoted 
co(r). 

Theorem 8.1. There exists a ZeZ such that 
l„,JiU*ZV-V*ZV)>{) if and only if 0$ 
CO (V*,). 


^a\a{y*ZV -V*ZU) aT j 

L aT* Zl-oil-^aLl 

Choose a>0 small enough so that the three 
conditions 

7 - trZ > 0. 

Zl - a\l + crL < 0. 
aiiV*ZV - VZU) 

- aT{Z\ - a\l + aL)- 'T* <{), 

are satisfied. This is possible, since 7>0, 
{y*ZV -V*ZU)<Q, and 2i-cr^/<0. Using 
Schur complements, it is clear that for such a, 
the matrix 


aia{V*ZV-U*ZU) aT 

L aT* Zl - a]l + aL 


This implies that 

[M*(/ - aZ)M - aid - aZ)] < 0. 
Define D := 1 — aZ, and note that 


Proof. This is a consequence of (8.7), and a 
standard result about convex hulls of sets in 
inner product spaces (Luenberger, 1969). 

Now the optimality condition can be derived. 
In the proof that follows, note that no appeal to 
differentiability of eigenvalues is necessary, and 
all of the steps of the proof are elementary linear 
algebra. The idea for such a simple approach is 
from Young (1992) and Poolla (1991). 

Theorem 8.2. inf a(D'^AfD”'^^) = o(Af) if and 

OeD 

only if 0 E CO (V*,). 

Proof (^) Suppose that O^co(V^). Choose a 


inf a(D''^AfD“''^)<6r(D"^MD-''^ 

< 0| = a(M), 

as desired. 

«:) Supposethat inf a{D''^MD~''^)<a{M). 

Choose D e D such that d(D'^MD"''^) < a{M). 
Define Z e Z via Z ;= 7 — D. Note that 

M*DM -a\D = M*{I - Z)M - a\{I - Z) < 0. 
Hence, for all rj eC', with ||t||| = 1, we have 
0 > ri*V*[M*(I - Z)M - a?(7 - Z)]K7j 
= a^[T;*t7*(/ - Z)Urf - r\*V*(I - Z)Vri] 

= a\r]*{V*ZV - V*ZU)r\. 
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Hence, this Z-.= I - D eZ satisfies 

X^„{VZV-U*ZU)<0, 

which is equivalent to 

kUU*ZU-V*ZV)>0. 

By Theorem 8.1, it must be that Of co(7m), as 
desired. 

8.2. Connecting fi with a(M) 

The convex hull of V*# determines whether or 
not D:=I is the optimum scaling. Following 
Doyle (1982) we ask, “what is true about M if 
OeV^?” Since V;,^cco(V,^) certainly D:=/ is 
optimal, but is anything else true? The answer 
links the upper bound and fi. 

Theorem 8.3. Let Af e C"’'" be given, along with 
a block structure A, and define accordingly 
(equations (8.1), (8.3), (8.5) and (8.6)). Then, 
a{M) = fi(M) if and only if 0 e V^vf- 

Proof. The following four statements are 
equivalent. 

(1) OeV^. 

(2) There exists rj e C', ||j/|| = 1 and 0 e Q with 
QUri = Vtj. 

(3) There exissts | e C", ||||| = 1 and Q eQ with 
QM^ = dl 

(4) = 

1— »2: From the definition of (8.6), 

Oe V#, implies that for some tj eC\ ||»7|| = 1, 

A.t,ti*A:-B,vV*B: = 0, i^s, 

Obviously, for i £ S, there is a phase such 
that e^”'A,Ti = B,j]. For y s f - 1, ll£^r>|| = 

||//yr/||, so there exists a unitary matrix Q, such 
that Q,E,r] = H,ri. The only thing left is the last 
full block. Since ||f/9ll = ||Vr;|| we must have 
||£/pr 7 || = ||/f|rr}||. This gives a unitary matrix Qf 
with QpEpT) = Hpr\. Arranging the phases and 
Qs in a block diagonal fashion gives Statement 
( 2 ). 

2- * 1: This follows along the lines of 1—>2. 
2-» 3: The matrix M has a SVD of M = 

ot/V* + t/zSzFj. Hence QM{Vrt) = oQUrj = 
crVrf. Defining | = Vrj gives Statement (3). 

3^2: A SVD of QM is QM = d{QU)V* + 
( 01 / 2)22 Vj. If QM^ = then $ must lie in the 
subspace spanned by the right singular vectors 
associated with 0 . Hence there is a vector Tj 
satisfying § = Vrj. Obviously l|rjl| = 1 and 

QUr, = QUV*^ = ^QM^ = ^=VT,. (8.9) 

U 


3- »4; QM^ = d^ implies that fXe^{M) = 
max piQM) ^ piQM) s a(Af). However, a is 

QmQ 

always an upper bound for p, hence we must 
have equality. 

4— »3: This is clear, since maxp(OAf) = 

QeQ 

Theorem 8.3 can be used to relate the upper 
bound and p^iM). In particular, we consider 
block structure A that have the following 
property: for all W 6 0 e co (V„.) always 

implies Oe V^,. Note that while this property is 
stated in terms of Vh', is is actually a property of 
the underlying block structure. We will say that 
a block structure satisfying this property is 
^-simple. In Section 9, we will completely 
characterize which block structures are ^-simple, 
and which block structures are not. For now, we 
prove that /i-simple block structures always have 
p equal to the upper bound. 

Theorem 8.4. Suppose the block structure A is 
/i-simple. Then, for every M s C"’*", 

p^{M)= inf a(D''^A/D-*'"). 

OeD 

Proof. Let )8 = inf Let D* be a 

DeD 

sequence in D such that con¬ 

verges to jS as k—»ao. Denote Wi, = 

Since the sequence is bounded, it has a 
convergent subsequence with limit W. 
Obviously, by continuity of a and p, a(W) = p 
and Pii{M) = /iA(W). We claim that 0 e co (V„/). 
If not, then there exist D e D and e > 0 such that 
d(D''^WD' — P — e. Choose k so that ||iy* — 

W|| < elly/KiO ), where k-( ) denotes condition 
number. Then 

||D‘'^(W*-W)D-‘'^1|<|, 

which yields 

This contradicts that p was the infimum, thus 
indeed 0 g co (V^). By hypothesis, this means 
OeVw, so by Theorem 8.3, /iA(W) = a(W). 
Recalling continuity, we get p^(M) = p as 
desired. 

Consider the minimization over the Ds. Since we 
are minimizing the maximum singular value, the 
top singular values tend to coalesce, so that at 
the minimum, the multiplicity of a is greater 
than or equal to two. This is typical of any 
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“mm-max” problem. Suppose though, that at 
the minimum, was distinct. 

Obviously, since we are at a minimum, we must 
have 06co(r). But if the multiplicity of a is 
only one, then V is a single point, hence V = {0}. 
This reasoning gives: 

CoroUary J.5. If, at the minimum of 
a{P'^MD the maximum singular value 
has multiplicity of one, then u(M) = 
min 

Dftt 

9. PROPERTIES OF V 

In this section, we study the convexity 
properties of the set V, since the relationship 
between Ha{M) and mf depend 

on the relationship between V and co (V). It will 
be shown, that for some block structures A, the 
implication 

0 6co(Vw,)-^0e V^, 

holds for every complex matrix W of appropriate 
dimensions. Hence, for those block structures, 

inf 

DeO 

for every matrix Af. Likewise, for other block 
structures, specific matrices can be constructed 
for which the upper bound can be shown to be 
greater than fi. The upper bound may be equal 
to ju for certain matrices (see Theorem 8.3 for 
example) but in general, the upper bound is not 
equal to fi. 

These upcoming results are summarized in the 
Table 1, which indicates section numbers for the 
accompanying derivation or example. Note that 
the (5 = 0, f=l) entry is trivial, and the 
(5 = 1, F = 0) entry implies that for any 

p(M)= inf 

OeC"”" 

which is a well-known fact. 

Before beginning, we make a notational 
change, for ease of exposition. Although the set 
V was defined as a subset of block diagonal. 


nxn Hermitian matrices, in this section we 
identify V with the set H'"' X H'j X • • • X X 

R'^-\ 

9.1. S = 0,F = 2 

The situation with two full blocks is relatively 
simple. Referring back to (8.5), V will always 
have the form 

V={ri*{E*E-F*F)rj:rieC\ ||r/|| = l}, (9.1) 

fo’- some given r>0 and E, FeC"'’‘^ Since 
E*E - F*F is Hermitian, V is just a closed 
interval in the real line. Obviously, this is always 
convex, so if 0 e co (V), in fact, 0 e V. Hence by 
Theorem 8.4: 

Theorem 9.1. If A consists of two full blocks 
(5 = 0, F = 2), then 

Pa{M)= inf 

DeD 

Remark 9.2. The two block case was first solved 
by Redheffer (1959) with a quite different 
approach involving the use of Schauder’s fixed 
point theorem (Dunford and Schwartz, 1958, 
1963). 

9.2. S = 0, F = 4 {Morton and Doyle, 1985) 
Consider the case when A consists of four 

1 X 1 blocks, so 5 = 0, F = 4, and m, = 1 for each 
j. Let a, b, and c be positive real numbers, d and 
/ be complex numbers, and i/;, and V'z he real 
numbers. Define matrices t/, V e by 

0 0 fl 

jc c —jc 

/J Le'’^7 

For the time being, suppose that these are both 
unitary matrices, so that U*U = V*V = Ij. Later 
we will actually assign the correct values, but at 
the moment we just assume this is already done. 
Then define M e C*** by 

M-UV*. (9.2) 

With the assumptions of unitariness on V and V, 


Table I. Guaranteed equality between (i and the 

UPPER BOUND 



F=0 











H 

■ml 

YES 

Section 9.4 


■■ 

MO 


-HO 

Section 9.5 


■■ 

MU 

NO 
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(9.2) is a singular value decompostion of M. M 
has two singular values at 1 , and two singular 
values at 0. With respect to the block structure A 
that we have defined, what properties does the 
set Vnf have? In particular: 

• Is 0 £ CO (Vj^)? If so, then 

inf = 

OeD 

Otherwise, it is less than one. 

•Is OeVm? If so, then niM) = a(M) = h 
otherwise it is less than one. 

Since the multiplicity of the maximum singular 
value is two, we can parametrize all unit vectors 
in C^, and get a parametric representation of 
Vjvf. It is easy to see that any vector t] e C^, with 
||t;|| = 1 is of the form 

re^*' cos 01 
*'“Le'^^sin0J’ 

for some real <pi, 02 > ^nd 6. As it turns out, 
depends only on the difference - <l> 2 , which 
we will denote as (/>. Simply plugging in for the 
definition of from Section 8 , we get 

fl^(cos^ 8 - sin^ 6 ) 

Vm = sin 0 cos 0 cos ^ 

L 4c^ sin 0 cos 0 sin ^ 


proves that for every block structure A satisfying 
5 + F a 4, there exist matrices M with 

inf 

OeO 


9.3. S = 0, F = 3 (Doyfe, 1982) 

In this problem, for every matrix M, c R^, 
of the form 


for some integer r, and Hermitian matrices H, 
and H 2 £ In Doyle (1982), it is shown that 
this set is always convex, so that the upper 
bound is exactly equal to For complete¬ 

ness, the results needed to prove this are stated 
below. 

Begin with some notation from Doyle 
(1982). For any positive integer r, define 
the sets F'’:= (x 6 C'; ||x|| = 1} and S'^:={i»£ 
R"^':||y|| = 1}. If H,, /fj, . . . , are Hermitian 
matrices in C^’’, define a function 

V*H2V 




for each rj e P'. 




eR^:<^, 0€R heR\ (9.3) 

It is apparent that O^V^. That would require 
(from the first coordinate in (9.3)) that 
0 = [(2n + l)/4];r, for some integer n. The 
second and third coordinates being zero would 
then require both cos 0 = 0 and sin <t> = 0, which 
is impossible. Hence 0 ^ and < 1. 

On the other hand, setting 0 = 0, and then 
0 = nil, gives that both [a^ 0 0 ]^ and 
[—a^ 0 0]^ are elements of Vw Consequently, 
0£co(Vjv,). Therefore 

inf = d(M) = 1 . 

DeD 

In order to complete the counter-example, we 
must choose the free variables so that U and V in 
(9.2) are unitary, as we said we could. 

Set y = 3 + and /3 = Vs — 1 and defin e 
a = ^/2fY, b = 1 /Vy, c = I/V 7 , d=-V^7y, 
/ = (1 + /)Vl/y/3, V'l = -nil and ^^2 = n. Some 
algebra later, we conclude that is the set of 
all Jc £ R^, such that ||jr|| = 2/(3-I-V^). Obvi¬ 
ously, 0 $ Vm, but 0 £ CO (V*,). Extensive search¬ 
ing over the set Q in the lower bound formula 
has revealed that for M defined above, n^iM) is 
approximately 0.87326. This counter-example 


Lemma 9.3. Let q be a positive integer. Let 
a,, c, £ R, and 6 , e C for i = 1, . . . , q. For each i, 
define a Hermitian 2 x 2 matrix H, by 



Then there exists a vector d e R‘' and a matrix 
V E R"**^ such that 

f„iP^):={f„iri)-.TieP^} 

= {d-l- Kmiu eS^}, 

where /« is defined in (9.5). 

Lemma 9.4. Let d e R^ and V e Then the 
set {d -t- Vm :« E S^} c R^ is convex. 

Hence, for 9 = 2 and r = 2, the set f{P^) £ R^ is 
convex. For a block structure with 5 = 0, F = 3, 
the set V is always of the form f{P') c R^ (i.e. 
q = 2). Recall though, that r is the multiplicity of 
the maximum singular value. Conceivably, this 
can be any positive number, hence the above 
reasoning needs to be generalized for r > 2 . 

Theorem 9.5. Let r be any positive integer. Let 
Hi, H 2 £ C''’*’’ be Hermitian matrices. Then the 



91 


The complex structured singular value 


set 

fnin * ([ € R^:»? e CMhll = l}, (9.6) 

is convex. 

9.4. S = 1,F=1 

Consider a block structure of one repeated 
sca|ar block, and one full block, S-F = l. 
Recall the definition of equation ( 8 . 6 ). With 
this structure, the set will always be of the 
form 

V= (Ar]ri*A* - ; tj e C^ ||t/|1 = 1}, 

(9.7) 

for some given r > 0 and A, B e C"'*''. It is easy 
to see that in general, V is not convex. For 
instance, take A = I and B = 0. However the 
following is always true. 

Theorem 9.6. Let V be defined as in (9.7), for 
arbitrary matrices A and fl of appropriate 
dimensions. If 0 € co (V), then 0 e V. 


son et al. (1986). The example presented here is 
minimal, in the sense that no smaller P^bl^ 
(“smaller” meaning dimension of blocks and Af) 
could be a counter-example (by results in 
Section 9.4). The dimension of each repeated 
scalar block is two. 

(a) Let a e (0,1) and y c (0,1) be given. Define 
the matrix M 6 by 

■ 0 1 0 1 

y 0 y 0 

V n n 

2fl 0 a 0 

0 - 2 a 0 -a. 


Define a block structure A 2 := {^ 2/2 : 62 6 C}. 

(b) For all A 2 eB 2 the LFT ^(M, ^^) is well 
defined, and appears as 


y(M. A2) = 


0 

1 + 062 

1 - ad. 


1 - 062 

1 + ad. 


(9.10) 


Proof. Suppose that Oeco(V). Then, for some 
integer p, there exist nonnegative o. with 
a,- = 1 and vectors rj, 6 C" with l| 7 j,|| = 1 such 

that 

ia,(A»,,»j.M*-flr),i?rfl*) = 0. (9.8) 

(-1 

which is rewritten as 

a (|; aiTi,r]*y* = “- 9 - 9 *)®*' 

Since the a, are nonnegative, and not all 0, the 
dyad summation in (9.9) is a positive semidelin- 
ite matrix that is not zero. Let X be its 
Hermitian, positive semidefinite square root. 
Therefore aX^'^X^'^A* = Hence, 

there is a unitary matrix V such that 
AX‘'^ = BX^^V. Let V be an eigenvector ot V 
(with eigenvalue e'*) such that X u # 0, and 
define u \=X''^v. Note that u is nonzero. This 
gives Au = e>^Bu, which implies that 0 e V. 

This theorem, along with the LFT machinery 
developed earlier, can be used to give p-based 
derivations of several standard results in linear 
systems theory, such as the ® Pm^v 

Lemma and the Kalman-Yacobovitch-Po^v 
Umma. These are relatively straightforwar 

exercises and will not be pursued further here. 
9.5. S = 2,F = 0 

The block structure considered has 5 
f = 0. A cumbersome example which esUb 
lished the same conclusion appeared in Ander- 


Note that for each such A 2 - 62 / 2 . the 
structural radius of A 2 ) is simply Vy, 
which by assumption is less than one. With 
respect to the structure 

A:= {diag[d|/ 2 , d,!,]: ^ C), 

llieorem 4.3 implies that < L 

(c) Consider the product of two linear fractional 
transformations with different A, in B,. 

0 

For any ye (0,1), it is easy to choose 
fle( 0 , 1 ) so that the spectral radius of the 
above product is greater than one. For such 
choices, then, we must have 

inf fj(D‘''MO ''^)aL 

where D be the scaling set associated with 
A. Otherwise, by Lemma 4.9, the spectral 
radius of any product of these LFTs would 
be less than one. 

Remark 9.7. A deeper analysis can show that by 
proper choice of y and a, the value of 

inf 

DeO 

can be made arbitrarily close to 1 + Vz while 
fiA(A/)<l- 


\l-af 

y(M, -l2)AM. h) = 

0 
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9.6. S = 1,F = 2 

Next is an example for a block structure with 
5 = 1 and F = 2. Again, the example here is 
minimal—^no smaller could be a counter-example 
for this block structure. It is broken down into 
eight facts. 

(a) Let ^ 2 = {diag[(5,, eC}. Then for 
any complex r ^ 0 , 

0 - 

r 1 = 1 . 

T 0 


(b) Let fl 6 C with |a| < 1. Define G on |«5| ^ 1 as 

l + a 6 ~ 


0(8) 


0 


1 — a<5 


1 - ad 


(9.11) 


.l + a6 

Note that everywhere in the unit disk, G is 

ro I/tI 

defined and looks like „ . Hence 

Lt 0 J 

from (a) 


sup flA,(Cr((5)) = 1. 


(c) G(d) in (9.11) can be written as a linear 
fractional transformation. In particular, 
define the matrix M by 


M:-- 


—a 0 — 2 a 0 ' 

0 a 0 2 a 
0 10 1 
10 1 0 


(9.12) 


It is simple to verify for each |<5|:^1, 
0(6) = Af). 

(d) Define Ai-.= {d/ 2;(5 eC), and A 2 ;= 
{diag[ 6 i, 62 ]:!$, eC} and A the augmenta¬ 
tion of the two sets. Certainly n^(M) makes 
sense (dimensions are compatible), and 
fi\(M) ^ 1, since ^ 12 ( 1 ^ 2 ^ - 1- Using (b) and 
(c), and Theorem 4.3, gives ^^(Af)^!. 
Therefore PA(Af) = 1. 

(e) Define the usual scaling sets Di and Dj 
compatible with A, and A 2 . For any D 2 e D 2 


Di'^y(6l2, M)D2^'^ 

r 0 


f^l — ad 

y7,l + a8 



1 -I- a 6 ~ 
I-ad 


0 


(9.13) 


Hence, with some simple calculus, it is easy 
to verify that for any /3 ^ 1 , 

sup a(Di^y( 6 l 2 . A^)D 2 -''^) ^. 

iaisi/fl p - |fl| 


(f) Fact. Let y> 0 . If there is a A,eA,, 
o(Ai) s 1/7 such that 


• / - Af,iAi is invertible; 

• d[y(A|, Af)] i 7; 


then 




inf a 

-/or ( 

’)(f" 

Af ,2 

D,eD| 

A 0 J 

'Am,, 

^22 




x( 


D 


-in 




This fact is simply the contra-positive of 
Lemma 4.9. 

(g) If we choose a ^ ] such that 


P + \a\/p-\a\^ff, 


then we can apply the results from (e) and 
(f) above to conclude that 


inf a 

Di D| 




This logic is as follows: first suppose is 
chosen so that (/I + |a|)/(j 8 - |a|) ^/3. Then 
from equation (9.13) we know that for every 
D 2 G D 2 , there is a A e C with |A| < l/j0 such 
that 


a(D'^V(A/2. M)D2^'^)^^. 


This satisfies the conditions of (f), therefore, 
for each D 2 g D 2 


inf 0 

f),eD, 



V 

Mu 

Mn\ 

[1 0 

D\V\ 

A /21 

A/ 22 / 


v 0 

0 

V 


X 

£>2 




(9.14) 


Carrying out the infimum over D 2 in (9.14) 
yields 

inf d(D''^AfD“''^)^/ 8 , 

where D is the diagonal augmentation of D, 
and D 2 . Therefore the question becomes: 
“What is the largest j 8 such that 

) 8 -E|a|// 8 -|a|>^?”. 


Simple algebra gives the largest P as 
^ lai-n-f-V^f+fiHTT 


Note that as \a\/'\, the quantity + \^. 
(h) In summary let e > 0 , Choose a e C, |a| < 1 
such that 


|a| + 1 - 1 - V|fl|^ 


Define M as in (9.12). Then, with respect to 
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the augmented structure described in (d), 
liAiM) = 1 but inf > 1 + 

y/2-e. 

9.7. MeR"**", S = 0, F = 2 
If M is real, and the block structure A consists 
of two full blocks, then the smallest perturbation 
A making / - Af A singular will actually be a 
real matrix, rather than complex. The proof is 
rather simple using the V set. We also note that 
this result can be found in Redheffer (1959). An 
elementary result from linear algebra is the key 
idea. 

Lemma 9.8. Suppose r is a positive integer, and 
H 6 R'’"' is symmetric. Then 

{t,*H r, :T,eC\ l|r/|| = n 

= {V^Hr,:r,eW, ||r;|| = l}. 

In view of this, suppose Af g 
A={diag[A,. AjjiA, AjeC'"''"’) and 

the optimal D scaling has been computed. Note 
that in a two-block problem, if the infimum is 
not achieved, then it must be that either Af,2 = 0 
or M 21 = 0, and = max (d(Af|,), 0(^22)). 

Then, with singular vectors, it is possible to 
construct a real perturbation of the form 
A := diag [Ai, 0] or A .= diag [0, A2] such that 
/-A/A is singular, and d(A) = l/|i^(A/). Next, 
consider the case when the infimum is achieved. 
Let D be the optimal scaling, and define 
W := D*^^A/D~*^. Since the optimal D scaling is 
of the form diag [d]/„,/,„], where </i>0, it is 
clear that W is still real. Hence, Oeco(Vw')- 
Then, for some U e V e 

W = a,UV* + U2^2Vl 

If U and V are partitioned with respect to the 
block structure as 



then Vh' is 

Vw := {r,*E^E,-FtFt)r, -.riea. |h|| = 1}. 

By assumption, the D scaling is optimal, so 
0 e CO (Vw-) = Vw Using the lemma, this implies 
there is a tj g R^ with |»/|| = 1 such that 
\\Eiri\\ = \\FiTi\\ for J = l, 2. It is easy then to 
construct real, orthogonal matrices Qi and Q 2 
such that QiEiT, = F,ri. Defining Q := 
diag [Cl, Q 2 ] yields 

de.(/-(io)M)=0. 


which shows what we had claimed—in the 
two-block (full blocks) ^ problem, with M real, 
the minimizing perturbation may be taken to be 
a real matrix. 

The next theorem is a mild generalization of 
these ideas. 

Theorem 9.9. Let ArcR""" be a given 
structure, and define the following four aug¬ 
mented structures: 

A,,,= {diaglAR, A,, AjJiAjjgAi,, 

A, gR"”*"'. AzeR"**"*}, 
A,„ = {diag [Ar, a,, Aj] : Ar g Ar, 

AiGR"'’*'", AjgC"’’*"*}, 
An-r = (diag [Ar, A], A2] : Ar G Ar, 

A,gC"'’‘"', AzgR"’’'"’}, 
Arcc = (diag [ Ar, A,, A2 ]: Ar e Ar, 

AigC"’’'"', AzgC**"*}. 

Then for M e 

fiAjAf) = = ^^(A/) = Ma„(M). 

Proof. The proof follows from the previous 
discussion and Theorem 4.3. 

9.8. MeR"*",S = 0,F = 3 
Unfortunately, the argument used above 
breaks down in this case, and no longer may the 
smallest perturbation be assumed real. The 
following is from Skogestad (1987) and Packard 
and Doyle (1990). The perturbation set is 3 1 x 1 
blocks. Let U,Ve R^**^ be 


u = 

1 

y 

^ “ 
or 

, v = 

"-|8 

a 

0 “ 

-y 


_y 

— ar_ 


a 

y _ 


where a, y, p e R, and have been chosen so that 
U^U=VW = l 2 (that is easy to do). Define 
M ■.= VV^ efL^*\ Then d(A/) = l. and r^GC^ 
||r/|| = 1, parametrized by 

e'^cosei 
e'^sin e J’ 

gives 
Vm = 

cos 20 1 

(y^- or^) cos20-(-4cos(V'- 0)yacos 0sin 6]' 

e, v», ^6 r|. 

It is easy to see that 0 G by choosing 
6 = (2n l);r/4 and ij/ - <f> = (2m -I- l)jt/2 for 

any integers n, m. The only vectors r, which lead 
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to 0 e Va# are 


V 




which is always complex. Consequently, the only 
matrices satisfying A € A, o(A) = 1, and I - MA 
singular are complex perturbations. 


9.9. M€R"’‘",S = 1,F=1 
Again, the smallest perturbations are in 
general complex. Suppose G{z) is a stable, nth 
order, SISO transfer function with ||G|U-1, 
|C(1)|<1, and |C(-1)|<1. The state space 
matrix M of this transfer function will have 
H^{M) = 1, but all of the perturbations A = 
diag[d,4,, 62] satisfying a(A) = l, and det(/- 
AfA) = 0 will be complex. 


9.10. Optimal scalings for inf d(D''^MD 

DcD 

with M € R"*" 

If the matrix M is real, then the minimum 
point in the convex hull of is always real, so 
each block of the optimal D e D can be chosen 
to be real. The proof is very simple. 

Theorem 9.10. Let Dr be the set of real, 
symmetric members of D. If M is real, then 

inf a(D‘'^AfD-''2) = inf 

DeD 

(9.15) 

Proof Let D e D be given, with 0 = 0^+ jD,, 
and Note that Dr = Df>0, 

E Dr, and D/ = -Df. Then 

+ jD,)M - p^{D, + jDi) < 0. (9.16) 

Hence, the real part of (9.16) is also symmetric, 
negative definite so 

A™„(M^D,M-/9^D,)<0, 

which implies that 

< p, 

so the iniimums are the same. 


one (recall, the block structure consisted of a 
repeated scalar block, and a full block). We 
explore this type of manipulation in more detail. 
With these relationships established, the robust 
performance properties of an uncertain linear 
system are investigated, and stated in terms of 
structured singular value tests. Finally, the 
connections between the ^-theory and Riccati 
equations for testing norm bounds are briefly 
reviewed. 


10.1. Transfer function matrices and state space 
matrices 

Let Af £ be given, partitioned as 

usual, and define the transfer function matrix 

G(z) := I„, m) = M22 + M2,(2/„ - Af„)-'Af,2. 

Suppose that is a block structure. 

Define A/> as 

Ar:= {diag[6,/„. A]: 6 , e C, A e A}. 

Applying Theorems 4.3 and 6.5, the following 
statements are equivalent; 

(1) p(M]])< 1 and max p^(G(e^^)<l; 

Ue|0. 2jr| 

(2) p(A/,,)< 1 and max M))<\\ 

Se|(),2r) 

(3) p{Mu) < 1 and max M)) < 1; 

*|EC 
l«ll-1 

(4) 

Hence, the peak value of p of a frequency 
response is related to a larger p problem on the 
state space matrix of the transfer function in 
question. This generalizes the example in 
Section 4, where the ||-||,o norm (maximum 
singular value across frequency) was considered. 

Similar results are po.ssible when the upper 
bound is used instead of p. Suppiose that 
D c C'”'*'” is the scaling set associated with A, as 
in (3.6). For any D e D, define 

A/u Af.jD-''^ 1 


10. TRANSFER FUNCTIONS, STATE SPACE 
MATRICES, p AND ROBUST PERFORMANCE 

In this section we begin by establishing some 
relationships between transfer function matrices 
and matrices made up of state space realizations. 
We have already seen one instance of such a 
connection. In Section 4, it was proven that a 
finite dimensional linear system is stable, and has 
||■||s<l if and only if the structured singular 
value of the state space system matrix is less than 


Also, let 
A„ 

A„ := {diag [6,I„, Aj] ; <5, e C, A2 £ C"’"'”}. 

Note that p±^{-) is simply the maximum singular 
value, and that Ajv is p-simple. Then, the 
following are equivalent. 

(1) p(M„)<l and inf ||D‘'^GD-''^|U< 1. 
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(2) p(Mu) < 1 and 

M)D < 1. 

1*1*1 

(3) p(Afn)<l, and 





■ 


inf niaxp^(y’(d/„,Mo))<l. 

LI 6U O £ L 
|6|^1 

(4) ' inf < 1- 

DeD 

(5) inf 

£>eD 

A^€C"“", X<^X’>0 



Also, if M g jjjg„ jjjg scalings can 

be chosen to be real, so that the following are 
equivalent. 

(1) p(Afn)<l and inf \\D^'^GD-'^\l <1, 

/)cD 

(2) p(M„)<land inf \\D^^GD~^^\<1, 

OcDp 

(3) inf 

DeDp 

XsR”^^,X = X^ M) 



These relationships are very significant. Consider 
the simple situation where D:={/„}, in other 
words, an unsealed transfer function. The 
equivalences imply that the linear system is 
stable, and has H-H^ norm less than one if and 
only if there is a state-coordinate transformation 
such that the transformed state space 

matrix 



01. 

fA-"'^ 0] 

o 


L 0 ij 


is a contraction. This is intimately related to the 
characterization of norms using Riccati 
equations (Willems, 1971a). This will be 
discussed further at the end of this section. 


10.2. State space!frequency domain tests for 
robust performance 

We begin with a matrix M e 
0(n+np+m)x(nxnp+m)^ partitioned as below, relat¬ 
ing several variables of a linear system by 


JCi + l 


L J 


Mu 

M 21 

W3. 


Mx2 M,3“ 


Xk 

M 22 Mzs 


dk 

M'^2 -^33« 




( 10 . 1 ) 


The uncertainty is modeled by a feedback loop 
from z to w through a structured A e A, where A 
is a prescribed m x m block structure (note that 


Fiq. 10. Uncertain system as an LFT. 


we have assumed that the number of disturbance 
inputs equals the number of errors, and that the 
perturbation matrices are square—this can all be 
trivially generalized to include nonsquare situa¬ 
tions). Hence, the uncertain system’s output 
error e^ is driven by the input disturbance d*, 
and the state equations are given as 

[';;■]=y(«.A)[;;]. (,0.2) 


With respect to the partition, y(M, A) is 


Mu 


\M,i 

-M21 

A/izJ" 

-^ 23 - 


A(/ - W3,A)-'[A/3, 


Af32]. 


This is shown in Fig. 10. Define three augmented 
block structures, Ajy, \<i and as 

A^ := {diag Az]: <5, e C, Aj e C"'’"""}, 

A.5 := {diag [A^, A): A^ e A^, A e A}, 
Ap:={diag[A2, A]; Aj e A e A), 

along with the corresponding scaling sets D/v, 
and Dp. Motivation for this notatidn is that the 
subscript N could mean norm or nominal, 5 
could mean state space, and P could mean 
performance. We begin with the main result for 
linear, time-invariant perturbations (Doyle et al. 
1982; Doyle and Packard, 1987). 


Theorem 10.1 (Time-invariant, robust 
performance). Given the matrices and sets as 
defined above, the following conditions are 
equivalent: 

(1) there exists a constant Pe[0, 1) such that for 
each fixed AeB^, the uncertain system 
(10.2) is well-posed (/- Af33A is invertible), 
stable, and for zero-initial-state-response, 
the error e satisfies ||e||2:^i8 IMIlzi 

(2) PA,(Af) < 1 (SSp test); 

(3) p(Afi,) < 1 and max M)) < 1 

e6((),2nl 

(FDp test). 

Proof. Introduce two intermediate statements; 
1(a) |UA(Af3.3) < 1 and max A)) < 1. 

2(a) p(A/i,) < 1 and max M)) < !• 

AfiC.ldlsSl 
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The proof that 1 01 (a) follows from the 
definition of stability for a hnite dimensional, 
linear, time-invariant, discrete time system, the 
relationship between norms and I 2 gain, and 
the equivalence between pi and the H-H. norms 
for transfer functions, as developed in Section 
10. Items 1(a), 2 and 2(a) are equivalent by 
Theorem 4.3, while 2(a) and 3 are equivalent by 
Theorem 6.5. 

Remark 10.2. Item 1 in this theorem is the 
desired robust performance conclusion. Item 
1 (a) rephrases Item 1 , using the fi 
characterization of |HU<1. Items 2 and 3 are 
known, respectively as the “state space fi test” 
(SS^) and the “frequency domain pi test” 
(FDfi). Both of these tests involve computing pi 
for various matrices. Recall that upper and lower 
bounds for pi are all that can be computed. 
Hence, we will investigate the additional 
conclusions that are possible when the 
a(D‘^AfD“'^) upper bound is used to imple¬ 
ment the computational tests of Items 2 and 3. 

Remark 10.3. The FD^ test is what is most 
commonly associated with the structured singu¬ 
lar value and is often referred to as a ^-plot. It is 
essentially a Bode magnitude plot with /i(-) 
replacing &(■) or l-j. The SSpi test was introduced 
in Doyle and Packard (1987). 

10.3. Upper bounds 

Using the upper bound in 

place of p, we can derive sufficient conditions for 
robust performance. The resulting state space 
upper bound test (SSUB) and the frequency 
domain upper bound test (FDUB) are 

2' inf CT(Di'^MDj'^)<l (SSUB), 

3' max inf d[Dj.'^y(e^»/„, M)Dp‘'^] < 1 

0G(O,2jr] D/icDp 

(FDUB). 

While the various p tests given in Theorem 10.1 
are aU equivalent, these two upper bound tests 
are very different. In particular, recalling the 
results from the previous section on scaling a 
transfer function with a constant similarity 
transformation, the SSUB condition is actually 
equivalent to 

inf max d[D)Py(e'''/„, A/)Dp'' 2 ]<l. 

DpcDp 0 e( 0 , 2 ^] 

(10.3) 

This condition is much stronger than the 
frequency domain upper bound test, since in 

(10.3), the same Dp e Dp must work for all 


6 € [0, 2;r]. For that reason, we call equation 

(10.3) the frequency domain constant D test, 
FDCD. Listed, from strongest to weakest, the 
various conditions are: 


inf 


0 


mf max 

DpcDp Be| 0 , 1 n \ 

max inf 9[Dj.'^y(e'"/„. M)D^ 

Be|0 2;r] DpcDp 

Condition 1 in Theorem 10 1 



(SSUB), 

]<1 

(FDCD). 

1<1 

(FDUB), 


(FD/i), 


(SSm). 


Note that in both instances where the implica¬ 
tion is given as rather than ^ , there truly is a 
gap. Also, there are two such gaps between the 
state space tests, SSUB and SS^, while there is 
only one gap between the frequency domain 
tests, FDUB and FD^. The top conditions are 
the strongest, and are equivalent to a very strong 
form of robust Lyapunov stability (Boyd and 
Yang, 1989). 

Given that the upper bound is computable, 
one might ask which test should be used, the 
state space upper bound test, SSUB (equiv¬ 
alently FDCD), or the frequency domain upper 
bound test, FDUB? The answer depends on the 
assumptions that are made about the perturba¬ 
tions. If the SSUB is used and the bound 
satisfied, then the robust performance conclusion 
holds for time-varying perturbations (and with 
proper interpretation, cone bounded nonlinear 
perturbations). 


Theorem 10.4. Let M be given as in (10.1), 
along with an uncertainty structure A. If there is 
a 6 D 5 such that 

d{DY^MDs^^) = jS < 1. (10.4) 

then there exist constants Ci^C2 > 0 , such that 
for all perturbation sequences with 

Ait 6 A, ^(Ait)<l//3, the time-varying, uncer¬ 

tain system 

= (10.5) 

is zero-input, exponentially stable, and further¬ 
more, if {dk)X-Q e then 

C 2 (l - 0 ^) llxlll + llelli £ lldlli c, lixoll". 

In particular, ||e||l ^ 0^ |ld||i + Cj ||xo||^ 
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^o/. Note that Ds will appear as = 
diag [Di, d^I, D], where D, = Df>0 be 
C""". Using equation (10.4), it is easy io show 
that regardless of A^eA, a(A0<l/j8, the 
norms of pertinent vectors satisfy 


Let c, and Cj be the square roots of the 
maximum and minimum singular values of 
Summing and taking limits yields the final result. 


Unfortunately, this test (like the SSfi) does not 
scale in a convenient manner. In other words, if 
there is a D^eDi such that = 

1 .001, it is impossible to conclude anything 
about the robust performance characteristics of 
this system. It is necessary to scale the 
perturbation channels and/or disturbance chan¬ 
nels (this amounts to scaling rows of M to 
produce a modified system M^^,) until a Ds can 
be found such that a(Ds^M,„Di''^) < 1, and 
then robust performance with respect to the 
scaled down uncertainty and performance norm 
is guaranteed. For example, let L = 
diag [/„, 0.8/„^, (1/1.2)4,]. Suppose that there is 
a Ds e D.v such that a{Ds‘LMDs < 1. Then it 
is possible to conclude that for perturbations 
satisfying o(A*)s0.8, the error is bounded by. 

Since FDUB is a weaker condition than the 
SSUB, it is “closer” to the exact condition for 
robust performance under linear, time-invariant 
perturbations. Therefore, if the perturbations 
are better modeled as linear, time-invariant 
perturbations, this frequency domain test is 
more appropriate. Also, this tesl scales, that is, 
if 

max inf = 

Se[().2;i| DpeDr 


Postlethwaite (1988), Khargonekar and Kaminer 
(1991), and Bamieh and Dahleh (1992). 

10.4. ^ norms, Riccati equations and LMIs 
We now consider the relationship between the 
bounds given above and Riccati equations for 
computing the norm of the discrete time 
system 

x^^i=Axu + Buu, 
y* = Cx* -I-Du*. 

Let Af 6 be the block state space 

matrix of the system 



Assume that A is stable (p(A) < 1) and define 
E:=A + B{I-D'D)-'D'C. 

G:= -BiI-D'D)-'B'. 

Q := C'il - DD')-'C. 

Suppose E is nonsingular and define a symplectic 
matrix as 

_r£ + G£'-'Q -G£'-‘] 

[ J’ 

It can be shown that the following statements are 
equivalent; 

(a) ||D-t-C(z4-A)-’fl|U<l. 

(b) S has no eigenvalues on the unit circle and 
||C(/-A)-'fl-l-Dl|<l. 

(c) with I-D’D-B'XB>0, (/+ 

GA')''£ stable, and 

E'XE -X- E'XG{I + XG)-'XE + Q=0. 

(d) 3Ar >0 such that I - D'D - B'XB>0 and 
E XE -X- E'XG(l + XG)-'XE + Q<0. 


then the conclusion is that for all A e A, with 
ct(A) < \/p, the perturbed system is stable, and 
the II 11. norm of the transfer function from the 
disturbance to error is s/3. Hence, peak values 
other than one still give useful information. 

However, if the frequency domain test is used, 
no general conclusion can be reached about 
time-varying perturbations (Packard and Doyle, 
1990). In Safonov (1984). .some connections 
between the frequency domain test and robust 
stability to cone bounded nonlinearities are 
developed. 

For reference, continuous-time versions of 
these theorems, as well as theorems with 
more sophisticated assumptions about the 
structured perturbations are found in Chen and 
Desoer (1982), Doyle et al. (1982), Foo and 


(e) 3X > 0 such that 


A 

RT 

[X t 

ni-A 

B 

X O' 

C 

D. 

.0 1 

Jlc 

D. * 

* .0 /. 


(f) 37'nonsingular such that 



TAT- 

CT-i 




Note that (c) is a Riccati equation, and the 
SSUB in (f) is equal to fi because of the block 
structure. It is equivalent to (e), which is two 
LMIs. The connection between (d) and (e) is 
just the Schur complement formula for positive 
definite matrices. 
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n. QUADRATIC LYAPUNOV FUNCTIONS FOR 
UNCERTAIN SYSTEMS 

Some computable results on the quadratic 
stability of linear systems under structured, 
linear fractional uncertainty are possible. Again, 
consider positive integers n and m, and suppose 
Let A be a structured 
perturbation set with Assume that 

HiXM- 2 z)< 1, so that A) is well defined for 
all A € 

Let {Aa}a=.o with A^ e be given, along with 
an initial condition JCn e C", define e C" by the 
uncertain difference equation 

x,^, = Sr(M,A,)x,. (11.1) 

In this formulation, the matrix M,, may be 
thought of as a nominal state space model and 
Ajk e Ba as a norm bounded perturbation from 
an allowable perturbation class, A. The matrices 
Mi 2 , M 21 , and M 22 reflect prior knowledge on 
how the unknown perturbation affects the 
nominal dynamics, Afi,. 

Definition 11.1. The pair {M, A) is quadratically 
stable if there exists a P e C"**", with P = P* > 0, 
such that 

max A)]*Py(A/, A) - P) < 0. 

The definition simply implies that there is a 
single quadratic Lyapunov function, V(x):= 
x*Px, that establishes the stability of the entire 
set 


{y(M, A);A6B^}. 

Equivalently, the definition implies that there is 
a positive definite P e C"*" such that 

max a(P"^y(M, A)p-^'^) = y < 1. 

Hence, with respect to a single coordinate 
change defined by P'^, A^) is always a 

contraction, regardless of A;^ 6 B^. As the 
uncertain system in (11.1) evolves, the Euclidean 
norm of P'^x*, ||P’^x*||2, decreases by at least a 
factor of y every time step k, and hence 
robustness with respect to time varying pertur¬ 
bations is guaranteed. Note that if both M is 
real, and AcR'"’'"', then by using an argument 
similar to that in Theorem 9.10, the matrix P, if 
it exists, can be chosen to be real. 

Using Theorem 4.10 and the fact that in some 
instances (when 25 -I- P £ 3), fi and the upper 
bound are always equal, we can establish 
necessary and sufficient conditions for a pair to 
be quadratically stable, in terms of a scaled 
state space test, and/or a scaled ^ norm test. 
Several cases are outlined below, along with a 


chain of equivalences which produces the result. 
As usual, define the transfer function G(z) as 

C(2) ;= M 22 + M2i{zl - 

Note that this is the transfer function of the 
linear system that the perturbation A/, “sees”. 

11.1. Real state space data, 1 full real 
perturbation 

Suppose that ^ g and that 

A-ijmxm Assume that 0(^22) <1. For any 
p^Qmxn p^p.5.0, let 

P''"Af,2] 

L M 22 J’ 

Also, define 

A;,; = { diag (A,, A2]: A, e R"Aj e R". 

Then, using Theorem 4.3 and the results from 
Sections 9.7 and 10, the following statements are 
equivalent. 

(1) There exists P = P*>0 such that 

max 

(2) There exists PeR”"”, P = P^>0 such that 
max d[P''^y{M. A)P"''"1<1. 

AeBa 

(3) inf max A)] < 1. 

/'eR'”'" AeB^ 

>0 

(4) inf 

p^pr ^0 

(5) inf inf 

/•eR"“" t/|>0 



(6) inf inf 

j,-n 

P=^P'^Q 



(7) inf 

P^PT;>0 


/rp*'^ 0 1 

1 

5 


M 


VL 0 u 

o 

ij/ 


(8) p(M„)<l and ||G|U<1. 

The main point here is that the uncertain system 
is quadratically stable with respect to full block, 
norm bounded, real perturbations (Condition 1) 
if and only the X. norm of the transfer function 
that the perturbation sees is less than one 
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(Condition 8). Conditions 2-7 are intermediate 
^ps which link the two conditions together 
This same style is used in Sections 11.2-11.4. 


11.2. Complex state space data. I full complex 
perturbation 


Suppose that Af g and 

A = C^^’". Assume that b(M 22 )<l. For 
PeC"*'" with P = P*>o, let 


that 

any 


demonstrated an application of the Main Loop 
theorem. In that example, LFT arguments were 
given to prove that the pair (M, C'"*'") is 
robustly stable if and only if ||G||„<1, where 
G(z) = M 22 + ~ That result, 

along with Section 11.1 and this section Combine 
to form the following theorem (Willems, 1973; 
Popov, 1962; Khargonekar er a/., 1990). 


L M2\P \ 

Also, define 

Ac:= {diag[A,, Aj]: A, e C'”'". AjeC"”'"'}. 

Then, using Theorem 4.3 and the results from 
Sections 9.1 and 10, the following statements are 
equivalent: 


Theorem 11.3. Suppose that Af € 
with a(A/22)<l. Define G(2) ;=Af22 + 
— Mii)~'Mi 2 . Then, the conditions 

(1) the pair (Af, R"”"") is quadratically stable; 

(2) the pair (Af, C'"’"") is quadratically stable; 

(3) the pair (Af, C"”"") is robustly stable; 

(4) p(Af,,)< 1, and ||G|U<1; 

are equivalent. 


(1) There exists P e C'"'", P = P* > 0 such that 
max CT[P''^y(Af, A)P“''^] < 1. 

(2) inf max CT[y(AfA)] < 1. 

PeC"»" A6B4 

(3) inf pA,(Af'’)<l. 

PeC"**" 

P*P*>0 

(4) inf inf 

P*=P*>0 


It is important to note that Conditions (1) and 
(3) become incomparable (neither implies the 
other) when the perturbation set becomes 
structured (Packard and Doyle, 1990; Rotea et 
aL, 1991). 

11.3. Complex state space data, \ repeated 
complex perturbation 

Suppose and A = {dl^:de 

C}. Let As be defined as 



(5) inf inf 

PeC'*'''’ di>() 
p = p*>0 



(6) inf 

p^p*>0 



Ar:={diag[A,, : A, e d^eC). 

Then, using Theorem 4.3 and the results from 
Sections 9.4 and 10, the following statements are 
equivalent: 

(1) There exists PeC'^", P = P^>0 such that 
max d[P''^y(Af, 62/„)P"''^] < 1. 

(2) inf max o[y(Af'’, 62/„)]< 1. 

PeC"" 

(3) inf p^,(A/'’)<l. 

p^Qnnn 

(4) inf inf 

p=p^^o z>2=DJ>n 


(7) p(Mn)<l and ||G|U<1. 

Some interesting connections between different 
notions of stability can be made at this point. To 
do so, consider the definition of robust stability 
given below: 

Definition 11.2. The pair (Af, A) is robustly 
stable if 

max p(y(Af, A)) < 1. 

Recall the example in Section 4, which 



L 

0 




< 1 . 


(5) inf inf 

PeC"*" /> 26 C"'*''" 
p-p*>n D2 »«oj>» 



(6) p(Mn)<l and inf <\. 

D2^Df>i) 

In this section, the matrix 61^) is a 


AUTO 29:1-H 
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rational function of the scalar, complex para¬ 
meter d. We have shown that quadratic stability 
with respect to such a parameter can be 
ascertained by determining if the convex set 




PeC"*'", DjEC" 


X = X*>0, M*XM 


is nonempty. 

11.4. Complex state space data, 2 complex full 
blocks 

Suppose that M e and A is 

A ;= {diag [A,, A^]: A, e C'"'*''"'} c 
and let 

Ac :={diaglAo, A]; ApeA 6 A}. 

Then, using Theorem 4.3 and the results from 
Sections 9.3 and 10, the following statements are 
equivalent. 

(1) There exists PeC""”, P = P*>0 such that 
max d[P''^y(M, A)P-*'"]<1. 

A C: 

(2) inf max a[y(Af ^ A)] < 1. 

PeC'”'" AcBa 
F»P*>0 

(3) inf ^^,(AfO<l. 

p=p*>n 


(6) inf inf 

PeC"*" J2>0 
p«P*>() 

/r p"^ 0 0 " 

o 0 0 

\L 0 0 I„,_ 



(7) p(Mh)< 1 and 



Hence quadratic stability with respect to two full 
complex blocks of uncertainty is equivalent to an 
optimally scaled small gain condition. Note that 
for any or > 0, 


r 

' d2>0: 



is either empty or is a convex set. 


(4) mf inf 

/>eC"*” d,,di'>a 
P = P*>0 



(5) inf inf 

>»=.p*>0 



11.5. Conclusions 

Some of these results are well-known, and 
available in the literature, although the treat¬ 
ment here is more unified. The results relating 
quadratic stability and H and for full block 
perturbations (Sections 11.1 and 11.2) are 
proven for SISO systems in Popov (1962) and 
Willems (1973) and for MIMO systems in 
Khargonekar et al. (1990). The results for two 
complex blocks is from Packard and Doyle 
(1990) while the result for a single complex 
repeated scalar perturbation, is, to our knowl¬ 
edge, new. Similar results are easily derived for 
continuous-time systems, using a bilinear trans¬ 
form. By defining 



and noting that 

A*P + PA <0-«^P-‘'2A*P‘^-t-P*'2>4P-''^<0 
«- a ( y ( fi8 , P ‘'^ AP“‘'^))<1 
-«^o(P‘'='y(98, A)P-"^)<1, 

the results relating H-H. (possibly a scaled norm, 
as in Sections 11.4 and 11.3) and quadratic 
stability can be derived in the same manner. 
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12. m-synthesis via optimally scaled lfis 
This paper so far has only considered 
^-analysis. The problem of ^/-synthesis is much 
more difficult, and will be discussed in this 
section. In general, ^i-synthesis methods have 
focused on minimizing fi of some rational matrix 
over stabilizing controllers using the frequency- 
domain upper bound (FDUB) and have been 
successfully used in many applications. 
Nevertheless, the theoretical basis for fx- 
synthesis is much weaker than for ju-analysis. 
^is section will consider a ^.t-synthesis problem 
involving only constant matrices, to explore the 
potential difficulties in a simple setting. For an 
introduction to /i-synthesis in the rational case, 
see Balas et al. (1991). 

Suppose that a matrix M depends on a free 
parameter Q. How can Q be found so as to 
minimize In this section we consider this 

problem when M depends on a free matrix Q in 
a linear fractional manner, and we attempt to 
minimize the upper bound for rather 

than fx^{M) itself. Tliis problem is first reduced 
to an affine, rather than linear fractional, 
transformation, and then partially solved using a 
elementary extension to matrix dilation theory 
(Davis etai, 1982; Power, 1982). In the lemmas 
to follow, F denotes either the real or complex 
field. 

Lemma 12.1. Let ReF"*", 7'er‘'^ 

and V eF^", where r,i^n. Let ZcF""^" be a 
prescribed set of positive definite matrices. Then 

inf d[Z''\R +UQil-TQyW)Z~''^] 

QeF'’“.Z£^ 

tJci(/~rQ)¥=0 

= inf alZ'^^XR + UQV)Z-''^]. 

0eF"'.Ze3' 

Proof. For any T e the closure of the set 

{Q{I-TQy':Qer^\ det {I-TQ)=^0}. 

is all of F'''^', which shows that the infimums are 
the same. 

Hence, in order to solve general linear 
fractional transformation optimization problems, 
only affine transformations need be considered. 
We also assume (without loss in generality) that 
U is full column rank, and that V is full row 
rank. The first lemma addresses the unsealed 
problem, and comes from Davis et al. (1982) and 
Power (1982). 

Lemma 12.2. Let R, U, V, be given as above. 
Suppose and Fj, e F'"are 

chosen such that [V 1!^], j ate both 


invertible, and that 

Let or>0. Then 

inf &[(«-l UQV)]<a, 

QfF*' 

if and only if 

A„,„[t/l(/?/?*-rr=/)t/x]<0. ^ ’ 

The next lemma partially answers the synthesis 
question when similarity scalings are included. 
The proof is in Doyle (1985a) and Packard et al. 
(1992). 


Lemma 12.3. Let R, U, V, and Fj be given 
as above. Let or>0 and ZcF’^" be a given set 
of positive definite, Hermitian matrices. Then 


inf a[Z^’\R + UQV)Z-'^]<a, 

CeF”' 

ZeZ 


if and only if there is a Z e Z such that 

A_[F,(f?*Z/?-,r^Z)F.:]<0. (12.2) 

and 

<0. 02.3) 


Note that the condition imposed on Z in 
equation (12.2), is convex, therefore, if the set Z 
is itself convex, determining solutions of 
equation (12.2) is a convex feasibility problem. 
Similarly, the condition imposed on Z“"* in 
equation (12.3) is convex in Z "‘, so if the set 
Z~* is convex, this is also a convex feasibility 
problems is exploited in Packard ef al. 
(1991, 1992) where some robust control prob- 
(1991, 1992) where some robust control prob¬ 
lems are formulated, and recast as convex 
optimizations, using this scaled linear fractional 
transformation approach. Unfortunately, the 
complete problem, which involves both Z and 
Z~* conditions, is more difficult, and at the 
moment, unsolved. In some special cases, it may 
be possible to obtain computable necessary and 
sufficient conditions. For instance, if 


Z = 




then both conditions define open intervals in the 
real line, and it is easy to check if these intervals 
intersect (moreover, the intersection is either 
empty or convex). More generally though, the 
set of “good Zs” may be a disconnected set. 
Specifically, given matrices R, U and V, and 
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or > 0, let Zgooa be 

^|ood(^) 

:=lzeZ: inf aiZ^'^iR + UQV)Z-'^) < a 

( QeJl(C) 


It is this set, Zgood(o^)> which may be 
disconnected. In particular, let o' ;= 1, and 

Z- {diag[z,, Z2. l];z, >0, Z2<0}, 


and define matrices 

1 -1 0 

I? = | -1 1 IOV 2 

0 \/2 0 


U = V^ = 


1 

-1 
L 0 


0 

0 

iJ 


Applying the formulae, we have that Z € Z^^^a if 

and only if z, >0, zi + Z2 > 2, and — + 

■^1 

— >200. Clearly, the region of good Zs in the 
^2 

Z1-Z2 plane consists of two slivers near the axis, 
which is not a connected set. Unlike the analysis 
problem the level sets of scalings in the synthesis 
problem are not convex. We are currently 
investigating the implications of this property. 


13. SUMMARY OF SOME RELATED WORK 
This section outlines some work related to this 
paper, beginning with a brief history of the early 
development of the n theory. This outline is not 
intended to be exhaustive or complete, but 
simply to touch on a few of the topics nearest to 
this paper that were not considered in detail. 
LMIs are discussed as potentially unifying 
theoretical and computational tools. The re¬ 
lationship between n and quadratic vs L, notions 
of robust performance and robust stability is 
considered next, followed by with mixed real 
and complex perturbations. The section ends 
with model validation and generalizations of fi. 

13.1. History of early work 
In this section, we will briefly review the ideas 
that most influenced the original development of 
the n theory. These remarks are drawn mainly 
from earlier papers (Doyle, 1982; Doyle et al., 
1982; Fan et al., 1991), but are repeated here for 
the convenience of the reader. 

An obvious influence on the developement of 
the n theory was the work in so-called Robust 


Multivariable Control Systems from the late 
1970s, (IEEE, 1981) which in turn drew heavily 
on earlier work in stability analysis (e.g. Zames, 
1965; Desoer and Vidyasagar, 1975; Willems, 
1971b; Safonov, 1980), particularly the small 
gain and circle theorems. These theorems 
established sufficient conditions for stability of 
nonlinear components connected in feedback. 
The emphasis in the early robustness work was 
on small gain type conditions involving singular 
values that were both necessary and sufficient for 
stability of sets of linear systems involving a 
single norm bounded but otherwise uncon¬ 
strained perturbation. Another emphasis for 
much of the robustness theory was on using 
singular value plots as a means of generalizing 
Bode magnitude plots to multivariable systems. 

While methods based on singular values were 
gaining in popularity, it became evident that 
their assumption of unstructured uncertainty was 
too crude for many applications. Furthermore, 
the problem of robust performance was not 
adequately treated. Freudenberg et al. (1982) 
studied these issues using differential sensitivity 
and suggested that something more than singular 
values was needed. It was a natural step to 
introduce structured uncertainty of the type 
considered in this paper (see Safonov (1978) for 
an early treatment). The so-called conservative- 
ness of singular values was based on the fact 
that the unsealed bounds 

could be arbitrarily far off, and research was 
begun to provide improved estimates of p, with 
an initial focus on the nonrepeated, complex 
case (5 = 0). 

It was obvious that the sharper bounds 
max p{QM) ■ max p{AM) = 

^ inf d(D''^MD “''"), (13.1) 

could help alleviate the conservativeness some¬ 
what. The upper bound is similar to the 
multiplier methods that were used in nonlinear 
stability analysis to reduce the conservativeness 
of small gain type methods (Willems, 1971b) but 
the use of both upper and lower bounds, and the 
questions of how close the bounds were and how 
to efficiently compute them were new and open. 
As we saw in Section 6, the equality of the lower 
bound and /i is relatively straightforward and not 
surprising. What is remarkable, even in re¬ 
trospect, is that the upper bound is often close to 
fi and is in fact equal to fi for certain simple 
block structures. 

There was substantial numerical evidence for 
the upper bound results before they were 
proven. Engineers at Honeywell’s Systems and 
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Research Center, particularly loc Wall, began 
routinely using a simple generalization of 
Osborne’s routine (Osborne, 1960) to approxi¬ 
mate the upper bound in (13.1) and gradient 
search methods to find a local maximum for the 
lower bound. Osborne’s algorithm minimizes the 
Frobenius norm rather than the maximum 
sinpilar value, and the scalings produced can be 
us^ to approximate the upper bound. The 
consistent closeness of the bounds, usually 
within a few percent, suggested that there was a 
deeper connection between the bounds. Ironi¬ 
cally , nunimizing the Frobenius norm remains 
the cheapest method of approximating the upper 
bound. Safonov (1982) suggested a somewhat 
less general approximation to the upper bound 
based on Perron eigenvectors which is com¬ 
parable to Osborne in speed and accuracy. 

While the ^ framework arises naturally in 
studying robust stability with structured uncer¬ 
tainty, the use of fi to treat directly the problem 
of robust performance with structured uncer¬ 
tainty was first explicitly noted in Doyle et al. 
(1982). As noted above, this is a consequence of 
the intimate connection between /r and LFTs 
(Doyle, 1985b; Packard, 1987). In retrospect, 
it is clear that Redheffer (1959, 1960) had 
developed the foundation of this connection in 
his work on LFTs in the late 1950s. In fact, as 
noted earlier, Redheffer had even proven that 
the upp>er bound in (13.1) was an equality for the 
case where 5 = 0 and f = 2. While Redheffer’s 
results were not well-known in the control 
community until the fi theory was already 
well-developed, the rediscovery of his work has 
since had an important influence, not only on the 
further development of fi but in other areas as 
well (e.g. see Doyle et al., 1989). 


13.2. Linear matrix inequalities 
We have seen in this paper how LMIs arise 
naturally in both ^ analysis and synthesis in the 
computation of upper bounds. The general LMI 
problem involves sets of the form 


af= 


fdiag[A',, . . . , Xs, xj, . . . , Xf/]; 
I X.eC'‘^\X. = x:,x,€R 


}, (13.2) 


and a list of matrices A,, /?,, C,, D,. The simplest 
general LMI problem is to determine whether 
there exists X e such that 


A:XAi~B:XB,-^XQ^C:X + D,<Q Vr. 

Depending on the particular problem, the < 
may be a <. It is easy to see that these inequality 
conditions produce a set of solutions which are 
convex, which makes LMIs attractive computa¬ 
tionally. (In the synthesis problem in Section 12, 
there are additional constraints that destroy 


convexity.) This is a decision problem; the 
answer is yes or no. Sometimes, however, the 
B„ Ci, and Di are functions of a real, 
positive parameter a, and we want to know, for 
example, what is the largest a for which there is 
no solution. Typically this involves an iteration 
on or, and consequently, answering the decision 
question many times. 

Recall that the upper bound for /i can be 
rewritten as an LMI of the form 

3X>0 :M*XM - p^X < 0. (13.3) 

It has recently been shown that a number of 
other problems can be reduced to solving LMIs. 
In Wang et al. (1991) balanced truncation 
model reduction is extended to uncertain LFT 
systems, with similar extensions of the para- 
metrization of all stabilizing controllers in Lu et 
al. (1991). The LFT/LMI machinery not only 
extends the standard results in important ways, it 
simplifies the proofs, often substantially. Excit¬ 
ing new developments in handling real para¬ 
metric uncertainty (Young et al., 1991) and 
model validation (Newlin and Smith, 1991) will 
be outlined in subsequent subsections. In all 
cases, LMIs play a central role in computation of 
solutions. We believe that LMIs will replace 
Lyapunov and Riccati equations, which are both 
special cases of LMIs, as the central computa¬ 
tional problems in robust control. 

The problem of solving LMIs can be viewed in 
a number of ways, from solving a set of linear 
equalities to minimizing the eigenvalues of a 
Hermitian matrix function (Beck, 1991). One of 
the goals of our current research is to develop 
fast, reliable algorithms for solving LMIs which 
are comparable to what is available for solving 
Riccati and Lyapunov equations. Several re¬ 
searchers have already begun looking at this 
question. 

One approach to solving LMIs is to convert 
them to eigenvalue optimization problems which 
results in convex, non-differentiable functions 
for which numerous optimization methods have 
been developed. Boyd and Yang (1989) compare 
the efficiency of two convex programming 
algorithms, Kelley's cutting-plane algorithm and 
Shor's subgradient algorithm. Boyd and Yang 
find Kelley's cutting-plane algorithm to be most 
effective of the two, since it converges in the 
fewest number of iterations. An alternative 
convex programming method which has been 
used for LMI problems is the ellipsoid method, 
which is also a cutting-plane method. Although 
these methods are easy to implement, they are 
generally too slow to warrant considerable 
attention. Overton (1990) studies the optimality 
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conditions, and develops quadratically conver¬ 
gent algorithms for LMIs. 

Recently, interior point methods have been 
applied to LMl problems with favorable results. 
Interior and exterior point methods are used to 
convert constrained minimization problems to 
differentiable, unconstrained minimization prob¬ 
lems, to which optimization algorithms such as 
Newton’s method are applied (Nesterov and 
Nemirovsky, 1989, 1990). Jarre (1991) has used 
an interior point algorithm for a problem similar 
to LMIs which required substantially fewer 
iterations than does the cutting-plane algorithm 
used by Boyd and Yang. A similar approach is 
taken by Boyd and El Ghaoui (1993). We are 
currently investigating alternative functions for 
solving LMIs using both interior and exterior 
point methods (Beck, 1991). 

13.3. jU, 3, and L| 

We have considered several different measures 
of robust stability and performance in Section 10 
from SS|U to the SSUB. We will concentrate on 
these two measures, and compare them briefly 
with another very important measure that has 
emerged in the L| theory of robust performance 
with structured uncertainty. Space constraints 
preclude a review of the L| theory, which has 
undergone a dramatic and impressive develop¬ 
ment in the last five years in the work of 
Khammash and Pearson (1991) and Dahleh and 
Khammash (1991) and references therein. For 
simplicity, we will refer to the SS^ test as ^ and 
the SSUB upper bound as 3 (since it is directly 
related to quadratic stability), and focus our 
attention on the robust performance problem, 
which clearly includes robust stability as a special 
case. 

The ju, 3 , and Lj tests all guarantee robust 
performance, but with different assumptions 
about perturbations and the norm used for 
measuring the performance objective. The and 
3 theories are used for induced norms, while 
the Lt theory is used for induced norms. A 
second distinction is that the fi theory treats LTI 
perturbations, and the 3 and L, handle 
Nonlinear and Time-Varying perturbations 
(NTV). This is summarized in the table below. 


LTI NTV 


Li fi 3 

L. n L, 


The cases on the diagonal, L2/LTI and L./NTV, 
are both necessary and sufficient for robust 
performance. The L./LTI case is necessary and 


sufficient for robust stability, but the robust 
performance question is open. The 3 case 
(L2/NTV) is sufficient for robust performance, 
and recent results, obtained independently using 
very different methods by Shamma and Megret- 
skii, suggest that it is necessary as well. Recall 
that in general, jU is computed using bounds, but 
that 3 involves solving LMIs, so is attractive 
computationally. L| is also easy to compute, 
involving only the evaluation of L, norms and 
finding the spectral radius of a positive matrix 
(Khammash and Pearson, 1991). 

As a final comparison, it can be easily shown 
that the tests are ordered, with 

H^3r<Ly. (13.4) 

The interpretation of (13.4) for a given system is 
that if the 3 test passes, the n test must pass, 
and similarly for L, and 3. It was shown above 
that fi^3. The inequality 3 £ L, follows from 
the equivalence of the SSUB and the FDCD 
problems, the fact that the L| norm of a 
convolution kernal is greater than the //„ norm 
of its transform, and the results in Khammash 
and Pearson (1991). The inequalities are 
typically strict and it is possible for the gaps to 
be arbitrarily large. 

It is not clear exactly what the implications of 
these results are for control design or for further 
research. Clearly there is a need for more 
refined results, and the ability to both combine 
LTI and NTV uncertainty and exploit additional 
structure such as the slowly-varying nature of 
some perturbations. The results in Safonov 
(1984), Packard and Teng (1990) and Packard 
and Zhou (1989) suggest how this might be done 
in the LFTIn/3 framework, but much more 
work is needed. We also need more precise 
modeling and ID methods to exploit the detailed 
structure of the uncertainty in our models. 

If one accepts 3 as the measure of robust 
performance, a rich theory can be developed, 
with generalizations to uncertain systems of the 
conventional theories of robust stability and 
performance, balanced realizations and model 
reduction (Wang et al., 1991) stabilization (Lu et 
al., 1991) and model validation (Newlin and 
Smith, 1991). It is not surprising that the easiest 
generalizations of standard results to uncertain 
LFT systems is done using the 3 framework. 
Indeed, most of the standard results rely on 3 
machinery, but since ^ and 3 are the same for 
these simple block structures, we are less aware 
of the distinction. Once we begin extending our 
results to systems with uncertainty, the distinc¬ 
tion becomes significant. Of course, a key 
feature of the 3 theory is that computation 
involves solving LMIs. 
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13.4. fi with real perturbations 

In recent years a great deal of interest has 
arisen with regard to robustness problems 
involving parametric uncertainty. These prob¬ 
lems involve uncertain parameters that are not 
only norm bounded, but also constrained to be 
real. Robustness problems involving parametric 
uncertainty can be reformulated as fi problems 
where the block structured uncertainty descrip¬ 
tion is now allowed to contain both real and 
complex blocks. This mixed p. problem can have 
fundamentally different properties from the 
complex p problem studied in this paper (where 
the block structured uncertainty description 
contains only complex blocks) and these 
properties have important implications for 
computation. In this section we give a brief 
review of some recent results in this area. 

It is now well known that real p problems can 
be discontinuous in the problem data (see 
Barmish et al., 1989). As well as adding 
computational difficulties to the problem this 
sheds serious doubt on the usefulness of real p as 
a robustness measure in such cases, since the 
system model is always a mathematical abstrac¬ 
tion from the real world, and is computed to 
finite precision. However it is shown in Packard 
and Pandey (1991) that mixed p problems 
containing some complex uncertainty are, under 
some mild assumptions, continuous in the 
problem data (whereas purely real p problems 
are not). This is reassuring from an engineering 
viewpoint since one is usually interested in 
robust performance problems (which therefore 
contain at least one complex block) or robust 
stability problems with some unmodeled dynam¬ 
ics, which are naturally covered with complex 
uncertainty. Thus in problems of engineering 
interest, the potential discontinuity of mixed p 
should not arise. 

Recent results in Rohn and Poljak (1992) 
show that a special case of computing p with real 
perturbations only is NP complete. While these 
results do not apply to the complex only case, it 
is certainly true that the general mixed problem 
is NP hard as well. These results strongly suggest 
that it is futile to pursue exact methods for 
computing p in the purely real or mixed case for 
even moderate (less than 100) numbers of real 
perturbations, unless one is prepared not only to 
solve the real p problem but also to make 
fundamental contributions to the theory of 
computational complexity. Furthermore, it may 
be that even approximate methods must have 
worst-case combinatoric complexity (Demmel, 
1992). 

These results do not mean, however, that 
“practical” algorithms are not possible, where 


“practical” means avoiding combinatoric (non¬ 
polynomial) growth in computation with the 
number of parameters for all of the problems 
which arise in engineering applications. Practical 
algorithms for other NP hard problems exist and 
typically involve approximation, heuristics, 
branch-and-bound, or local search. Results 
presented in Young et al. (1992) strongly suggest 
that an intelligent combination of all these 
techniques can yield a practical algorithm for the 
mixed problem. 

Upper and lower bounds for mixed p have 
recently been developed, and they take the form 
of generalizations of the bounds for the complex 
p problem presented here (i.e. by applying the 
mixed p bounds to complex p problems one 
recovers the standard complex p bounds). The 
upper bound was presented in Fan et al. (1991) 
and involves minimizing the eigenvalues of a 
Hermitian matrix. This can also be recast as a 
singular value minimization which involves 
additional scaling parameters to the complex p 
upper bound. It is shown in Young and Doyle 
(1990) that the mixed p problem can be recast as 
a real eigenvalue maximization and that this in 
turn can be tackled via a power algorithm, giving 
a lower bound for mixed p. A practical 
computation scheme for these bounds has 
recently been developed (Young et al. (1992)) 
and will be available shortly in a test version in 
conjunction with the ^-Tools toolbox (Balas et 
al., 1991). 

The quality of these bounds, and their 
computational requirements as a function of 
problem size, are explored in Young et al. 
(1991). While the bounds are usually accurate 
enough for engineering purposes, in a significant 
number of cases of interest, they are not. This is 
in contrast with the purely complex nonrepeated 
case, where no examples of problems with large 
gaps have been found. The use of branch-and- 
bound schemes to improve upon existing bounds 
has been suggested by several authors (see 
Balakrishnan et al., 1991; Sideris and Pena, 
1989, 1990; de Gaston and Safonov, 1988 and 
references therein). There are some important 
issues and tradeoffs to be considered in 
implementing such a scheme, which can greatly 
impact the performance. A selection of results 
from a fairly extensive numerical study of these 
issues is presented in Young et al. (1991) and a 
branch-and-bound scheme is proposed which 
should form the basis of a practical computation 
scheme for mixed p. This will be further 
explored in Newlin et al. 1993). 

The upper and lower bounds from complex p 
theory not only serve as computational schemes, 
but are theoretically rich as well. Connections 
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between the bounds and various aspects of linear 
system theory have been established, and further 
work in this area appears to have great promise. 
A theoretical study of the mixed fi bounds may 
yield new insight as well, and this is a subject of 
current research. Initial results in this area are 
presented in Young and Doyle (1993), where it 
is seen that mixed fi inherits many of the 
(appropriately generalized) properties of com¬ 
plex fi, although as has already been seen, in 
some aspects the mixed fi problem can be 
fundamentally different from the complex ft 
problem. 

Problems involving robustness properties of 
polynomials with coefficients perturbed by real 
parameters have received a great deal of 
attention in the literature. This type of 
robustness problem leads to a (real or) mixed fi 
problem. Several celebrated “Kharitonov-type” 
results have been proven for special cases of this 
problem, such as the “affine parameter varia¬ 
tion” problem (see Barlett et al., 1988 for 
example), and the solutions typically involve 
checking the edges or vertices of some polytope 
in the parameter space. It can be shown that 
restricting the allowed perturbation dependence 
to be affine leads to a real fi problem on a 
transfer matrix which is rank one. 

The rank one mixed fi problem is studied in 
detail by Chen et al. (1991) see also the 
references therein. The authors develop an 
analytic expression for the solution to this 
problem, which is not only easy to compute, but 
has sublinear growth in the problem size. They 
are then able to solve several problems from the 
literature, noting that these problems can be 
treated as special cases of “rank one fi 
problems” and are thus “relatively easy to 
.solve”. Even the need to check (a combinatoric 
number of) edges is shown to be unnecessary. 
While many of these results were apparently 
well-known (Chen et al., 1991) provides a direct 
comparison between the polynomial and /x-based 
approaches. 

This rank one case is also studied by Young 
and Doyle (1990) where it is shown that for such 
problems fi equals its upper bound and is hence 
equivalent to a convex problem. This reinforces 
the results of Chen et al. (1991) and offers some 
insight into why the problem becomes so much 
more difficult when we move away from the 
“affine parameter variation” case to the 
“multiUnear” or “polynomial” cases (Sideris and 
Pefia, 1989, 1990). These correspond to fi 
problems which are not necessarily rank one, 
and hence may no longer be equal to the upper 
bound and so may no longer be equivalent to a 
convex problem. These results also underline 


why there are no practical algorithms based on 
“edge-type” theorems, as the results appear to 
be relevant only to a very special problem. 
Furthermore, even in the very special “affine 
parameter case” there are a combinatoric 
number of edges to check. 

13.5. Generalization of ft 
In this section we review an alternative 
formulation of fi due to Fan and Tits (1986) and 
use it to consider one of several possible 
generalizations of fi. The most impiortant 
motivation for this generalization comes from 
the model validation problem (see Smith and 
Doyle (1992) and Newlin and Smith (1991) for 
background). 

For simplicity, the Fan-Tits formulation is 
considered here for the full block only case 
(5 = 0). For any vector or matrix A with n rows 
let A, denote the rows of A corresponding to the 
ith block of A. Thus A, has m, rows. Also, let 
P := I„ be the identity matrix. An alternative 
expression for fi is 

/i = max {ar;or ||x,|| < ||A/,x|| Vi e {1, . . . , F)}. 

(13.5) 

To see that this is equivalent to (3.3) in 
DeAnition 3.1, note that when det (/ - Af A) = 0 
there is an x # 0 that satisAes (/ - AAf )x = 0. 
This X achieves the maximum in (13.5). 
Conversely, any x that achieves the maximum 
provides a way to constructing a A: set A, equal 
to the dyad that satisAes x, = A,M,x. 

An LMI formulation of the upper bound 
follows easily from (13.5). Again, we consider 
the full block only case. 

fi 

»3x#0:a^||x,||'=<|lA/,x||"Vr 
»3x #0:x*(Af;A/, - o-^F;f;)x >0 
»3x=)tO:x*(M*DM-a^D)x2:0 VD e S 

where S := (d = ^ d.P*P, :d,>0 Vi) = D. 

It follows that 

a>fi^3De3) :M*DM- a^D<0. (13.6) 

This is the same as the LMI in equation (3.11). 

The generalization of ft that we will study 
depends on a block structure as before along 
with an index that speciAes certain blocks as 
special or distinguished (Newlin and Smith, 
1991). As an example, consider equation (13.5) 
in the case of two full blocks: 

^ = max {or: a ||xi|| £ ||Af]X|| and 

X 

orlMsIlMjxll}. 
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Suppose the second block has been designated as 
special. Then the generalization is 

Ai = max{Qr:o'||jCi||s||A/ix|| and 


In this example, the designation of the second 
block as special means that the direction of the 
se9ond inequality is reversed and the scaling 
changed. 

The lower bound for this generalization of fi, 
though notationally awkward, is very similar to 
the standard lower bound, and a [jower 
algorithm is being investigated. There is no 
upper bound similar to the upper 

bound, but there is a generalization of the LMl 
above. Again consider our two block example. 


a^fi 


(; 


a^PtPy)x^0 


‘(MjA/z - a-^P^P2)x < 0 
<ii>3x=^0:x*iM*DM-P\(x)D)xsiO VD e £» 
where 3) := {diP*Pi + djP^Pi -di > 0; d2<0) 
and P(a) = (X^P'^P, + a-^P^P 2 . 


It follows that 


O' > ^ 4= BD e S?: M^DM - P{a)"D < 0. 

We see that D is just as in the case of standard 
LMI upper bound for ^ except that we require 
for some blocks that Di < 0 rather than D, > 0. It 
is expected that algorithms for computing 
positive definition solutions to LMIs will be 
easily generalized to solve this problem. 
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A non-scheduled H" robust flight controller has been designed for a 
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Abstnct —This paper presents an flight control system 
design case study for a supermaneuverable fighter flying the 
Herbst maneuver. Ihe Herbst maneuver presents an 
especially challenging flight control problem because of its 
large ranges of airspeed, angle of attack and angular rates. A 
fixed controller has been developed via the mixed- 
sensitivity problem formulation for 20 linearized models 
representing the maneuver. Both linear and nonlinear full 
model evaluations indicate that this single /f* controller 
together with a fixed LQR inner loop feedback have 
achieved "robust stability" and "robust performance" for the 
entire maneuver without gain scheduling. 

1. INTRODUCTION AND BACKGROUND 
Future fighter aircraft development will place 
increased performance requirements on the 
design of the flight control system. Maneuvering 
envelopes are expanding into flight regions 
characterized by significantly larger levels of 
modeling uncertainty than encountered in 
current flight control designs. This expansion of 
the flight envelope poses a challenging control 
problem that requires guaranteeing both robust 
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Stability and robust performance in the presence 
of large plant parameter variations, unmodeled 
dynamics, and nonlinearities. Traditional control 
design techniques that ignore the effects of these 
modeling uncertainties will likely produce 
designs with poor performance and robustness. 
Recent advances in modern control theory, most 
notably the //* synthesis technique (Safonov et 
ai, 1989; Chiang and Safonov (1988), and the 
references therein) offer the promise of a design 
technique that can produce both high- 
performance and robust uncertainty-tolerant 
controllers for next generation aircraft. 

To mature this synthesis methodology, the Air 
Force Wright Research and Development 
Center (WRDC) initiated the Robust Control 
Law Development for Modern Aerospace 
Vehicle program. In this four year program, 
models for both a hypervelocity and super¬ 
maneuverable vehicle were developed, perfor¬ 
mance specifications and uncertainty models for 
the two vehicles were defined (Haiges et al., 
1989, 1990). The control laws for both vehicles 
were documented in Haiges et al. (1991). 

The focus of this paper is to present the 
flight control design methodology developed for 
the supermaneuverable vehicle. For complete¬ 
ness, a description of the supermaneuverable 
vehicle and a summary of the performance 
specification and uncertainty model development 
effort is also provided. 

In phase one of the aforementioned four year 
Air Force WRDC program, the F/A-18 Hornet 
was selected as the basis for the super¬ 
maneuverable vehicle due to its agile perfor¬ 
mance and extensive high angle-of-attack aero- 
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dynamic database. A pair of gimballed engine 
nozzles, which allow in-flight thrust vectoring 
and thrust reversing were added to the model to 
enhance its supermaneuverability. This set of 
additional control effectors enables the vehicle to 
maneuver with extra axial force and moment 
control about its three axes (pitch, roll, yaw). 
Figure 1 shows the basic vehicle configuration. 

The Herbst Maneuver (Herbst, 1980) was then 
selected to demonstrate the fr° design technique 
because of its extreme dynamic behavior: large 
angles of attack, rapid rotational rate changes, 
large angular accelerations, and wide variations 
of dynamic pressure and speed. As described in 
Herbst (19^) the maneuver can be initiated 
from level fli^t at an altitude of 10,000 ft and a 
velocity of 300 ft sec"'. A pitch-up is com¬ 
manded until a maximum angle of attack of 80° 
is reached. This causes flight path angle and 
altitude to increase and causes velocity to 
decrease rapidly until the wing stalls. At a 
post-stall velocity of 50 ft sec"', and 85° bank 
angle command is initiated using engine thrust 
vectoring to rotate the flight path heading by 
180°. Once the aircraft begins to roll, the angle 
of attack is decreased and velocity is increased. 
At this point, the maneuver is complete. Figure 
2 shows the actual maneuver trajectory. 

In phase two, the flying quality specifications 
and uncertainty models were developed. A 
performance specification for the regulator 
design was also developed from the maximum 
allowable dynamics envelope described in 
Hodgkinson (1982). The performance design 
constraint is to achieve a feedback regulator with 
closed-loop singular-value frequency-response 
bounded by such an envelope. 

The sources of uncertainty in this maneuver 


were divided into four different types: 

Type 1. Uncertainty in the aerodynamic data 
base from wind tunnel test error, which includes 
Yp, L'p, U, Li,, Mo, N'p. K, K, L'sa, 
^SA. and NJjr (see McRuer etal. (1973) for 
definitions of standard aerodynamic derivatives). 
Type 2. Time delay at plant outputs (12.5 msec). 
Type 3. Neglected sensor/actuator dynamics (14 
actuators, five sensors). 

Type 4. Variations in structural mode frequency 
(±5.5%), damping (±31%), and shapes (±10 — 
15%) due to mass variations. 

Type 5. Variations in mass and inertia pro¬ 
perties (±10 — 24%). 

The robustness design objective is to fly the 
Herbst maneuver with robust stability in the 
presence of all these uncertainties. Together 
with the above performance specifications, the 
regulator design should provide “robust perfor¬ 
mance” throughout the maneuver envelope. 

In this paper, a novel multi-model design 
concept is presented. It is shown that with a 
robust equalization inner loop, an eight-state 
3x3 mixed-sensitivity H” controller (feeding 
back angle of attack a, sideslip angle j9, and 
bank angle rate to appropriate linear 

combinations of aerodynamic and engine-thrust- 
vectoring actuators) is capable of flying the 
Herbst maneuver without gain scheduling. This 
special “non-scheduling” capability provides 
immediate design advantages: 

• Reduced software/hardware implementa¬ 
tion effort. 

• Lower software/hardware maintenance 
requirements. 

• Guaranteed robust performance throughout 
the maneuver envelope. 
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The latter is possible because the time-variations 
in the controller due to gain-scheduling are 
eliminated and time variations in the model are 
greatly reduced by the robust equalization 
inner-loop. The net effect is that time-variation 
is reduced to the point that conditions of 
small-gain stability robustness and performance- 
robustness criteria can be made to hold for all 
time-variations in the plant using a linear-time- 
invariant /f” controller. 

Section 2 describes the general multi-model 
flight control design methodology. Section 3 
shows the results of applying this method to the 
design of Herbst maneuver flight control system. 
Conclusions are presented in Section 4. 

2. CONTROL LAW DESIGN APPROACH 

The objective of the flight control design is to 
find an robust controller capable of flying the 
Herbst maneuver with robust performance in the 
presence of the uncertainties described in 
Section 1. 


2.1. Control structure 

The overall structure of the flight control 
system uses an explicit model-following ap¬ 
proach inherited from the DMICS (Design 
Methods for Integrated Control Systems) con¬ 
cept presented in Shaw et al. (1988). Figure 3 
shows the block diagram, which contains the 
following functional elements: 

(1) Maneuver Command Generator (MCG)— 
M{s). 

(2) Configuration Management Generator— 
CMG. 

(3) Feedforward controller— K^is). 

(4) Feedback regulator— 

(5) Control selector—CS. 

The maneuver command generator (MCG) is 
the forward path model of the explicit 
model-following control system. Its output 
signals are used to form the error and 
feedforward terms for the model-following 
regulator. The MCG is designed to directly 
embody the desired flying quality characteristics 
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Fig. 3. General flight control block diagram. 


for each mode of flight. The primary inputs for 
the MCG are the pilot’s controller commands 
(e.g. longitudinal stick force). Its outputs are 
open-loop time-histories of various aircraft 
attitude angles and rates which, if perfectly 
tracked by the feedback controller K^is), 
result in an ideal maneuver. For the controller 
which we designed, the signals output by the 
MCG were commands for angle of attack (a), 
sideslip angle (fi), and bank angle rate {(int)- 

The functions of feedback regulator ^ 1 ( 5 ) are 
to: 

(1) precisely track the MCG commands; 

( 2 ) reject plant disturbances; 

(3) reduce sensitivity to plant parametric 
variations; 

(4) provide robust stability for the sytem. 

The flrst three tasks can be handled by 
minimizing the sensitivity function S = {I + 
G/C])-'. In flight control, the fulfillment of 
fourth requirement can be facilitated by a 
nonlinear control selector (CS) which accepts as 
its inputs control generalized controls (which are 
equivalent to angular accelerations in pitch, roll, 
and yaw) and generates appropriate commands 
to the physical control effectors on the aircraft 
depending of the flight condition (Shaw et al., 
1988). The use of generalized controls provides a 
decoupling between the design and implementa¬ 
tion of control law gains from the selection of 
specific effectors for producing the desired 
control accelerations. In addition, the regulator 
design is simplified by inclusion of the CMG 
(configuration management generator). The 
CMG is a nonlinear element which generates an 
approximation to the desired trim solution 
throughout the flight envelope, so the linear 
flight control system regulator functions as a 
perturbational device. 

The feedforward controller K 2 is a command 
response shaping filter which can be used to 
^peed up the aircraft response if the regulator is 
too sluggish. If the regulator has a faster control 
bandwidth than the MCG, K 2 can be dispensed 


with, without degrading the command response. 
In fact, the Bode sensitivity theorem indicates 
that most of the feedback issues such as 
disturbance rejection, plant parametric sen¬ 
sitivity reduction and so forth are not a function 
of K 2 {s), rather, they are only a function of the 
feedback regulator Ki{s). 

The control selector (CS) is used to distribute 
to the aircraft’s control effectors the generalized 
control moment commands {P, Q, R) — P, Q, R 
are standard notation for the rates of change of 
and aircraft’s roll, pitch and yaw angle, 
respectively (McRuer et al., 1973). The control 
selector attempts to neutralize the effects of 
changes in dynamic pressure or engine thrust so 
that the effective control forces and moments are 
constant throughout the maneuver. It is a 
nondynamical nonlinear device designed to 
compensate for some of the aircraft’s non- 
linearities so that the overall response of the 
aircraft angular rates P, Q, R to inputs to the 

control selector is very nearly -/^xs in the 

s 

mid-frequency range, below wing and fuselage 
bending mode frequencies and above the 
frequency of aerodynamic response modes— 
about 3-20 rad sec~' in our case. Therefore, its 
gains are a function of the vehicle’s operating 
state (e.g. Mach, altitude, thrust setting) and 
are scheduled throughout the maneuver. It also 
can provide control redistribution in the event of 
actuator saturation or failure (Shaw et al., 1988). 
In general, the control selector transformation is 
based upon available control power. 

The main issues in the control design— 
command issues and feedback issues—are 
addressed by the MCG, the feedforward 
controller, and the feedback regulator. It was 
proved in Pernebo (1981) that, for linear control 
systems, the command issues can be separated 
from feedback issues completely. In other 
words, the nominal command response can be 
“shaped” by a prefllter and feedforward (in the 
present case, also collectively known as the 
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MCG) both designed after the feedback 
controller loops. The particular choice of 
feedback in no way limits or changes the set of 
nominal command response transfer functions 
that are attainable via feedforwards and/or 
prefilters. This philosophy was executed thro¬ 
ughout the design process. In this paper, we 
focus primarily on the ''feedback issues*' of the 
problem, i.e. how to design a feedback regulator 
with good tracking pierformance and large 
stability robustness. 

2.2. Regulator design—performance and 
robustness 

Regulator design has two fundamental objec¬ 
tives, Robust Tracking Performance and Robust 
Stability. To achieve robust performance, one 
needs to define a proper feedback structure and 
a set of suitable performance specifications. To 
accomplish robust stability, one has to determine 
a set of linear models to be stabilized in the 
presence of uncertainties in pre-defined sets. The 
following sections elaborate these concepts in 
detail. 

Performance specification. As discussed in 
Section 2.1, the MCG is designed to embody 
desirable flying qualities properties for each 
mode or condition of flight. This is done by 
utilizing command filters patterned after the 
low-order equivalent system (LOES) transfer 
functions for aircraft response that are described 
in the military standards (MIL-STD-1797, 1987). 

In Hodgkinson (1982), the author defines 
frequency-domain limits on the maximum 
allowable dynamics that may be added to the 
ideal LOES transfer functions without degrading 
the flying qualities. This envelope of maximum 
allowable dynamics, N{s), is shown in Fig. 4. 
For the study described herein, N{s) has been 
interpreted as a performance specification as 
follows. 


Let M{s) be the desired Ideal” aircraft 
response, e.g. as described by the military 
standard LOES transfer functions 

y = AY{i)d.,„k, 

where y is the vectoi of output to be controlled 
and (5rt,ck is the pilot’s control stick position. By 
definition, when the maximum allowable un- 
noticeable dynamics are added to the system this 
relation becomes 

y = A/(s)Af(s)«5.„ek. 

Since the MCG is designed to provide the 
desired “ideal” flying qualities, the transfer 
function of the MCG may be substituted for 
M{s). 

From the block diagram in Fig. 3, it is found 
that the overall input to output relation for the 
system is given by: 

y = A/,(5)Af(j)d«,.ek. 

where 

Thus, it is seen that if Ni(s) can be maintained 
within the envelope of N{s), the added dynamics 
from the closed-loop tracking system will be 
unnoticeable to the pilot. Hence, the gain/phase 
envelope of N{s) becomes the “feedback 
performance specification” for the regulator 
design. 

As a final note, the N{s) specification 
described in Hodgkinson (1982) was developed 
for aircraft response in the pitch axis. However, 
due to a lack of available standards for other 
performance specifications, it was adopted as the 
specification for all of the regulator channels in 
our design. Proper bandwidth scaling between 
position and rate variables will be imposed at a 
later stage of the /f* design. 

In supermaneuvering flight, the rate of roll 
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of the aircraft body about its velocity vector 
Vt is important to be fed back for tracking 
performance—see Fig. 1. In particular, the 
so-called “rotational part’’ of this variable 
defined by 

/* cos (or)-♦-/? sin (or), (1) 

provides a useful measure of what a pilot would 
like to control during high angle-of-attack rolling 
such as in the Herbst maneuver. Basically, in 
such large angle maneuvers, neither the body 
jr-axis nor the velocity vector (wind x-axis) can 
be used as the roll-axis reference, they change 
directions themselves. Thus, to achieve a 
coordinated maneuver, controlling body roll-axis 
motion P has been replaced by control of the 
rotation about the total velocity vector. The 
variable described in (1) representing a 
combined motion of roll and yaw (with zero 
sideslip angle) serves this purpose satisfactorily. 
In other words, precisely controlling this variable 
enables the aircraft to perform a “coordin¬ 
ated” large angle roll maneuver with zero 
sideslip angle. Details of this concept can be 
found in Kalviste (1986). This “decoupled” 
response (roll with zero sideslip angle) is one of 
the fundamental requirements of tracking 
performance. 

Uncertainty models. As described in Section 1, 
five sources of uncertainty were identified for the 
current design study. The aerodynamic uncer¬ 
tainties were based upon the differences between 
two aerodynamic data bases—wind tunnel and 
flight test data. This formed the structured 
uncertainty models for the aircraft stability and 
control derivatives. For the purposes of 
controller design, the types 2-5 uncertainties 
were treated as unstructured. However, in the 
subsequent linear evaluation of our design, it 
proved convenient to exploit the fact that in 
actuality the type 4 and 5 uncertainty in our 
problem turns out to be dominated by a known 
nonlinear function of a single real parameter, 
viz. the total weight of the aircraft (which varies 
as fuel is consumed). 

Type 2 uncertainty was treated as unstructured 
multiplicative uncertainty at the plant outputs. 
The neglected structural modes and sensor 
dynamics were considered as frequency bounded 
multiplicative uncertainty at the plant outputs. 

Linear model development, "rwenty 56-state 
linear-time-invariant models were generated by 
linearizing a faithful high-order nonlinear simu¬ 
lation model at various points along the nominal 
Herbst-type maneuver trajectory described in 
Section 1. As shown in Fig. 2, position A is the 
starting point. Position B is the design point due 
to its “central position with respect to all the 


other linear models (this will be elaborated in 
Section 2.3). Position C is the turning point, 
which is also considered to be the “worst case” 
due to its rapidly changing aerodynamics. At 
position D, the maneuver is complete. All the 
aerodynamic data were based on a full F/A-18 
database that was integrated into the nonlinear 
simulation model. The load condition of the 
aircraft was characterized as carrier landing gross 
weight with empty fuel tank. 

The 56 states in the linear models from 
position A to D represent actuators, sensors, 
structural modes, engine states, and the standard 
six degrees-of-freedom flight dynamics. The 
available control actuators are stabilities, ailer¬ 
ons, rudders, pitch and yaw thrust vectoring. 
Throughout much of the maneuver, the engine 
thrust vectoring actuators provide most of the 
control energy, since other regular control 
surfaces are not as effective at the relatively low 
airspeeds which are characteristic of the 
post-stall Herbst-type maneuver. The distribu¬ 
tion of control energy to the available control 
actuators has been carefully designed via the 
control selector described in Section 2.1. 

The standard offset derivative technique was 
used to linearize the aircraft model about each 
particular operating point. Since the operating 
point continuously changes throughout the 
maneuver, one obtains a different linearized 
model for each time. Thus, although each such 
model is a linear-time-in variant model, the 
model varies with time. Furthermore, the rate of 
time variation is only slightly slower than our 
design bandwidth. In other words, the models 
might not be adequate to predict the stability 
and performance at each operating point along 
the nonlinear maneuver trajectory. Fortunately, 
after the robust equalization inner-loop feedback 
is closed, the amount of variation between 
models becomes sufficiently small that the 
small-gain stability theorem can be used to 
guarantee both stability and performance rob¬ 
ustness throughout the maneuver. 

Model reduction is always needed to get a 
suitable set of states to be stabilized by the 
regulator. This will be elaborated in Section 3 
for the Herbst design. 

2.3. Multi-model design concept 

After the linear models and design require¬ 
ments are developed, the actual design process 
can be shown in Table 1. 

In this section. Steps 2 and 3 are discussed in 
detail. Results for a continuous controller that 
demonstrates only the major design concept are 
presented in this paper. The digitization effects 
involved in Step 4 will be examined later. Model 
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Table 1. Design flow chart 



reduction for this design is presented in Section 
3. Further discussion on the model reduction and 
its robustness issues can be found in Safonov and 
Chiang (1988b). 

Robust equalization loop design. This particu¬ 
lar step plays a crucial role in the entire design 
process. If the //” design procedure had been 
directly applied to various linear models along 
the trajectory individually, each design could 
have only assured robust performance for one 
single linear model. Moving one controller to 
another operating point could not be guaranteed 
to produce a stable response. This is because 
other models along the trajectory are never 
considered in the single point design process. 
Implementation in the traditional fashion would 
require massive gain scheduling of the controller 
throughout the maneuver. Neither the system 
robustness could be guaranteed after the gain 
scheduling, nor would a simple fixed linear-time- 
invariant controller implementation be possible. 

A new multi-model design concept that 
employs an extra inner loop can remove the 
disadvantages and inadequacies of the gain- 
scheduled “single” point design approach. It has 
long been known that feedback can reduce the 
sensitivity to plant uncertainty. Zames (1981) 
characterized this property in terms of shrinking 
an H'‘ ball of uncertainty with feedback. With 
this concept in mind, the role of inner loop 
becomes obvious: It can “shrink” the “distance” 
between different models, thereby reducing the 
modeling uncertainty between them. 

Our design process began with analyzing the 
“distance” between linear time-invariant models 
of the plant G(r) obtained by linearizing at 


several different along the nominal Herbst 
maneuver trajectory. Recall that the control 
selector has been designed so that the mid¬ 
frequency range (3 rad sec~‘< <u <20rad sec"*) 
response to P, Q, R is approximately 
With the control selector in place, this same 
mid-frequency response in the P, Q, R channels 
occurs irrespective of the aircraft attitude and 
velocity throughout the entire maneuver. Of 
course, at lower frequencies the responses may 
vary substantially. However, the fact that the 
mid-frequency responses are all about the same 
greatly eases the robust control design task. It 
means that a fixed pure-gain inner-loop control¬ 
ler having loop singular value Bode plot 
magnitude crossover frequencies in this mid¬ 
frequency range has the potential to stabilize all 
the plants, in addition to drawing their 
low-frequency-range frequency responses more 
closely together, shrinking the distance between 
them so that they are all fit inside a smaller //°° 
ball. Because we have that all the plant vari¬ 
ations have about the same 1/s mid-frequency- 
range behavior and because the low-frequency 
differences though drastic, are not so extreme as 
to alter the phase by more than about 90° in any 
channel, a pure gain or simple LQR inner-loop 
feedback with crossover in the 3-20 rad sec'* 
range can be expected to do this robustly. We 
decided to use an LQR inner loop feeding back 
the five states a, P, P, Q, R of the nominal 
reduced-order linearized plant G. The LQR 
penalty function used was Jo (jt'jc- t-pu'u) dt 
(with the scalar p > 0 chosen using the LQR 
return-difference equality as a guide) to produce 
an inner-loop crossover frequency of about 
10 rad sec"'. Examining the response to the cr, 
P, channels, we found, not unsurprisingly, 
that the 10 rad sec"* inner-loop also shrinks the 
distance between these models. 

To measure the distance between two models, 
say G and Go, we used the spectral radius of the 
multiplicative error 

|^n.«(A(y(W)|, 

where A = (G - Go)G,7'. However, if the two 
plants have different numbers of right-half plane 
poles the distance between them is defined to be 
infinite. 

The following inequality gives some indication 
why the multiplicative error, instead of the 
absolute error |Am,x(C' “ Go)|, is cho.sen for the 
inner loop design: 

|A„„[A1| = |W(G-Go)Go"']| 

^|A,„JG-Go]||A„„[Go*]1. 

Clearly, the multiplicative error can be large 
even when the absolute error is small if 
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Amu[(7o *] is large, as is usually the case at high 
frequencies. 

Why is the multiplicative error important in 
multi-model design? The reason 'is that when the 
multiplicative error between models is small, it 
guarantees that the logarithmic Bode plot of a 
perturbed model stays inside an error band along 
the nominal model, which from the control 
design point of view is a way of assuring robust 
performance for both models with single 
feedback control law. In papers by Safonov and 
Chiang (1988a) and Safonov et al. (1990) the role 
and use of the multiplicative error was 
elaborated in detail. In particular, a “Robustness 
Theorem” presented in these papers provided a 
sufficient condition for several models to be 
stabilized by a single feedback. (Another 
approach, involving the “Gap Metric”, which is 
mathematically much more subtle but accom¬ 
modates uncertain right-half plane poles, might 
also have been used to measure the distance 
between two models (Georgiou and Smith, 
1990). However, as all our plant models turned 
out to be stable, gap metric analysis was 
unnecessary.) 

High performance H* loop design. The 
standard mixed-sensitivity weighting strategy 
(Safonov and Chiang, 1988a) was used in our 
problem formulation 


min 


Wi5 


1 . 


( 1 ) 


where the low-frequency loop transfer function 
(L(j) = GF) is shaped by penalizing the 
sensitivity function S = {I + GF)~' to reject the 
plant disturbances and noises and the high- 
frequency L{s) is shaped by penalizing the 
complementary sensitivity T = GF{I + GF)~' to 
cope with uncertain actuator nonlinearities, 
unmodeled structural modes and sensor/actuator 
dynamics. Figure 5 shows the plant augmenta¬ 
tion strategy associated with this approach. The 
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numerically robust loop-shifting formulae de¬ 
veloped in Safonov et al. (1989) were used to 
find the optimal H“ controller. 

A salient feature of the mixed sensitivity 
approach is that it treats the uncertainty as 
unstructured. The frequency dependent weights 
W,(j) and ^^(r) are used by the control designer 
to specify the desired loop-shapes. The “all¬ 
pass” property of optimal control laws 
ensures that the singular value Bode plots will 
precisely conform to those specified by Wi(f) and 
W 3 (.r) (see Safonov and Chiang (1988a) for 
details). However, there is a serous drawback to 
this property, namely, the resulting mixed- 
sensitivity controller always blindly cancels 
the LHP poles of the plant, which can seriously 
damage the system robustness against structured 
(parametric) uncertainties associated with uncer¬ 
tain lightly-damped pole locations, even though 
the overall loop transfer function is shaped 
exactly to the specifications (weighting functions 
WJ"' and IV^'). As for the reason why there 
exists such an unpleasant phenomenon, Chiang 
et al. (1990) examined the SISO case and Sefton 
and Glover (1990) examined the MIMO case. 

The robust equalization loop removed this 
difficulty completely. It provided large multivari¬ 
able stability margin against structured para¬ 
metric uncertainty of the plant by damping the 
uncertain lightly-damped “dutch-roll” modes 
with inner loops. This automatically resolves the 
problem of lightly-damped pole/zero cancella¬ 
tion associated with the mixed-sensitivity 
problem formulation. The result is shown in 
Section 3. This inner-loop strategy for increasing 
the plant parametric robustness was first 
proposed in Chiang et al. (1990). Compared to 
other approaches such as ju-synthesis (Doyle et 
al., 1987) it is our opinion that this approach 
offered a simpler and more direct solution in this 
application. 

Exact “feedback decoupling” is a non-trivial 
design task (Falb and Wolovich, 1967; Isidori et 
al., 1981). However, via the mixed-sensitivity 
H“ design method described above, feedback 
decoupling becomes an extremely simple task. 
Inside the control bandwidth |lVi(/(u)|»l and 
the constraint on the cost (1) imposed by VVi(.r) 
ensures that the sensitivity function {S = {1 + 
GF)~') will be smaller than |/lWi|«l, thus 
ensuring that the closed-loop transfer function 
T = I — S will be approximately equal to the 
identity matrix (/). Clearly, / is diagonal and 
therefore decoupled. 

3 HERBST MANEUVER DESIGN 

An H“ controller has the same order as the 
plant, so it is desirable either to approximate the 
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//“" controller by a reduced order controller as 
final design step or to use a reduced order plant 
model so that the resultant controller will be 
of correspondingly reduced order. We chose the 
latter approach. 

The 56-state full order model was reduced at 
design stage by applying the following simplifica¬ 
tions, 

(1) Sensors and structural modes are fast as 
compared to design bandwidth (lOrad sec“'), 
and may therefore be neglected. 

(2) Actuators dynamics and nonlinear effects 
can be neglected in the plant models and 
accounted for in the uncertainty specification. 

(3) The engine states are decoupled from the 
other dynamics in this maneuver, and may be 
removed. 

(4) The trajectory related states such as Euler 
angles (0, 0, 0), altitude (h), and total velocity 
(V|„i) naturally diverge during maneuvering flight 
and therefore should not be stabilized by 
feedback. Instead one should stabilize the 
rotational states: angle of attack (or), sideslip 
()8), and angular rates (P, Q, R) so that the 
pilot/MCG may guide the aircraft through the 
desired maneuver by commanding those trajec¬ 
tory states. 

A singular perturbation model reduction routine 
was used to remove the unwanted states, leaving 
the linear model with five states (or, /J, P, Q, R) 
to be stabilized by the regulator. 

Further simplification of the linear models was 
permitted due to the use of the decentralized 
DMICS control structure described in Section 
2.1. For the regulator design, the control 
selector is assumed to be “perfect”. That is, the 
demanded control forces (i.e. the generalized 
controls) are exactly achieved by the aircraft’s 
aerodynamic and propulsive control effectors. 
With this assumption, the rows of the “B 
matrix” of the linear model corresponding to the 
P, Q, R state derivatives may be replaced by a 
generalized control effectiveness matrix (a rank 
three standard basis matrix), which includes 


three inputs and allows direct command of P, 
and R, 

Since all the states were available in our 
model, a fixed LQR inner loop gain with unity 
“state” and “control” weighting was then 
developed to the design model (position B) and 
the worst case model (position C) of the 
maneuver. Using “unity” weighting keeps the 
control energy normal and leaves the system 
bandwidth unchanged. If the unity LOR weights 
are not enough to minimize the multiplicative 
error between models, a heavier weighting 
strategy can be employed until the goal— 
multiplicative error is less than one, is achieved. 

Figure 6 shows that the multiplicative error 
between the five-state “design model” (at 
maneuver point B) and the 51-state “evaluation 
models” (at maneuver points A, C and D 
without five trajectory states) has been reduced 
to less than unity throughout the control 
bandwidth 0-10 rad sec“^ after the unity weight¬ 
ing LOR inner loop is closed. In other words, all 
the linear models have been approximately 
equalized by a single LOR feedback. Moreover, 
this fixed LOR gain also stabilized all the linear 
models (either five-state or full 56-state) along 
the maneuver from point A to D (see Fig. 7). 
The robustness theorem of Safonov and Chiang 
(1988b) implies, roughly speaking, that a 
controller will produce similar performance for 
all plants whose deviation from the nominal has 
a multiplicative error of less than about -3 dB 
inside the control loop bandwidth- We see from 
Fig. 6 that, with our robust equalization 
inner-loop in place, this conditions holds for 
controllers having bandwidth up to about 
10 rad sec ^ Consequently, it follows from the 
robustness theorem that a fixed /P" controller 
should be able to accomplish the outer loop 
design without gain scheduling. 

It is well known that linear quadratic regulator 
feedback is highly robust with guaranteed 
(0.5, co) gain margin and ±60 degree phase 
margin in every feedback channel (Safonov and 
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Athans, 1977). In our case, a fixed LQR gain has 
created an extra benefit for the outer loop 
design, namely, it makes the outer loop H“ 
design much more robust against plant para- 
meteric uncertainty. 

The //“ weighting matrix W, was chosen as 
W 3 = diag (100/J^ 100/^^ 10/i)~’, which roughly 
coincides for tu > 10 with the lower bound from 
the specification constraint on maximum un- 
noticeable dynamics shown in Fig. 4. Although a 
higher bandwidth also meets the constraint, the 
aircraft control design does have high-frequency 
constraints at and beyond 10 radsec~' (due to 
actuator rate limits, unmodeled structural 
modes, etc.) that are addressed by the 
high-frequency roll-off imposed by W 3 . In other 
words, the choice of limits the //" design 
bandwidth to about 10 radsec~' so that the 
high-frequency unstructured uncertainties cannot 
drive the system into instability (see Fig. 6 ). The 
different roll-off rates were used to ensure that 
the positions (or, ^) and rate (/Xr„,) channels are 
penalized equally. Without such consideration, 
high-frequency lead would enter the control 
dynamics, amplifying sensor noise and causing 
undue actuator saturation. 

The low-frequency penalty W, was chosen to 
be 


W, = p 


/ j + 0.01 

\0.02s -I-1 



where the DC gain p is used as a design “knob" 
to push the sensitivity function 5 down as much 
as possible, until the complementary sensitivity 
T is pushed against Wf'. Since the H’“ controller 


will contain the dynamics of Wj"', the zero 
location ( 0 . 01 ) was chosen for the low-frequency 
range disturbance rejection, the p>ole location 
( 100 ) was chosen to ensure a well-posed 
state-space solution of the H“‘ mixed-sensitivity 
problem (Chiang and Safonov, 1988). As a 
consequence of minimizing the sensitivity func¬ 
tion, commands will be tracked well and wind 
gust disturbances can be substantially 
attenuated. 

Some output scaling between position vari¬ 
ables and rate variables is necessary in 
multivariable control system design. All of the 
rate variables (P, Q, R, /i„„) are scaled down by 
a factor of ten. so that ten degrees-of-position 
deflection are comparable to one deg sec~’ rate 
increment (since the design bandwidth is 
approximately lOradsec”'). 


3.1. Design results 

Linear evaluation. In this section, the results 
of nominal design (five-state at position B) and 
the results of connecting the same controller 
with the evaluation models (Sl-state at positions 
A, C, D) are shown. 

Figure 8 illustrates the “perfect” loop shaping 
S/T functions and step response for the design 
model (five-state at point B). Figure 9 shows the 
loop shaping for different operating points. 
Notice that the shapes of the sensitivity functions 
indicate that the wind gust disturbance at 
frequencies below Iradsec'^ will be attenuated 
by a factor of 1000 throughout the maneuver. 
Figure 10 shows the step responses. Clearly, the 
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Fig 8 Results of design for five-state nominal model at position B 


tracking loops (a, completely de¬ 

coupled at design point (position B) and are 
coupled around the worst case operating point 
(position C). 

With the robust equalization inner loop, it is 
not surprising to see that at a particular 
operating point, the design can tolerate a lot of 
structured parametric uncertainty, yet still can 
maintain the outer loop tracking and reject the 
high-frequency unstructured uncertainties. 

Numerically reliable algorithms (Tekawy et 
ai, 1989) were used to compute the Structured 
Singular Value (SSV) of the multivariable 
stability margin seen by the aerodynamic 
uncertainties. The 51-state evaluation models 
were used here as well. Figure 11 shows the 
complete interconnection of robustness test. The 
dashed lines in Fig. 12 show that in such a high 
angle-of-attack and low dynamic pressure man¬ 
euver, the system with a “robust equalization 
inner-loop” has “robust stability” against all the 
anticipated uncertainties ot type 1, 2 and 3. 
Uncertainty types 4 and 5 turn out to depend 
almost entirely on a single parameter, viz. the 
aircraft total weight (as determined by its current 
fuel load). Accordingly, to evaluate the effects of 
including these additional uncertainties, it 
sufficies to examine how the SSV plots for 
uncertainty types 1-3 vary as aircraft weight 
varies. We did this for several possible values of 
aircraft weight and found that the SSV plots 
varied little and remain below 0 dB as the weight 
varied between its minimum and maximum 
values as shown in Fig. 13. Thus robust stability 


IS guaranteed for all of the uncertainty types 
1-5. 

The set-up for “robust performance” evalua¬ 
tion is shown in Fig. 11, where the weighting 
filter Wy is cascaded with an extra fictitious 
uncertainty block (Ape^r) and evaluated simul¬ 
taneously with the rest of uncertainties (A^.b, 
Aaci> Asbn) (Doyle et aL, 1982). The solid lines 
in Fig. 12 are the results of robust performance 
evaluation for 51-state models at position A, B, 
and C of the maneuver with the aircraft at its 
nominal weight—similar plots result for other 
values of total aircraft weight between its 
minimum and maximum (type 4 and 5 
uncertainty). This implies that this design can 
track commands accurately in the presence of all 
the anticipated uncertainties (types 1-5). 

(Plant and uncertainty data are available from 
the authors. Interested parties may recieve the 
data by sending a blank 3.5 in 1.44 MB IBM 
floppy and a self-addressed stamped envelope.) 

Nonlinear evaluation. After the linear evalua¬ 
tion of the design, it is necessary to check the 
robust performance from a nonlinear perspec¬ 
tive. The main objective at this stage of the 
design was to evaluate the robust performance of 
the //“ design in the presence of the nonlinear 
aerodynamics of the Herbst maneuver. Other 
issues including actuator rate limits, man¬ 
euverability and kinematic coupling effects can 
also be explored. 

The eight-state optimal /f* controller was 
implemented in our complete nonlinear Simula- 
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Fig II Block diagram for robustness evaluation 


tion program with the standard six degrees-of- 
freedom equations of motion, nonlinear ac¬ 
tuators, sensors and structural modes. The MCG 
and CMG were also implemented to generate 
the proper command profile of the Herbst 
maneuver. 

In Fig. 14, an angle-of-attack command and a 
roll pulse command were generated from the 
MCG. The engine level was set to full-throttle to 
build up the proper required angle of attack for 
this nameuver. Figure 15 shows the associated 



angular rates of the airplane. As shown in Fig. 
16, once the angle of attack reached its 
maximum (78°) and the total velocity dropped 
down to its minimum (55 ft sec"'), a roll pulse 
command (^rnt) was initiated to rotate and turn 
the aircraft heading 180°. Figure 17 shows the 
responses of the control surfaces—horizontal 
tails (($;/), leading/trailing edge flaps (df., 
and thrust vectoring (6rv) They are all well 
within saturation limits as expected. 

As the linear analysis predicts, the following 
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Fig. 12. SSV of total robust stability (□), individual robust stability (dashed), and total robust performance 
(A) against uncertainty types 1-3 (aerodynamic, actuators and sensors). 
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Fig. 15. Full model nonlinear simulation—altitude {h), total velocity heading angle (y) 


facts are observed from the nonlinear time 
history: 

• The aircraft is stabilized by the fixed 
eight-state 3x3//“ controller and the fixed 
LQR pure gain inner-loop controller thro¬ 


ughout the maneuver without gain 
scheduling. 

• The aircraft tracks the angle of attack (or) 
and roll commands (/iroi) closely. 

• Only two degrees peak sideslip angle (/3) 
occur after roll command (^rm) is initiated. 


PBDG: PGM(—) vb lime QBDG: QGM(—) vs time 



Sec 

Pio. 16. Full model nonlinear simulation—angular rates {P, Q, R). 
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FiO. 17. Full model nonlinear simulation—horizontal tails (6„), leading edge Haps (d/v), trailing edge 

(d/r)i thrust vectoring 


4. CONCLUSIONS 

This paper presents a new approach involving 
the use of theory for the design of a robust 
controller for a supermaneuverable fighter 
performing a Herbst-type maneuver. The fighter 
model is a modified F/A-18 Hornet with a pair 
of gimballed thrust vectoring nozzles. With the 
supermaneuverability provided by the thrust 
vectoring, the challenging Herbst-type man> 
euver, was selected. The uncertainty model and 
flying qualities specifications for this maneuver 
were identified and cast into the design process. 

Both linear robustness analysis and full 
dynamics nonlinear simulation have shown the 
combined power of //" and the robust 
equalization inner-loop. The use of the inner 
loop as an equalization loop for multi-model 
mixed sensitivity //" control synthesis is an 
essentially new idea. This equalization property 
was the key to being able to design a fixed (i.e, 
not gain-scheduled) controller for which robust 
performance can be guaranteed throughout the 
wide range of plant time-variations associated 
with the Herbst maneuver. The role of 
inner-loop feedback in equalizing plants subject 
to large variations is in addition to, and should 
not be confused with, its important and better 
known role (Safonov et aL, 1990) in damping 
plant poles so as to avoid the robustness 
problems that result from the inevitable 
cancellation stable plant poles by zeros of H’" 
mixed-sensitivity controllers (see, for example, 
Sefton and Glover, 1990). 

By combining mixed-sensitivity //“ control 
synthesis with the robust equalization inner- 


loop, we have been able to achieve a robust 
flight control design that not only can stabilize 
the aircraft throughout the entire maneuver in 
the face of anticipated structured and unstruc¬ 
tured uncertainties and time-variations, but also 
possesses approximately decoupled tracking 
performance throughout the maneuver en¬ 
velope. Most importantly, the new multi-model 
design concept presented here allowed the 
eight-state 3x3 controller to be imple¬ 
mented without gain scheduling. 
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Robust Integrated Flight/Propulsion Control 
Design for a STOVL Aircraft Using /f-infinity 
Control Design Techniques *t 

SANJAY GARG^ 

Proper formulation of the optimal control synthesis problem is shown to 
lead to centralized controllers for integrated flight/propulsion systems that 
provide robustness to plant parameter variations and modelling 
uncertainties. 


Key WoiA —Robust control; robustness analysis; control system synthesis; centralized control; flight 
control; propulsion control; integrated plant control; control applications 


Abstract —Results are presented from an application of 
control design methodology to a centralized integrated 
flight/propulsion control (IFPC) system design for a 
supersonic Short Take-Off^ and Vertical Landing (vSTOVL) 
hghter aircraft in transition flight. The emphasis is on 
formulating the //„ optimal control synthesis problem such 
that the critical requirements for the flight and propulsion 
systems are adequately reflected within the linear, central¬ 
ized control problem formulation and the resulting controller 
provides robustness to modelling uncertainties and model 
parameter vpjiations with flight condition. Experience gained 
from a preliminary based IFPC design study performed 
earlier is used as the basis to formulate the robu.st control 
design problem and improve upon the previous design. 
Detailed evaluation results are presented for a reduced order 
controller obtained from the improved control design 
showing that the control design meets the specified nominal 
performance objective as well as provides stability robustness 
for variations in plant system dynamics with changes in 
aircraft trim speed within the transition flight envelope. 

INTRODUCTION 

The trend in future military fighter/tactical 
aircraft design is towards aircraft with 
new/enhanced maneuver capabilities such as 
Short Take-Off and Vertical Landing (STOVL) 
and high angle of attack performance. An 
integrated flight/propulsion control (IFPC) 
system is required in order to obtain these 
enhanced capabilities with reasonable pilot 
workload. An integrated approach to control 
design is then necessary to achieve an effective 

• Received 1 October 1991; received in final form 2 June 
1992. This paper was not presented at any IFAC meeting. 
This paper was recommended foe publication in revised form 
by Guest Editor P. Dorato. 

fThe original version of this paper was presented at the 
1991 American Control Conference which was held in 
Boston, MA, U.S.A. 

t Advanced Control Technology Branch, NASA Lewis 
Research Center, Cleveland, OH, U.S.A. 


IFPC system. Such a design approach is 
currently being developed at NASA Lewis 
Research Center under an in-house research 
effort. This methodology is referred to as 
IMPAC—Integrated Methodology for Propul¬ 
sion and Airframe Control (Garg et al., 1991). 
The significant features of the IMPAC methodo¬ 
logy are that it consists of first designing a 
centralized controller considering the airframe 
and propulsion systems as one integrated system 
and then partitioning the centralized controller 
into decentralized subsystem controllers for 
state-of-the-art IFPC implementation. 

The major issue related to the centralized 
controller design portion of IMPAC is the choice 
of the control synthesis technique that “best” 
suits the IFPC objectives. Not only should the 
synthesis technique provide for means of 
formulating the design criteria for the centralized 
control design such that it adequately reflects the 
performance specifications of the “total” system, 
i.e. the airframe integrated with the propulsion 
system, but is should also result in controllers of 
reasonable complexity with guaranteed perfor¬ 
mance and robustness characteristics. Recent 
advances in control theory (Doyle et al., 
1989; Safonov et ai, 1989) and computational 
algorithms to solve for optimal control laws 
(Anon., 1989; Chiang and Safonov, 1988) have 
made this theory a viable candidate to be applied 
to complex multivariable control design prob¬ 
lems. A preliminary investigation of the 
applicability of //„ control theory to the 
centralized feedback controller design portion of 
the IMPAC approach was conducted earlier via 
an example IFPC design study (Garg et al., 


129 



130 


S. Garc 


1990). The results reported in Garg ei al. (1990) 
are quite encouraging in that they demonstrate 
that control theory has the promise to 

provide the framework to meet the requirements 
of a centralized IFPC design. However, the 
control design of Garg et al. (1990) was a 
preliminary design and detailed evaluation of 
that design identified various areas in which the 
controller performance needed to be improved. 
The objectives of this paper are to develop 
insight into formulating a robust control problem 
with the control design framework and to 
improve upon the control design of Garg et al. 
(1990). Towards this goal results are presented 
from an //. based IFPC redesign for the linear 
model of a STOVL aircraft in transition flight 
considered earlier in Garg et al. (1990). 

The paper is organized as follows. The vehicle 
models to be used for control design and 
evaluation are first discussed. The control 
design is then presented along with some 
discussion of the formulation of the IFPC design 
objectives within the framework of the 
control problem. The emphasis is on formulating 
the problem such that the resulting controller is 
robust to modelling uncertainties and parameter 
variations with changes in flight condition. 
Evaluation results are presented for a reduced 
order approximation of the improved 
controller and improvements over the controller 
design of Garg et al. (1990) are demonstrated. 


VEHICLE MODEL 

The vehicle considered in this study is 
representative of the delta winged E-7D 
supersonic STOVL airframe powered by an 
enhanced version of a high performance military 
turbofan engine (Akhter et al., 1989). The 
aircraft is equipped with the following controls; 
ejectors to provide propulsive lift at low speeds 
and hover; a 2D-CD (Two-Dimensional, 
Convergent-Divergent) vectoring aft nozzle with 
afterburner for supersonic flight; a vectoring 
ventral nozzle for pitch control and lift 
augmentation during transition; and jet reaction 
control systems (RCS) for pitch, roll and yaw 
control during transition and hover. A schematic 
diagram of the aircraft with relative location of 
the various control effectors mentioned above is 
shown in Fig. 1. Engine compressor bleed flow is 
used for the RCS thrusters and the mixed engine 
flow is used as the primary ejector flow. Detailed 
ducting diagrams of the engine and discussion of 
the ejector STOVL concept are available in 
Akhter et al. (1989). 

Two separate computer simulations, one for 
the aircraft (six-degree-of-freedom aerodynamic 
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Fig. 1. Control effectors for E-7D aircraft. 
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model with steady-state engine performance 
model) and one for the propulsion system 
(aero-thermo dynamic model) were used to 
assess performance capabilities of the aircraft 
and to generate open-loop linear models for 
control design (Akhter et ai, 1989). The 
procedure for generating integrated airframe and 
propulsion models for control design and 
evaluation from the two separate simulations is 
discussed in Garg et al. (1990). The integrated 
linear design model used in this study is of the 
form 


x = AX-l-Bu; y = Cx + Du, (1) 
where the state vector is 
x = 

[«, V, w, p, q, r, 0, e, N2, N25, T41, T3, P6]^ 

( 2 ) 

with 

u = Axial velocity (ft s"’), 

V = Lateral velocity (ft s“’), 
w = Vertical velocity (ft s“‘), 
p = Roll rate (rad s~‘), 
q = Pitch rate (rad s“'), 
r= Yaw rate (rads“*), 

0 = Roll attitude (rad), 

6 = Pitch attitude (rad), 

N2 = Engine fan speed (rpm), 

N25 = High pressure compressor speed (rpm), 
T41 = High pressure turbine inlet temp. (°R), 
T3 = Combustor inlet temp. (°R), 

P6 = Tailpii)e entrance total pressure (psi), 

the control inputs are 

u = [fie, fia, fir, AQR, AYR, ARR, WF, A8, 

ETA,A78,ANG79, ANG8], (3) 

with 

fie = Elevator deflection (deg) 
fia = Aileron deflection (deg) 
fir = Rudder deflection (deg) 

AQR = Pitch RCS area (in^) 

AYR = Yaw RCS area (in^) 
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ARR = Roll RCS area (in^) 

WF = Fuel flow rate (Ibm hr~‘) 

A8 = Aft nozzle throat area (in^) 

ETA = Ejector butterfly angle (deg) 

A78 = Ventral nozzle area (in*) 

ANG79 = Ventral nozzle vectoring angle (deg) 
ANG8 = Aft nozzle vectoring angle (deg) 

and the outputs are 

y = fF,K, y, (4) 

with 

V = True airspeed (ft s“') 

V = Acceleration along flight path (ft s~^) 
y = Longitudinal flight path angle (deg) 

P = Sideslip angle (deg) 

$ = Rate of change of sideslip angle (deg s~ ^). 

The other outputs are as discussed under state 
description except that the angular positions and 
rates are in degrees. The integrated plant 
systems matrices, A, B, C, and D, are listed in 
the Appendix. 

Some discussion of the choice of control inputs 
u for the linear design model is relevant here. 
The choice of outputs y is discussed later in the 
paper. The E-7D aircraft is equipped with left 
and right clevons on the trailing edge of the 
delta wing. Collective deflection of the elevons 
provides the classical elevator pitch control while 
differential use of the elevons provides the 
aileron roll control. So the elevator {6c) and 
aileron (<!)a) along with the rudder (<^r) are used 
as the airframe control inputs in the design 
model. Only three RCS areas, AQR, AYR and 
ARR, are used as RCS control inputs in the 
linear design model whereas the full nonlinear- 
model has five controlled RCS areas. The 
reasons for this are that the nose pitch RCS and 
the two yaw RCS thrusters provide thrust in only 
one direction as shown in Fig. 1, and the wing 
tip roll RCS thrusters arc to be used 
differentially for roll control and collectively for 
pitch control. For instance yaw RCS thrusters 
provide only forward thrust, so left yaw RCS is 
used for right yaw and right yaw RCS is used for 
left yaw in the nonlinear model. Using both left 
and right yaw RCS areas as control inputs in the 
design model can result in a control design that 
uses the two areas differentially to enhance yaw 
control which is inconsistent with the actual 
implementation. Similar reasons apply for pitch 
and roll RCS area selections. An RCS 
distribution logic that will distribute the three 
design model RCS commanded areas to the five 
actual areas in the nonlinear control implemen¬ 
tation is shown in Fig. 2. Since the nose pitch 
RCS thruster only provides positive (pitch up) 
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RCS area 


Left roll 
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pitching moment, a negative AQR command is 
distributed among the left and right wing tip 
RCS thrusters to generate the commanded 
negative pitching moment taking into account 
the relative pitch control effectiveness of the 
nose and wing tip RCS thrusters. 

Figure 2 is presented here to clarify the 
discussion on choice of RCS areas for linear 
design model although the logic of Fig. 2 is not 
used to evaluate the control designs presented in 
this paper. Note that there is an absolute 
nonlinearity in the relationship from commanded 
RCS areas to compressor bleed flow demand in 
that although the RCS area may be positive or 
negative depending on the desired direction of 
RCS thrust, the compressor bleed flow demand 
(WB3) to generate the thrust is always positive. 
For a linear model trimmed about zero RCS 
areas, this relationship is of the form WB3i = 
Ki \AiR\ where WB3i is the demanded bleed 
flow due to AiR command with i representing Q, 
Y or R for pitch, yaw or roll RCS, respectively, 
K, is an appropriate constant and \-\ represents 
absolute value. This nonlinearity was not taken 
into account in the linear control design of Garg 
et al. (1990) and the resulting controller was 
shown to lead to unacceptable performance 
when evaluated with this nonlinearity. In this 
paper, the critical absolute nonlinearity from 
RCS area commands to WB3 demand is 
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accounted for in the linear control design process 
by a proper formulation of the control design 
problem within the framework. The details of 
this formulation are discussed later in the paper. 

The control inputs, WF, A 8 , ETA, A78, 
ANG79 and ANG 8 in the vector n, are the 
propulsion system inputs. The ejector butterfly 
valve angle (ETA) controls the engine airflow to 
the ejectors, thus providing a means of 
controlling ejector thrust. There are separate 
control valves for the left and right ejectors, 
however the two valve angles are set to be equal 
in the aircraft simulation because no test data are 
available on the differential use of the ejectors 
for roll control. Therefore only one butterfly 
valve angle is used as the control input in the 
design model. The other flve propulsion system 
control inputs in the design model were just as 
defined in the full cycle-deck engine simulation 
(Akhter et ai, 1989). 

The flight phase considered in this study is the 
decelerating transition during approach to hover 
landing. This phase presents a challenging 
control design problem becuase the control of 
the aircraft is transitioning from aerodynamically 
generated forces and moments to those gen¬ 
erated by the propulsion system. For this study, 
three linear integrated models were obtained 
corresponding to steady-state level flight at trim 
speeds of Vo = 60, 80 and 100 knots with trim 
flight path angle yo=~3deg for all three 
models. The trim strategy used to generate the 


Table 1. Trim control for linear 
MODELS (listed AS PERCENT (%) OF 80 
KNOT MODEL TRIM VALUES) 


Vo (knots) 

60 

80 

100 

A8„(%) 

J1 

100 

268 

ETA„(%) 

96 

100 

118 

A78o(%) 

133 

100 

32 

WF„ (%) 

116 

100 

71 

N2„(%) 

104 

100 

90 


linear models is as follows, (i) The aircraft pitch 
attitude was set at do = 7 deg to provide 
adequate visibility during landing, and the 
elevon settings were chosen to correspond to 
elevator deflection deo = 20 deg so that adequate 
elevator control authority is available for 
maneuvers during transition (ii) the aft nozzle 
vectoring angle was set to ANG 80 = 0 so that the 
total vectoring authority is available for pitch 
control, and the ventral nozzle vectoring angle 
was set to ANG79o = 64 deg to ensure that the 
aircraft can be trimmed at the low speed (60 
knots) with small but nonzero aft nozzle thrust, 
hence nonzero throat area. This trim strategy 
leaves adequate A 8 and ANG79 control 
authority available for these controls to be used 
for active speed and pitch control in transition; 
(iii) the aircraft is then trimmed by using the 
three thrusts—aft nozzle, ventral nozzle and the 
combined ejectors, as the trim controls. The 
resulting trim values of A 8 , ETA, and A78 for 
the three trim speeds are listed in Table 1. Also 
listed in Table 1 are the trim values of fuel flow 
(WF) and engine fan speed (N2) which define 
the engine operating point. Note that the 
quantities shown in Table 1 are a percentage of 
the corresponding trim values for the 80 knot 
model. As seen from Table 1, thrust is 
transferred from aft nozzle to ventral nozzle as 
the aircraft slows down and the engine operating 
point is moved up to a higher gross thrust level 
(increased WF and N2) to generate the 
propulsive lift needed to compensate for the loss 
in aerodynamic lift. 

Open loop frequency-domain and time- 
domain analyses of the three linear models 
indicated that the 80 knot model provides a 
•‘good average” of the three models in terms of 
input/output re.sponse behavior. Thus the 80 
knot model was used for control design with the 
60 and 100 knot models being used to evaluate 
the stability and performance robustness of the 


Table 2. Eigenvalues of linear models 



Eigenvalue 


Description 

60 

80 

100 

V;, (knots) 

-7.9 X 10'^ 

-8.5 X lO "" 

-7.9 X 10 ‘ 

Spiral mode 

-0.09±;0.24 

-O.ll ±/0.28 

-0.12±/0.31 

Phugoid mode 

1,05 

1.29 

1.44 

Unstable SPt 

-1.64 

-1.73 

-1.77 

Roll mode 

-1.74 

-2.09 

-2.45 

Stable SP 

0.05 ±>1 99 

-0.23 ±;2.27 

-0.48 ± >2.53 

Dutch roll 

-5.58±/0.74 

-4.12 

-3.33 

Rotor speeds 


-7.11 

-3.83 


-29.73 

-29.39 

-27.73 

Air temps 

-38.67 

-38.21 

-39.95 


-172.0 

-199.7 

-277.1 

Pressure mode 


t Short period. 
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nominal design, develop controller scheduling 
and evaluate off-design performance. The 
eigenvalues of the linear models for the three 
flight conditions are listed in Table 2 and the 
airframe modes are identified in terms of their 
‘'classicar' interpretation (McRuer et al., 1973). 
As seen from Table 2, the aircraft is unstable in 
pitch with the short period (SP) mode becoming 
less unstable as the trim speed decreases. Also 
the dutch roll mode damping is very low and it 
decreases with trim speed with the mode going 
unstable at 60 knots. The engine temperature 
and pressure modes are very fast compared to 
the engine rotor dynamics and aircraft modes. 
Ideally these fast modes will not be considered in 
the linear design model in order to reduce the 
complications in the control synthesis procedure. 
However, the approach taken in this study is to 
include these modes in the design model and 
reduce the complexity of the controller through 
order-reduction after initial controller synthesis. 

CONTROL DESIGN 
Design methodology 

The IFPC design problem discussed earlier 
was formulated as a command tracking, 
disturbance rejection problem within the frame¬ 
work of the general mixed sensitivity control 
problem (Safonov and Chiang, 1988). The 
detailed block diagram for the f/ao formulation of 
the IFPC feedback control design is shown in 
Fig. 3. In Fig. 3, z are the controlled variables 
and z^ are the corresponding reference com¬ 
mands. The three transfer functions that are of 
interest for such a problem are the sensitivity 
function 5(5), the complementary sensitivity 
function T{s), and the control transmission 

Wc(s) 

I- 1 

Weighted 
control 
rate 

Weighted 
control 

Weighted 
complemen¬ 
tary sensitivity 

Weighted 
sensitivity 



function C(5). These represent the closed-loop 
transfers from the reference commands and 
disturbances to tracking errors, controlled 
variables and commanded control inputs, resp¬ 
ectively. In order to influence both the low- 
frequency and high-trequency properties of the 
closed-loop system it is desirable to find a 
controller K(5) which minimizes a weighted 
norm of a combination of these three transfer 
functions, i.e. 

min \\H{s)\U with H{s) 

Stabilizing X(t) 

Ws{s) ■ S{s) 
Wris) ■ r(5) 
Ms) ■ C(j) J 

where ||A/(i)|U = sup (a,„„[if(y(w)]). 

VJ 

The weighting functions Wg, Wr, and Wc(5) 
are the “knobs” used by the control designer to 
“tune” the controller K(5) such that the design 
objectives are met. For instance choosing to 
be large at low-frequency ensures good com¬ 
mand tracking performance and choosing Wr to 
be large at high frequencies ensures robustness 
to high-frequency unmodelled dynamics. Wc are 
chosen to ensure that control actuation bandw- 
idths and control rate and deflections are 
practically achievable. 

The //, tracking formulation of Fig. 3 allows 
for feedback of plant measurements other than 
just tracking errors as inputs to the controller. 
This formulation then allows the simultaneous 
design of inner loop plant augmentation 
(stability or response shaping) and command 
tracking system. Such plant augmentation is an 
integral part of flight control design since the 
overall objective is to design a system for desired 
piloted handling qualities and not just an 
automatic command tracking system. Also, it 
has been shown in Chiang et al (1990) that if a 
tracking problem is formulated within the 
framework as purely a servo-mechanism prob¬ 
lem, i.e. with controller inputs being just the 
tracking errors, then the controller will be 
such that its transmission zeros cancel the stable 
poles of the design plant thus resulting in a 
closed-loop system that will have almost no 
robustness to plant parameter variations that 
result in change in open-loop pole locations. 
Allowing for feedback augmentation within the 
control formulation overcomes this problem 
of lack of robustness as demonstrated in Garg et 
aL (1990). 

Design specifications 

The vectors u and y in Fig. 3 are the 
integrated design model inputs and outputs, 


Fig. 3. Block diagram for formulation of IFPC feedback 
design. 
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respectively, as discussed in the previous section. 
The controlled variables z were selected to be 

i = [Vv,Qv.Y,Pv.P,my (5) 

with Vt; = V + 0.1V", Qv = q + 0.3d, Pu=p + 
0 .1^ and the others as discussed under plant 
outputs. This blending of controlled variables 
was chosen to provide the response types that 
are desirable for good handling qualities (Anon., 
1970; Hoh and Mitchell, 1986) in transition 
flight. The choice of Vv corresponds to designing 
an acceleration (deceleration) command system 
with velocity hold, and the choice of Qv and Pv 
correspond to designing a rate command-attitude 
hold system. The break frequencies for switching 
from rate to attitude command for the case of 
Qv and Pv, and from acceleration to velocity 
command for the case of Vv, were chosen based 
on open-loop control effectiveness studies. For 
instance, the elevator (de) is effective in pitch 
rate control in the frequency range of 0.3 rad s~' 
to 10rads~' and is effective in pitch attitude (6) 
control for frequencies below 0.3rads“‘. The 
choice of y in z provides for flight path angle 
control and the choice of N2 provides for 
tracking the fan speed commands generated by 
the engine operating schedule logic. 

Designing the feedback controller K(s) to 
provide decoupled command tracking of the 
individual elements of z will result in a system 
that provides independent control of accelera¬ 
tion, pitch, flight path angle, roll and sideslip 
from the various pilot control effectors such as 
stick, throttle and rudder pedals, etc. thus 
reducing pilot workload, and also control of the 
propulsion system operating point (N2) inde¬ 
pendent of the aircraft motion. Independent 
control of roll (Pv) and sideslip angle (/9) will 
result in a control system that provides 
automatic turn coordination thus further reduc¬ 
ing pilot workload. 

The choice of outputs y, in equation (4), is 
consistent with the various quantities used for 
feedback in classical aircraft control augmenta¬ 
tion (McRuer et al., 1973). Rate feedback 
provides for improved damping while position 
(angles) feedback improves natural frequency of 
the appropriate mode. 

Based on the experience gained from the 
control design study of Garg et al. (1990) an 
important criterion for control design is that the 
closed-loop system be robust to the absolute 
nonlinearity associated with the bleed flow 
demand from the RCS area commands—see 
earlier discussion. Also, the control design of 
Garg et al. (1990) was such that it did not further 
stabilize the dutch roll mode, i.e. the open-loop 
dutch roll mode was also a mode of the 


closed-loop system with the designed controller. 
This resulted in an unstable closed-loop system 
for the off-design integrated model at 60 knots 
due to the dutch roll mode being unstable at that 
flight condition. For this redesign effort, 
stabilization of the dutch roll mode to guarantee 
stable closed loop system over the transition 
flight conditions being considered was included 
as a design speciflcation. 

Control design 

The design plant inputs and controlled outputs 
were normalized by maximum allowable deflec¬ 
tions (Um„) ^nd maximum commanded values to 
be tracked (Zc^ax). respectively. The were 
chosen to be reasonable deviations from the 
nominal (trim) values such that the total control 
deflection limits (as incorporated in the actuator 
models in the nonlinear simulation) will not be 
exceeded. Some of the u^ax values correspond¬ 
ing to the propulsion system controls were 
further reduced to ensure that the total control 
usage will be within the limits imposed by safety 
requirements. For example, the allowable engine 
fuel flow for a given operating point is limited by 
the engine acceleration/deceleration schedule. 
The Zcmaa Were chosen based on handling 
qualities control requirements and open-loop 
control effectiveness analysis of the design plant 
to ensure that each element of z can be 
commanded individually to its maximum value 
within its frequency range of interest without 
exceeding value for any of the control 
inputs. The Zc„,„ and u^ax used in the control 
design are listed in the Appendix. The singular 
values of the scaled design plant for the six 
controlled variables defined in equation (S) are 
shown in Fig. 4. The fact that the minimum 
singular value in Fig. 4 is less then one implies 
that there exist combinations of numerical values 
of commands z^ such that although each element 
of Ze is less than its maximum value, the 
combined commands z^ cannot be tracked 
without exceeding the control limits for 

some control input. 

The sensitivity weights Ws and the com¬ 
plementary sensitivity weights for each of the 
controlled variables were chosen to be first- 
order, to provide adequate frequency response 
shaping without overly increasing the resulting 
controller order. The Wj zero and pole for each 
controlled variable were chosen to result in a 
low-frequency gain of 1000, gain crossover 
frequency of 3-4 times the control bandwidth 
desired for good handling qualities for the 
controlled variable of interest, and a high- 
frequency gain of 0.1. Such a choice of Wg 
reflects the desire to synthesize a sensitivity 
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function which gives good steady-state tracking 
performance in the presence of disturbances and 
low-frequency modelling errors, good tracking 
performance up to the desired bandwidth of 
control and reduced emphasis on tracking at 
high frequencies where there are significant 
modelling errors and uncertainties. The W, were 
chosen to obtain a low-frequency gain of 0, gain 
crossover frequency of about 1.2 times the 
corresponding Ws gain crossover frequency, and 
a high-frequency gain of IOC). This choice of Wy 
ensures that the plant outputs are not penalized 
at low frequencies where command tracking is to 
be emphasized while at high frequencies the 
plant outputs are penalized heavily to provide 
controller gain attenuation for robustness to 
high-frequency unmodelled dynamics. The sen¬ 
sitivity and complimentary sensitivity weights for 
each controlled variable z, are listed in the 
Appendix. 

As part of the control weighting Wc, both the 
control inputs and control rates were weighted 
with in Fig. 3 chosen to be the inverse of Un,a„ 
and ITu chosen to be inverse of maximum control 
rate for each control input. The Umax values used 
in the control design are listed in the Appendix. 
Since using full order actuator models for each 
control input would have resulted in a very 
high-order controller, first order actuator ap¬ 
proximations were used in the control design. 
Describing function analysis (Ogata, 1970) of the 
full order actuators was first performed to 
determine the degradation in actuator bandwidth 
due to rate limiting when control commands 
corresponding to u^ax used at all frequencies. 
The worst-case rate-limited actuator bandwidth 


was then used as the bandwidth for first-order 
design actuators. For example, the pitch RCS 
area actuator bandwidth is 20 rad s""^ but the rate 
limit is 3.0ins~^ which results in a worst case 
bandwidth of 6.0 rad s'* for AQR^mi of 0.7 in^. 
Using such an approximation for the actuators in 
the design guarantees performance robustness 
and stability in the presence of actuator rate 
limiting. The actuator model numerical data are 
listed in the Appendix. 

The bleed flow demand from the RCS acts as 
a disturbance on the engine operating point. In 
the propulsion system simulation and the linear 
model generated from this simulation, bleed flow 
appears as an external disturbance. However, 
when the airframe and propulsion models are 
integrated there is no explicit dependence on 
bleed flow since the bleed flow is generated from 
the RCS area commands which are the inputs to 
the integrated model. As pointed out earlier, the 
linear design model does not account for the 
absolute nonlinearity from the RCS area 
commands to bleed flow demand, so the effect of 
the RCS areas on the engine states as it appears 
in the integrated linear design model is 
erroneous. The linear design model was 
modified such that the elements of the control 
effectiveness matrix, B, from the RCS areas to 
engine states are zero, i.e. the RCS areas do not 
affect the engine operating p)oint, and bleed flow 
was added as an external disturbance affecting 
the engine dynamics as in the linear engine 
model. For control synthesis, the bleed flow 
disturbance was modelled as a scaled exogenous 
input with a scaling factor of 7 Ibm s ', which is 
the maximum possible RCS bleed flow, and was 
considered to be a measurement available for 
feedback. Although bleed flow is not measur¬ 
able, it can be estimated as a function of RCS 
areas and ambient and bleed duct pressure 
conditions or the bleed flow feedback can be 
implemented as nonlinear feedback of the RCS 
area commands. Since the RCS is used for 
control of aircraft angular rates, the bleed flow 
disturbance was filtered by a first order filter 
with a bandwidth of 3.5 rad s'' which is 
representative of desired angular rate control 
band widths. 

A procedure to account for plant model 
uncertainties within the framework of the //« 
control problem is to use exogenous disturbance 
inputs which mimic the effects of model 
variations (Reichert, 1989). Such an approach 
was used in the current design study to provide 
enhanced lateral/directional stability by ap¬ 
propriate damping of the Dutch roll mode and 
robustness to changes in the lateral/directional 
dynamics with decreasing trim airspeed. Side 
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force, rolling moment and yawing moment 
Changes were represented in the form of scaled 
exogenous disturbances in lateral acceleration 
(u), roll acceleration {p), and yaw acceleration 
(r) with scaling factors of 0.5fts~^, 0.1 rad s~^, 
and 0.1 rad s“^, respectively. The overall 
control synthesis problem was then formulated 
as that of command tracking with bleed flow and 
lateral/directional disturbance rejection. 

The design plant as discussed above is of 38th 
order consisting of the 13th order integrated 
airframe/propulsion system design model, first 
order actuators for the 12 control inputs, first 
order sensitivity and complementary sensitivity 
weights for the six controlled variables, and the 
first order filter for the bleed flow disturbance. 
So the controller using the algorithm of 
Doyle et al. (1989) will be of 38th order. The 
controller was obtained using a modified (by the 
author) version of the HINF CONT function of 
the Robust Control Module software developed 
by Integrated Systems Inc. (see Anon., 1989). 
The //« control design results with this design 
plant are shown in Fig. 5 in terms of the closed 
loop maximum singular values of the combined 
weighted functions weighted errors 

<^in.i.[Wse(/tD)], weighted controlled outputs 
<^iii.*lWTxO‘<w)i. weighted controls c7ma»[W„u(/(o)] 
and weighted control rates Omax[WuU(/(w)j 
with the commands Zj as inputs. ||//(/(u)||« = 10 
is achieved for this controller as seen from 
Fig. 5. In general a control design with 
||//(/a>)||®=s 1 ensures that all the design speci¬ 
fications that are formulated through the various 
weightings will be met. However, this was not 
the case in the present design, because as stated 
earlier the minimum singular value of the scaled 
design plant itself was much less than one. The 
fact that control efforts greater than u„,.„ will be 
required to track some combinations of com¬ 
mands is evident from the large (>1) maximum 
singular values of weighted controls at low 
frequencies. The fact that maximum singular 
value of weighted errors is greater than one at 
low frequencies indicates the performance/ 
control trade-off made in the minimization 
procedure. The fact that the maximum singular 
values of the weighted controlled outputs and 
control rates are less than one over all 
frequencies indicates that the //„ controller 
provides adequate “loop gain” roll-off for the 
closed-loop system to be robust to unmodelled 
high-frequency dynamics and that the control 
rate limits will not be exceeded for any 
combinations of commmanded variables. Fur¬ 
ther evaluation results are presented in the 
following section with reference to a reduced- 
order controller obtained from the controller. 



Frequency, rad/s 

Fig. 5. Closed-loop weighted norms for controller. 

CONTROLLER REDUCTION AND EVALUATION 

The controller order was reduced from 38 to 
14 using a combination of modal residualization 
(Kokotovic and Sannvie, 1968) and balanced 
realization (Moore, 1981) reduction techniques. 
The order was first reduced from 38 to 22 by 
modal residualization of the high-frequency 
controller modes. The 22nd order controller was 
reduced to 16th order by a balanced realization 
approximation which was then reduced to 14th 
order again by residualization of the high- 
frequency modes. The state-space system matr¬ 
ices for the 14th order controller are listed in the 
Appendix. The eigenvalues of the final reduced 
order controller were bounded by |A,| < 10 thus 
indicating that digital implementation of the 
controller can be achieved with reasonable 
sampling rates. The performance with the full 
order and reduced order controllers is compared 
in Fig. 6 in terms of the maximum and minimum 
singular values of the closed-loop tracking 
system, tr„a,fT(/w)] and tT„„„[T(/a>)] with 
z(j) = T(j)z,.(a'). Clearly, there is no significant 
loss in tracking performance with the reduced 
order controller as seen by the excellent match 
for the closed-loop singular values up to a 
frequency of 20rads“‘. Note that other modern 
control reduction techniques such as frequency- 
weighted balanced realization (Enns, 1984) were 
not used because it was not clear how to apply 
these techniques for a controller structure which 
includes external disturbances as inputs (bleed 
flow in this particular case). 

Extensive frequency-domain and time-domain 
analyses were performed to evaluate the 
closed-loop performance and robustness with the 
linear design model. All these analyses indicated 
that the reduced-order controller provides 
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Fig 6. Closed-loop tracking singular values with full and 
reduced-order controllers 


decoupled command tracking bandwidths and 
reasonable control actuation requirements. Note 
that although the open-loop design model is 
decoupled in the longitudinal and 
lateral/directional dynamics, coupling between 
these axes is introduced through the use of RCS 
control. For instance, using the yaw RCS will 
result in change in velocity because the yaw RCS 
generates axial force. So the fact that the control 
design achieves decoupled response in lon¬ 
gitudinal and lateral/directional axes while using 
RCS for active control is significant. 

The performance of the control design m the 
presence of the absolute nonlinearity from 
commanded RCS areas to compressor bleed flow 
demand was evaluated by including these 
nonlinear effects in the linear 80 knot design 
model. The numerical data for this absolute 
nonlinearity is listed in the Appendix. An 
example result is shown in Fig. 7 in terms of the 
closed-loop pitch attitude and engine fan speed 
response to a pitch variable (Qv) command. The 
Qv command was chosen to correspond to a 
transient pitch rate command and a steady-state 
pitch attitude command. Also shown in Fig. 7 is 
the corresponding closed-loop system response 
with the control design of Garg et al. (1990). The 
current design provides the desired improvement 
over the control design of Garg et al. (1990) in 
that it maintains good pitch tracking and fan 
speed regulation performance in the presence of 
the absolute bleed flow nonlinearity. Thus the 
formulation of the control problem to reject 
bleed flow disturbance with the modified design 
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Fig 7 Pitch tracking performance in the presence of bleed 
Flow nonlinearity 


plant, the modification being the removal of the 
RCS area effect on the engine dynamics, was 
successful in providing performance robustness 
to the bleed flow absolute nonlinearity. 

In the transition flight phase, typical pilot 
control tasks are acceleration/deceleration to a 
desired airspeed while maintaining flight path 
angle or change to a desired flight path angle 
while maintaining airspeed. The IFPC design 
performance for these two typical tasks is shown 
in Figs 8 and 9. From Fig. 8(a), we note that the 
controller provides good command tracking of 
the transient acceleration and steady-state 
airspeed hold command while maintaining good 
regulation of pitch attitude, flight path angle and 
engine fan speed. The propulsion system 
actuation requirements, shown in Fig. 8(b), are 
quite reasonable and well within the limits 
established for the “small perturbation” linear 
control design (as seen in comparison with the 
values used for control and control rate 
normalization—see data in the Appendix). The 
resulting propulsion system generated thrusts are 
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Fig. 8. Cloaed-loop response for transient acceieration/steady-state velocity hold command. 


shown in Fig. 8(c) where 

FGT = Total propulsion system gross thrust 

(ibf) 

FG9 = Gross thrust from aft nozzle (Ibf) 

FGE = Gross thrust from ejectors (Ibf) 

FGV = Gross thrust from ventral nozzle (Ibf). 

FGT is an estimate for the total thrust that 
would be generated by the propulsion system if 
all the airflow were to exit by the aft nozzle at 
some specified nominal ambient conditions. As 
the aircraft speed increases, the aerodynamic lift 
will increase with a corresponding increase in 
drag. This will result in decreased requirement 
on propulsion system generated vertical thrust 
and an increase in the required axial thrust. The 
thrust responses of Fig. 8(c) are consistent with 
these requirements. The control design pre¬ 
sented here corresponds to the “inner-loop” of 
an IFPC system. In actual implementation, there 
will be an “outer-loop” which will generate 
commands for the engine fan speed (N2) based 


on the total thrust requirements (FGT) to keep 
the engine operating on a desired schedule. 
From Fig. 8(c) we note that during the transition 
flight phase, an acceleration command for the 
aircraft results in decreased total propulsive 
thrust requirements and hence corresponds to an 
actual deceleration command for the engine rotor 
speeds. 

Figure 9(a) shows that the control design 
provides good tracking of flight path angle 
command with decoupling from velocity, pitch 
attitude and fan speed. The propulsion system 
control actuation requirements are reasonable as 
seen from Fig. 9(b). As is to be expected, Fig. 
9(c) shows an increase in ejector thrust to 
support the aircraft climb rate and a correspond¬ 
ing increase in total propulsion system thrust 
generation requirement. 

Another important design criterion discussed 
earlier was to provide stability robustness to 
changes in the lateral/directional dynamics 
specially with reference to the low damping of 
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the dutch roll mode. For the current control 
design, the open-loop dutch roll mode was no 
longer a dosed-loop mode with the 80 knot 
design plant. Moreover, all the closed-loop 
modes were well damped with the smallest 
damping ratio being 0.4. Detailed stability 
robustness evaluation of the control design to 
variations in the plant system A matrix due to 
changes in trim speed were conducted using 
structured singular value robustness tests 
(Doyle, 1985). The procedure for this stability 
robustness evaluation is discussed in the 
following. 

First the variations in the airframe portion of 
the plant A matrix with changes in trim speed 
were identified. With the 80 knot model A 
matrix as the nominal, 13 elements of the 
airframe portion of the A matrix showed a 
change of 25% or greater in going from 80 to 60 
knots or 80 to 100 knots. These elements 
correspond to change in axial (x-axis) force due 
to pitch rate, side force (y-axis) due to roll rate 
and yaw rate, vertical force (z-axis) due to axial 
speed (u), vertical speed (w) and pitch rate, 
rolling moment due to roll rate and yaw rate, 
pitching moment due to axial speed and pitch 
rate, and yawing moment due to side velocity 
(v), roll rate and yaw rate. Also the percentage 
change in the A matrix elements was approxim¬ 
ately symmetrical about the nominal A matrix. 
For a given trim speed, Vq, these 13 elements of 
the A matrix were then modelled using the 
multiplicative uncertainty form as 

A,, = A^{l + K‘^^V), ( 6 ) 

where A*’ is the i,j element of the 80 knot model 
A matrix, AK = K - 80, with units in knots, and 
K'i, were chosen to be the gradient with respect 
to velocity for the A(i,j) element from the 60 
knot to 100 knot models, i.e. K'^={A\^- 
A“)/40. The numerical values for K‘{, are listed 
in the Appendix. With such an uncertainty 
model, the perturbation A(r) for robustness 
analysis is of the form A(j) = AV • I where I is 
an identity matrix of dimension 13, and an upper 
bound for structured singular value stability 
robustness measure considering AV to be 
complex is given by Apkarian (1989) 

fi = sup [p(M(/a)))], (7) 

where p[A] is the spectral radius of matrix A 
(maximum absolute eigenvalue of A) and M(j) 
is the interconnection matrix for robustness 
analysis which represents the nominal closed- 
loop system taking into account the structure of 
A(r). A detailed discussion of creating the 
interconnection matrix for this particular study is 


beyond the scope of this paper. The reader is 
referred to Doyle (1985) for a theoretical 
discussion of the subject of robustness analysis 
for structured uncertainties and to Anon. (1989) 
for a detailed exposition on creating the 
interconnection matiix for various forms of 
uncertainty models. The structured singular 
value for A matrix uncertainty as modelled by 
equation (6) is shown in Fig. 10. Also shown in 
Fig. 10 is the corresponding result with the 
control design of Garg ef tU. (1990). From Fig. 
10, p = 0.02 for the current control design which 
implies that for the A matrix uncertainties as 
modelled, the closed-loop system will remain 
stable for velocity perturbations of up to 
AV = 1/p = 50 knots. For the design of Garg et 
al. (1990), p = 0.065 which implies that the 
closed loop system stability with that control 
design would have been guaranteed only for 
velocity perturbations up to IS knots. In fact, for 
AV = IS knots as modelled, the open-loop A 
matrix is such that the dutch roll mode is just 
unstable and the control design of Garg et al. 
(1990) was found to be unstable for this 
perturbation. The closed-loop system with the 
current control design was stable for the full 
off-design 60 and 100 knot plant models thus 
indicating that the use of exogenous disturbance 
inputs in the if. problem formulation was 
successful in building-in robustness to variations 
in the lateral/directional dynamics. 

Since the design plant is an integrated flight 
propulsion system, robustness of the design 
closed-loop system to changes in the propulsion 
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Fig. to. Stability robustness measure p(M) tor variations in 
airframe portion of plant A matiix. 
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system dynamics is also an important criterion to 
be met. As mentioned in Sobey and Suggs 
(1963) the engine acceleration (fan/compressor 
speed changes) response to fuel flow characteris¬ 
tics can vary over a wide range due to the 
cumulative effect of various variables such as 
wear and differences in manufacturing toler¬ 
ances, variable geometry setting, inlet distortion, 
environmental conditions, etc. For the purposes 
of robustness analysis, this variation in propul¬ 
sion dynamics can be represented as uncertainty 
in the A matrix and B matrix elements 
corresponding to the change in rotor speed rates 
(N2, N25) with change in rotor speeds and fuel 
flow, respectively i.e. A(i,j) and B(i, k) with / 
and j = 9, 10 and k = l \n the design model of 
equation (1). Structured singular value robust¬ 
ness with the 80 knot design model and the 
current controller design was performed con¬ 
sidering independent multiplicative perturba¬ 
tions in these six elements of the A and B 
matrices. This analysis gave n = 1.276 which 
implies a stability margin SM= l//x = 0.79, i.e. 
the closed-loop system will remain stable for 
changes of up to 79% from the 80 knot model in 
these A and B matrix elements. Thus the control 
design provides robustness to large variations in 
the engine dynamic response characteristics. The 
actual change in these elements between the 
design model and the off-design 60 and 100 knot 
models was much less than 79% thus indicating 
that the design controller will provide robustness 
to changes in engine dynamics due to changes in 
the engine operating point within the transition 
envelope. 

Note that the robustness results presented so 
far are for stability robustness and not 
performance robustness. For the STOVL aircraft 
in transition flight, the control effectiveness of 
the various effectors changes significantly with 
speed. As speed decreases, the effectiveness of 
the aerodynamic control surfaces decreases and 
the control of aircraft attitude angles and angular 
rates is transferred from aerodynamic surfaces to 
propulsion system generated moments via RCS 
and nozzle vectoring controls. In general, it is 
very difficult to provide performance robustness 
to control effectiveness changes while maintain¬ 
ing nominal performance unless one is willing to 
accept very complex and high-order controllers. 
Controller scheduling is then required to provide 
satisfactory performance over the design flight 
envelope. A simple scheduling scheme of the 
form 

K(5) = Ks-K^(5), 

where K(j) is the scheduled controller, Ks are 
scheduling gains which are a function of airspeed 


(V), and is the nominal controller (14th 

order for the 80 knot design point), was 
developed for the STOVL aircraft IFPC design 
(Garg and Ouzts, 1991). This scheduling scheme 
exploits the robustness characteristics of the 
nominal collector and was found to lead to 
satisfactory performance over the transition 
flight envelope. 

^ The robust IFPC controller discussed in this 
paper is currently being partitioned into separate 
airframe and propulsion system controllers for 
ease of implementation and to allow for 
independent closed-loop propulsion system vali¬ 
dation. The nonlinear safety and limit logic 
aspects of the propulsion system design are also 
being considered (Garg and Mattern, 1991). 
Piloted simulation evaluation of the overall 
controlled system, is planned in the near future. 

CONCLUSIONS 

The Integrated Flight/Propulsion Control 
(IFPC) system design presented in this paper 
demonstrates the applicability of an control 
synthesis technique to integrated control design 
for complex systems such as Short Take-Off and 
Vertical Landing (STOVL) aircraft. The capabi¬ 
lity to address closed-loop performance, robust¬ 
ness and control actuation trade-offs within the 
framework of an //« control problem formula¬ 
tion was demonstrated. In particular, the //« 
control design for the IFPC problem, as 
formulated herein, provided robustness to the 
absolute nonlinearity associated with the com¬ 
pressor bleed flow demand from the Reaction 
Control System and to changes in the plant 
system dynamics over the transition flight 
envelope while achieving acceptable nominal 
performance. The IFPC design corresponds to a 
highly advanced pilot control mode for the 
STOVL aircraft wherein the pilot has independ¬ 
ent direct control of flight path angle and 
acceleration along flight path. The integrated 
control makes the propulsion system thrust 
magnitude and vectoring management transpar¬ 
ent to the pilot thus resulting in reduced pilot 
workload. Robust control design also reduces 
the requirements on controller scheduling thus 
simplifying control implementation. 
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APPENDIX 

Numerical data 

The integrated system is of the form: 


X - Ax + Bu -t- Wiv; y = ( >-I- Du 4 Fk , 


with X, u, and y a.s defined by equation.s (2), (3) and (4), respectively, and iv — WB3—the compressor bleed flow which is 


modelled as a disturbance on the plant. 

The system 

matrices are as 

follows. 
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The scaling used to normalize the controlled variables (z) and the control inputs (a) are as follows; 

*™.« = (7.6 6.3 4.0 10.2 5.0 ZOO.O]’’, 

■n... = [5 0 5-0 10.0 0.7 0.7 0.7 1000.0 20.0 8.0 45.0 10.0 10.0]^. 

The sensitivity weightings IV^, and the complementary sensitivity weightings HV; for each of the controlled variables z, was 
chosen to be of the form; 


11^ + 1000 b,s 

10a,j + r " 0.0016fS + r 


with o, = 27.92, 11.17, 31.41, 22.33, 22.33, 13.4, and 6, = 0.233, 0.093, 0.263, 0.185, 0.185, 0.111 for i = l,..., 6. 
The maximum values for the control rates (a„„) used to determine were chosen to be; 

•im« = [52 52 120 3 6 3 36,000 240 45 240 20 27.5]^ 

The first order actuators used in the control design are of the form 


s + a. 


with a, = 20, 20, 20, 6, 10, 6, 13, 20, 10, 20, 5, 6 fon = 1,... , 12. 

The 14th order controller, obtained after reduction of the optimal controller is of the form; 



with the controller state-space matrices as follows; 
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•“’ -3.20x10 

r* -6.35x10“ 
2.13 X 10 
1.02x10“ 
1“’ 5.73x10 

r’ 2.92 X 10“ 
r’ 1.08x10 
I’ 1.49x10 

-2.52 X 10 ’ 
6.62X10 ’ - 

-4.71 X 10 ’ 
1.61 X 10 ’ - 

-1.73 X 10 * 
-6.68 X 10 ' 


2.23 X 10 * 
-7.76 X 10“’ 
1.93 X 10 ’ 

1.19 X 10 ‘ 
8.32 X 10* 
2.44 X 10“’ 
1.03 X 10“' 

-2.92 X 10 ’ 
-2.49 X 10 ’ 
-1.16 X 10 ’ 

4.19 X 10 * 
-2.43 X 10 * 


4.04 X 10* 
-6.49 X 10 ’ 
-2.56 X 10“’ 
-6.89 X 10 ' 
-2,23 X 10 ’ 
l.%x 10“’ 
-3.03x10 ' 
-9.79x10 * 
-9.81 X 10* 
-1..54X 10 ’ 
-1.13X10 ’ 
3.02 X 10 * 


1.21 X 10 ’ 
-4.00 X 10“* 
2.17 X 10 ’ 
-7.86 X 10 ’ 
1.80 X 10“* 
2.85 X 10“’ 
1.24 

3.50 X 10 ’ 
-9.81 X 10 * 
-4.62 X 10 ’ 
-4.83 X 10 * 
-3.61 X 10 * 


4.98 X 10 
-1.32 X 10“ 
9.58 X 10“ 
-3.21 X 10 
4.33 X 10 
1.36 X 10 


6.48 X 10 ’ 
-1.71x10 ’ 
1.23 X 10 ’ 
-4.16X 10 ’ 
5.15 X 10 * 
1.74 X 10 * 


1.10 X 10 ’ 
-2.91 X 10“' 
2.12 X 10 ’ 
-7.08 X 10 * 
9.57x10 ’ 
3.00 X 10 ' 


-2.20 X 10“* 
2.09 X 10 * 
2.53 X 10 ' 
1.75 X 10 ' 
1.98 X 10“* 
-6.88 X 10 ’ 


7.41 

9.52 

-1,91 X 10 

-2.47 X 10 

-4.22 

5.84 X 10’ 

3.40 X 10 ’ 

4.55 X 10 ’ 

-8.78 X 10 ’ 

-1.16X 10 ' 

-1.94X 10“’ 

3.97 X 10 * 

1.88 X 10“’ 

2.39 X 10 ’ 

-4.86 X 10“’ 

-6.28 X 10 ’ 

-1.07 X 10“’ 

2.48 X 10* 

9.52 X 10 ’ 

1.23 X 10“' 

-2.46 X 10 ' 

-3 I9x 10“' 

-5.43 X 10 ’ 

1.02 X 10 ’ 

3.80 X 10“’ 

4.43X10 ’ 

-9.79X10 ’ 

-1.23x10 ’ 

-2.16 X 10 ’ 

1.29 X 10 * 

4.65 X 10“’ 

6.02 X 10 ’ 

-1.20 X 10 ’ 

-1.55x10 ’ 

-2.65x10 ’ 

1.78 X 10“' 


-4.70 X 10 * 

-1.28 X 10 ’ 

-2.32 X 10 ' 

1.21 X 10 

2 

4.06x10 ’ 

-6-83 X 10 ’ 

1.63 X 10 ' 

-8.35 X 10 ' 

7.80 X 10 ’ 

-3.13 X 10 

3 

7.80 X 10 ’ 

1.38 X 10 ’ 

4.68 X 10 ’ 

-3.90 

1.38 

-4.34 X 10 


-7.60 X 10“' 

-1.31 X 10 ’ 

-1.79 X 10 * 

6.04 X 10 ' 

-2.27 X 10 ’ 

-9.37 X 10 

4 

1.29 X 10 ’ 

4.11 X 10“’ 

-6.66 X 10 ' 

3.19 X 10 ' 

-1,16X 10 ' 

-2.65 X 10 

4 

6.47 X 10 ’ 

-4,22x10 ' 

-1.19 X 10“’ 

8.63 X 10 ’ 

-1..36X 10 ’ 

-6.86 X 10 

S 

-1.82 X 10 ’ 

-2.05 X 10 * 

-1,91 X 10“' 

4.88 

-2.22 

-3.42 


1.42 

4.40 X 10 

1.07 X 10 ’ 

-2..14X lO ' 

1.15 X 111 ' 

-3.27 X 10 

2 

-7.58 X 10 ’ 

5.31 X 10“' 

-5.83 X 10 ’ 

1.49X lO ' 

-6.40 X 10 ’ 

-2.10 X 10 

■2 

3.94 X 10 ’ 

1,06 X 10 ' 

6.42 X 10 ' 

-I.12 X lO ' 

7 96 X 10 ’ 

-9.87 X 10 

2 

-5.98 X 10“’ 

8.44 X 10 ' 

-4.40 X 10 ’ 

3 84 X 10 ’ 

-3.38 X 10 ’ 

-5.26 X 10 


2.65 X 10 ’ 

-3.47 X 10 ’ 

1.19X 10 * 

-9.65 X 10 ' 

5.40 X 10 ’ 

-2.52 X 10 

3 

-3.81 X 10 ' 

1.90 X 10 ' 


The compressor bleed flow due to RCS area commands was calculated as follows for the controller evaluation: 

WB3 = 4.41 X |A0R| + 4.41 x |AYR| + 8.82 x |ARR|. 

The output matrices for the gros.s thrusts from the three "nozzles ”, corresponding to the results presented under the controller 
evaluation section are given by: 

r FG9l 

FGE = C,,.* + Dro» + fu.M', 



2.20 X 10“ 
-5.53 X 10“ 
-6.87 X 10“ 


3,87 X 10 ’ 0 0 0 0 -3.79 x It) 

-9.73 X 10 ’ 0 0 0 0 - 9 .55 x 10 


-1.21 X 10 


0 0 0 0 -1.19x 10 


1.81 X 10 

1.83 
2 28 


-1.99 X 10 
5.41 X 10 
6.64 X 10 


-9.63 X 10 
1.12 
1 37 


■1.71 X 10 
2..58X 10 
3.11 X 10 


0 0 0 0 0 0 - 4.25X 10 * 3.05x 10 4 52x 10“’ 2.43x10’ 5.37x10' 1.70x10 

0 0 0 0 0 0 2.07X 10 ’ -1.88x 10 1.05x10’ -1.76x10 5.32x10 ’ -2.76x10“ 

0 0 0 0 0 0 1 84X 10 ’ -2.33X 10 -4.20x 10 4,36x 10' 4.89x 10 -1.59x 10 


Frc = ^ “3,80x 10 -4.74x 10]’. 


The total gross thrust is given by: 


FGT = 1.027 X FG9 -I- 0,453 x FGE -t-l. 194 x FG9. 
The K'i used for robustness analysis were as follows: 


Ki X 10 “ 


3,3 3,5 4,4 4,6 5,1 5,5 

1.25 1.25 1.25 1.25 1.95 1.25 
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Robust Performances Control Design for a High 
Accuracy Calibration Device* 


M. MILANESE, G. FIORIO and S. MALANf 

A high accuracy calibration device has been modelled, and parametric and 
unmodeled dynamic errors have been evaluated. A controller was designed 
taking into account performance specifications, and robustness analysis was 
performed dealing with the derived nonlinear parametric and dynamic 
perturbations. ^ 

Key Words— Control applications; identification; error analysis; parameter estimation; robust control; 


Abstract —This paper presents a case study of robust 
performances control design. The physical plant under 
examination consists of a platform for calibration of high 
accuracy accelerometers, which has to assume the properties 
of an inertial body, despite the vibrations coming from the 
surrounding ground. Plant modelling and parameter 
estimation, control system design and robustness analysis of 
the designed controllers are described and discussed. Besides 
a simplified model of the plant, called the nominal model, 
perturbations arc also considered, taking into account 
parametric and dynamic uncertainties. The procedure 
followed for estimating model parameters, based on an 
unknown but bounded approach, is illustrated, and 
uncertainty intervals of parameter estimates are provided. 
Bounds of unstructured uncertainty are also derived from 
results of simulations aiming to evaluate the main effects of 
the unmodeled dynamics. 

The design has been carried on through iterative steps of 
“nominal’' design and robustness analysis. The design has 
been performed through //„ synthesis, based on the nominal 
model and taking into account the main performance 
specifications required for the present case study, i.e. 
stability, disturbance attenuation and command power 
limitation. The robustness analysis has been performed using 
recent techniques able to deal with frequency domain 
specifications and with mixed nonlinear parametric and 
dynamic perturbation, as required in the present case study. 

1. INTRODUCTION 

The problem originated from a national labora¬ 
tory, in the realization of a calibration device for 
high accuracy acceleration transducers. This 
device requires to work over a platform, whose 
conditions should be close to those of an inertial 
body. Unfortunately, the laboratory is located in 
the neighborhoods of heavy mechanical fac¬ 
tories, whose undesired effects are to generate 
vibrations in the surrounding ground. 

In these conditions, it is required to reduce the 
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r.m.s. value of the platform perturbing accelera¬ 
tion in the ratio 1:100 approximately, in order to 
have sufficiently negligible calibration errors with 
respect to the accuracy guaranteed for the most 
sensitive transducers. 

It comes out from a preliminary analysis that it 
is not convenient to use simply passive 
mechanical elements, such as springs and 
dampers, to solve the problem of reducing 
platform perturbing acceleration, because un¬ 
feasible parameter values should be required. 
On the contrary, the use of active elements, such 
as electromagnetic actuators driven in feedback 
or in mixed feedback-feedforward control 
schemes, leads to much more effective distur¬ 
bance attenuation. 

Figure 1 gives a sketch of the plant. A 
concrete rectangular platform P is supported at 
each corner by a set of three elements lying on 
ground; a spring, a damper and an electromag¬ 
netic force generator. Another platform fl, 
bearing the calibration device, is leant on the 
first one through similar mechanical elements, 
but without active generators. 

Accuracy is the main concern of the 
calibration device, and the desired accuracy must 
be guaranteed for the controlled system, despite 
of the limited and uncertain information 
available on the plant. Then a robust (worst- 
case) approach was taken, based on the typical 
two steps. 

• Identification of the approximate behaviour 
of the system in terms of a nominal model 
and a perturbation model, able to capture 
the discrepancies between the nominal 
model and the actual system (described in 
Sections 2 and 3). 

• Design of a control system which assures 
acceptable performances according to some 
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Fig. 1 The plant 


given specifications, not only for the 
nominal model, but for all considered 
perturbations (described in Section 4). 

It is well known that the type of perturbation 
model plays a key role. In the robust control 
literature, two main types of perturbations have 
been extensively studied: parametric (real) and 
dynamic (complex) perturbations, the former 
accounting for parametric variations, and the 
latter for unmodeled dynamics (see e.g Dorato 
and Yedavally, 1990 and Milanese et al, 1989, 
for a collection of recent papers). 

Recently, some work has been done on mixed 
types of perturbation (see e.g. Fan et al, 1991, 
Milanese et al., 1991), which seems able to 
better capture the physical information on the 
approximations introduced by the model. In fact 
a preliminary analysis of this case study has been 
performed in Fiorio et al (1990) taking into 
account parametric perturbation only. The 
results clearly showed that in this case the 
unmodeled dynamics play a key role in robust 
stability. In this paper, we perform a more 
complete study, using a mixed perturbation 
model. 

2 THE NOMINAL AND THE PERTURBATION 
MODELS 

The aim of this section is to illustrate the 
results of model building for a suitable 
representation of the plant. First, the nominal 
model is described. Furthermore, perturbations 
on this model are introduced, taking into 
account its main approximations. 

The nominal model is based on the following 
simplifying hypotheses. (1) The ground, as well 
as platforms P and B in Fig. 1, are considered as 
rigid bodies. (2) The ground has only vertical 
motion, and the structure has perfect symmetry 
at the four corners of the platforms. (3) Only 
linear equations are included in the model. 

The four electromagnetic actuators at the 
comers of the lower platform of Fig. 1 are driven 
by the same electric current i, according to the 


zj-y 





ground G 

Fig 2 The simplified model 

hypothesis that perfect symmetry gives rise to 
translation motion of the platforms along the 
vertical axis only. The simplified model is 
depicted in Fig. 2. In this figure, the four 
identical elements at each corner of the 
platforms are represented by only one equivalent 
parameter. 

In the nominal model, the vertical component 
of the ground acceleration Zc is considered as 
the only disturbance, which is denoted by d. The 
force / in Fig. 2 is considered as proportional to 
the current i, which is the only command 
variable of the system. So, it results: / = Kpi. 
The vertical component Z 2 of the upper platform 
acceleration is the controlled output, denoted by 
y. Thus, the system can be represented by the 
SISO structure of Fig. 3, where y(j), and 
d(j‘) represent system output, command variable 
and disturbance, respectively. M{5) and A{5) 
represent output sensor and actuator transfer 
functions, respectively. C(j) is the regulator 
transfer function to be designed. G( 5 ) and H{s) 
are the command to output and disturbance to 
output transfer functions, respectively. Their 
expressions are: 

Gis) = Na{S)D-\5), ( 1 ) 

H(s) = NHis)D-\s), ( 2 ) 



Fig. 3 The control scheme. 
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where 

^c('*) = *5^(1 + P2^2^s), (3) 

N„(s) = (1 + ^ifcr‘s)(l + ( 82*2 ‘j), ( 4 ) 

D(s) = m,m 2 /cr '^2 

+ [m,j 82 + rn2(/3, + /32mr‘*2-V 

+ [m,/cr‘ + m 2 (l:r‘ + * 2 ‘') 

+ PxP2kT'kW 

+ {P\ki'+ fi2k2')s + 1. (5) 

Output sensor is modelled by the transfer 
function 

= K^s{\ -\-sTy ( 6 ) 

Actuator transfer function A{s) is assumed as 
a constant Ka in the frequency range of interest: 

= (7) 

In order to take into account the approxima¬ 
tions introduced by this simplified model» the 
three main assumptions are briefly discussed. 

With regard to linearity assumption, the 
behaviour of the plant in normal operating 
conditions can be actually considered linear with 
good approximation, due to the very small 
displacements and accelerations present in these 
conditions, and to the fact that input current will 
be controlled to stay within the linearity range of 
the relation force-current. 

On the contrary, while the rigid body 
assumption appears to be likely for platforms P 
and B, ground deformability gives parasitic 
effects which may cause stability problems in the 
closed loop. Then the deformability of the 
ground is taken into account by means of an 
uncertainty represented in terms of a multiplica¬ 
tive perturbation A as indicated in Fig. 3, such 
that 

0 <fu<oo. ( 8 ) 

The choice of the weighting function W((d) is 
discussed in Section 3. 

The non perfect symmetry of the structure 
gives rise to rotating motion components of the 
platforms, with the effect of adding two 
pole-zero couples for each pole of the transfer 
functions of the nominal model. The pole and 
the zero of each added couple are very close, 
and close to the corresponding pole of the 
nominal model. Through simulation, we have 
verified that large asymmetries (up to 10 % on 
each parameter) give transfer functions which 
can be recovered with very good approximation 
by suitable assessments of the parameters of the 

simplified model of Fig. 2. 

In fact we assume that parameter vector p is 
known only to belong to a parameter uncertainty 


set n. The identification procedure described in 
the next section provides parameter estimates 
with their ranges of variations, able to recover 
this source of parametric perturbations, as well 
as others, for example the ones due to errors in 
measurements. 

3 IDENTIFICATION PROCEDURE 

The model described in the previous section 
contains several parameters, whose values have 
to be known. 

Constants K^, r, Kf. and Ka are given the 
values from the data sheets of the correspK)nding 
components: 

= 2 X 10*^ V sec^ m *, 

T = 2sec, Kf. =HJN A~\ (9) 

a:„ = io.oav"'. 

The remaining parameters of the model are: 
masses m, and m 2 , stiffness coefficients A:, and 
Ac 2 , damping coefficients j3| and ^ 2 - 

Also for these parameters, it could have been 
possible to disassemble the system and to 
measure them separately. Apart from practical 
difficulties, this approach is not appropriate 
because the assumed model is a simplified 
model, and its parameters have the nature of 
equivalent parameters, with implicit reference to 
some physical phenomena neglected in the 
mode!, as discussed in the previous section. For 
instance, stiffness parameter A:| is an equivalent 
parameter, taking into account stiffness of the 
springs sustaining platform P, the ground 
elasticity, and asymmetry of the structure. From 
physical considerations, it can then be argued 
that parameter m 2 is the less affected one by 
neglected dynamics. Consequently, this para¬ 
meter has been set to the value obtained by 
direct measurement; 

m 2 = 440 kg. (10) 

The remaining parameters are identifiable 
from the given experimental conditions, and 
have been estimated from measurements on the 
overall system. 

The available experimental data are: 

• Samples of frequency response command to 
output (module and phase) G(/a») with 
sinusoidal current generator supplying the 
actuator. Input and output measurements 
have been performed in open loop at about 
20 frequencies in the range 0.5-40 Hz, and 
with some different command amplitudes: 
0.5, 1 , and 2 A r.m.s. values (Figs 4 and 5). 

• Samples of frequency response disturbance 
to output |//(/ti>)|, (only module deduced 
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(ij(rad s‘‘) 


Fig. 4 Samples of |G(;<u)| with corresponding uncertainty intervals 


from spectral densities of vertical accelera< 
tion of both ground and upper platform B in 
open loop normal operating conditions. 
Frequency range is 0.63-57 rad s“\ with 
frequency resolution 1.25 rad s"‘ (Fig. 6). 
This range of frequency contains more than 
95% of both disturbance and output power 
(see Section 4), 

These data can be described by the equation 
y = F{p)^e, (11) 


where y is the vector containing the samples 
y = [|G(;w,)|, \Gijaj 2 )\. . •. , arg G(;tu,), 
arg G(/a> 2 ), . . . , 

\HijWr)\, mja>2)\. . . .]. (12) 

p is the vector containing the unknown 
parameters 

P = [*1. *2./5i./32. "I,], (13) 

and e is an error term. 



Fig 5 Samples of arg C(jto) with corresponding uncertainty intervals 
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Since accuracy is the main concern of the 
calibration problem, particular attention has 
been paid to the computation of parameter 
estimate accuracy, which depends on the 
knowledge about the error term e and on the 
estimation algorithm. 

A statistical description of e does not appear 
appropriate, mainly due to the modelling errors 
discussed above. An unknown but bounded 
error approach has been adopted, which requires 
information on measurement error bounds only 
(see e.g. Schweppe, 1973; Milanese and Vicino, 
1991b). 

Uncertainty intervals v, have been evaluated 
for each data, taking into account the accuracy 
of the measurement devices and the effects of 
unmodeled dynamics, expressed by the bounding 
function shown in Fig. 7. The used values are 
reported in Figs. 4-6. In such a way, the 
information on e is given as 

(14) 

where \\e\\l = max,v~^\e,\, v,>0 and v is the 
vector of error bounds corresponding to the 
uncertainty intervals on data. 

A least square estimate p” of the unknown 
parameters is obtained as 

:= argmin {||y - (1^) 

p 

where |1 • Hz denotes a weighted Euclidean norm, 
and V is a diagonal matrix with elements v^. 

Uncertainty in data induces uncertainty on t e 
parameter estimated values. The maximal range 


of variation of the ith component p]\ due to 
possible errors consistent with (14) is indicated 
as estimate uncertainty interval u,. The u,s have 
been evaluated by a method proposed by 
Belforte and Milanese (1981). The method is 
based on a quasilinearization technique and 
gives intervals certainly contained within the 
exact M,s. Indeed, it gives exact values if Bp^Udyj 
has constant sign for all y sY, the set Y being 
defined as 

Y=(y:\\y-y\Vi^\}- ( 16 ) 

where ^ denotes actual measurements. 

This condition is difficult to be checked with 
certainty. However, dp^ldy has been evaluated 
at several grid points, providing good evidence 
that the condition is met. This has been 
accomplished by using the formula (sec Belforte 
and Milanese, 1981) 


where 


^ = iA^VA)-^A^V, 
dy 


A 


dF 





(17) 

(18) 


In turn, — has been computed as a function 
dp 

of p by means of the symbolic manipulation 
package DERIVE. 

The identification procedure described above 
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has given the following results: 

*1 =pt=P\±Ul 

= (1.4 ± 0.3)10® Nin-‘, 

kl=P2 = pl±U2 

= (1.0 ± 0.3)10® Nin-\ 

Pi ^ Pi —Pi i “3 

= (4.8 ±1.0)10® Nsec m"', 

P2=P*=P4±U4 

= (1.7 ± 0.3)10® N seem"', 

mi=ps = p^s±Us 

= (4.2 ±0.7)10^ kg. 


(19) 


Note that other estimators could give smaller 
estimate uncertainty intervals. In particular, 
methods to compute minimal uncertainty inter¬ 
vals estimators have been proposed by Milanese 
and Belforte (1982), Milanese and Tempo 
(1985) and Milanese and Vicino (1991a). 
However, a least square estimator has been 
adopted because it is computationally faster and 
has better robustness properties with respect to 
inexact knowledge of uncertainty error bounds 
V,, as shown by Tempo and Wasilkowsky (1988). 

In order to complete the perturbation model, 
the weighting function W(a}) of (8) have to be 
evaluated. To this extent effects of ground 
deformability has been simulated by space 
discretization of the ground. This discretization 
introduces several couples of zeros and poles in 
the transfer function C(5) which are rather close 
to the imaginary axis. These low damped 


zero-pole couples give rise to peaks in the 
transfer function at relatively high frequencies 
(to > 1000 rad see”\ see Fig. 7). Now W((o) 
have to be chosen so that |C(y(U, p®)| (1 ± 
|IV(co)|) envelopes these peaks. The methods 
used for the design and the robustness analysis 
require W(a)) to be a rational stable function, 
whose order affects the complexity of the 
controller and the computational burden of the 
analysis. Then a first order, high pass (for 
Q) > 1000 rad sec“^) function has been chosen: 


W{(jd) = 


1.22/(0 
j(i) -h 1000 


( 20 ) 


In summary, the perturbed model which has 
been adopted for the purpose of robust 
performances regulator design, is the mixed 
parametric and dynamically perturbed model 
represented in Fig. 3. The forms of G{5,p) and 
H(s,p) are given by equations (l)-(5). Symbols 
G(s,p) and H{s,p) denote explicitly the 
dependence of the plant transfer functions on 
the uncertain parameter vector p. 

Parametric uncertainty is represented by the 
fact that parameter vector p is known only to 
belong to the parameter uncertainty set n defined 
as 


n={p6R^:l|p~p‘1l“<l}, (21) 

where p^^ denotes the ‘nominal’ parameter 
vector, whose components are given by the p% 
in equations (19), and u is the vector whose 
components are the estimates uncertainty inter¬ 
vals u, in the same equations. 



Fic 7 Bounds of perturbed transfer function G{s, p°)[l + A], in dotted line transfer function of discretized 

ground model 
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Unstructured uncertainty is represented by the 
multiplicative dynamic perturbation A, known 
only to belong to the modeling error set Af 
defined as 

A£*{A:||lV"'(a>) A(tu)|Usl, 0<aj<c»}, 

where 1^(0)) is given by (20). 

4. ROBUST REGULATOR DESIGN 
The feedback control configuration shown in 
Fig. 3 has been considered for output regulation 
of the plant. 

The loop transfer function 

Lis, p. A) = Ais)G{s, p)[l + A]M(j)C(j), (23) 

is now considered. Symbol Lis, p, A) denotes 
that the loop transfer function is affected by both 
structured and unstructured perturbations. 

Furthermore, the sensitivity function 
Sis, p. A) and its complement Tis, p. A) are 
considered; 

5 = (1 + L)"*, 7' = 1-S = L(1+ L)''. (24) 

The robust regulation problem requires to design 
a relatively low order controller C(j) such that 
the following specifications are met: 

(1) Closed loop is robustly stable with respect 
to the allowed structured and unstructured 
uncertainties, i e. Vp e 11 and VA e Ap. 

(2) Closed loop control guarantees distur¬ 
bance attenuation of 1:100 in r.m.s. 
values with respect to open loop opera¬ 


tion. This is obtained by imposing; 

\Sijfo,p, A)|^(/((u), 

Vp e n. VA € Ac, (25) 

0.63 £ (0 s 57 rad see" ‘, 

where Uiu)) is the bounding function 
shown in Fig. 8, and [0.63, 57] rad 8ec~' is 
the frequency range where the spectral 
density function Sjito) of the disturbance 
has more than 95% of its power (see Fig. 
9 ). 

(3) Command amplitude is limited, in 
order to guarantee that the current in the 
electromagnetic actuators does not exceed 
10 A r.m.s. This is obtained by imposing: 
\Hijw, p)S(jot), p, A)A#(/(w)C(ya))j4(/a))| 

6 X 10" Am~ ' sec^, (26) 
Vpell, VAeAc, 0.63 oi £ 57 rad sec"'. 

A systematic approach to a design problem of 
such a complexity is of the possibilities offered 
by the present state of the robust control 
literature. Consequently a design approach has 
been used, based on iterative ph ases of 
“nominal” design and robustness analysis, 
which, in case of failure, gives indications for the 
successive design phases. 

The design phase has been carried on, using 
the nominal parameters, in the following way: 
the robust stability requirement with respect to 
unmodelled dynamics and the given performance 



Fig. 8. Nominal sensitivity (C,t('») “ ^,,,( 1 )) and bounding functions. 
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Fig. 9. Spectral density of disturbance. 


specifications are used to suitably define the 
weighting functions entering in the design 
approach. Through synthesis algorithms it is 
possible to find a controller, satisfying the 
specifications for the “nominal’' parameters, or 
to understand if some of the specifications have 
to be relaxed. Note that the controller obtained 
by synthesis may be of higher order than 
desirable. However, it is often possible to find, 
by means of order reduction techniques, a 
simpler controller satisfying nevertheless the 
nominal performances. For this design phase 
we have used ari interactive design software 
described in Ferro (1992), implemented using 
the Robust-Control Toolbox of MATLAB. 

The robustness analysis phase has been carried 
on following the approach proposed by Milanese 
etal. (1991). 

First, performance specifications are repre¬ 
sented in the frequency domain by functions 
P) fc = 1-3 such that the fcth performance 
specification is satisfied for all unstructured 
perturbations A 6 Ae if and only if Fi(cu, p) >0, 
(o For example, in the case of stability, the 
corresponding specification inequality can be 
obtained by the small gain theorem, giving: 

|lV(ai)r(Mp,0)|<l. (27) 

A given compensator is said to achieve robust 
fc-performance if p) > £2^, Vp e H. 

Note that in this way, robustness is guaranteed 
vs both parametric and unstructured uncertainty. 

A robustness measure for the /:th perfor¬ 
mance, called performance margin pi, is defined 


as the radius, according to norm in parameter 
space, of the maximal ball, centered at the 
nominal parameter p", such that the closed loop 
system preserved the given performance specifi¬ 
cation for every parameter vector belonging to 
the maximal ball and for all admissible 
unstructured perturbations. This measure is a 
generalization of the widely used concept of 
stability margin (see e.g. de Gaston and 
Safonov, 1988; §iljak, 1989; Vicino et ai, 1990). 

Performance margin p* can be computed as 

p/ = min p 

p.io.p 

subject to 
p ^ 0 

\p,-p'i\^pu, 1 = 1, ...,4 
F^{(o,p)^0, (ueQ*. (28) 

Note that the kih performance is robustly 
achieved if and only if p^* ^ 1. 

The optimization problem (28) may have local 
minima and the global solution is needed for 
solving the problem. Global optimization meth¬ 
ods based on random search algorithms are not 
appropriate, since these methods guarantee 
convergence to the global minimum only in 
probability and, more importantly, they do not 
give any measure on how far the obtained 
solution is from the true global minimum. When 
F(o),p) is a polynomial function in (o and p, 
algorithms able to produce a sequence of upper 
and lower bounds converging with certainty to 
global extrema have been proposed, and shown 
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to be able to solve some non trivial robustness 
problems (Vicino et al., 1990; Vicino and 
Milanese, 1990; Milanese et al., 1991). 

Indeed the given specifications can be 
represented by polynomial inequalities in p and 
u). The explicit functional expressions 
k = l-3, corresponding to the considered per¬ 
formances, have been obtained by means of 
symbolic manipulation package DERIVE on a 
personal computer. As an example, p) is 
reported in the Appendix. 

The results reported in this paper have been 
obtained by use of the algorithm reported in 
Malan et al. (1992), being a further improve¬ 
ments over the previously cited ones. 

Note that in case the controller is found not 
robustly performing (i.e. p**:£l for some k), 
the above analysis gives useful indications for the 
design phase. In particular, if the robustness 
margin of one performance is less than one, a 
new controller may be designed by strengthening 
the corresponding specification and possibly 
relaxing the specifications with robustness 
margin greater than one. 

Following the described approach, a compen¬ 
sator has been found, satisfying performance 
specifications for nominal parameter p'\ given 
by: 

(l-t-j/0.5)(l-l-j/ll)(l +s/22y 

x(l-f j/67)(l -l-j/405)(l -l-j/10") 
C(j) = 5 . 

j\l -fj/57)^(l-l-j/58.8) 

X (1 -I-a'/]()65)(1 +S/9X 10"’) 

(29) 

This transfer function has been found to be 
reasonably well approximated by the fourth 
order transfer function; 

_ (1 + s/0.5)(l + s/50Y{l ^s/im) 

s\l+s/200) 

This controller largely satisfies the given 
specifications in correspondence to the nominal 
model: the closed loop is stable with a damping 
factor of 0.4, the disturbance effect on the 
output is 1:120 with respect to the open loop, 
the current in the electromagnetic actuators does 
not exceed 7 A. However, as it results from 
Table 1, it does not achieve robust disturbance 
attenuation. This suggests that the gain loop has 
to be raised. 

Indeed compensator Cj4(5) = 1.4Cio(j) has 
been found to achieve all robust performances. 
Note that the stability margin for this compen¬ 
sator is near to 1, indicating that higher gain may 
cause stability problems, as confirmed for 
example by the robustness analysis of C 2 o(‘S') 

2Cio(^). 


Table 1. Specification margins and coRRespoNDiNO 

COMPUTING TIMES ON A VAX 9000 COMPUTER 


specification 


CPU lime 
(see) 


CPU time 
(sec) 

P* 

CPU lime 
(see) 

1 

1 IK 

128 

1 08 

80 

0.81 

104 

2 

0 63 

15 

1 46 

23 

2.16 

69 

3 

3 33 

s 

3 33 

5 

3.33 

4 


We remark that the robustness analysis has 
been carried on by an approach able to exploit 
the nonlinear structure induced by the con¬ 
sidered perturbed parameters. It is known that 
this is an hard problem, requiring computation¬ 
ally cumbersome algorithms. Computationally 
simpler techniques have also been tried, by 
considering an interval plant formulation (Chap- 
pellat and Bhattacharyya, 1989; Tesi and Vicino. 
1991). However this approach lead to conserva¬ 
tive results, and for the present case study the 
resulting conservativeness is so high to prevent 
the possibility of finding a controller which could 
be guaranteed to perform robustly by such a 
simplified analysis (see Milanese et ai, 1992). 
On the other hand, for this case study, the 
computational burden of the nonlinear analysis 
appeared to be quite acceptable (see computing 
times in Table 1) and worth paying, in 
consideration of the improvements in the 
obtained results. 

5 CONCLUSIONS 

Thr presented case study illustrates how some 
robust identification and control techniques can 
be applied in dealing with real world problems. 

The modeling and identification step has been 
performed by using physical insight and set 
membership identification theory, able to ac¬ 
count for parametric variations and modeling 
approximations. 

Robustness of the control system with respect 
to both types of uncertainty is considered, not 
for stability only, but for other performance 
specifications, such as disturbance attenuation 
and command power limitation. 

An iterative design strategy has been adopted, 
based on successive phases of “nominal” design 
and robustness analysis. 

The main conclusion that we draw from this 
case study is that, in facing with real world 
problems, it is necessary to take approaches with 
the following features: ability to deal with 
nonlinear physical parametrizations; accounting 
for both parametric uncertainty and unmodeled 
dynamics; designing and analysing with respect 
to different performances. 

These requirements clearly lead to difficult 
problems, but it appears that techniques now 
exist, able to solve cases of such a complexity to 
be of some interest in practical applications 
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APPENDIX 

In order to show the type of functional expressions of 
functions Ff^((i),p), we report the specification function 
F,(q>, p), related to robust stability of the closed loop system 
using controller C|q(5). 

As can be seen, F^(a), p) is composed of 101 monomials 
in variables to, k^, k^, fn\- The other specification 

functions have approximately the same complexity. In 
particular F^iw, p) is composed of 71 monomials and 
F^{a), p) of 83 monomials. 

We recall (see Milanese et al., 1991) that the closed loop 
system is robustly stable Vp e 11, VA € ^tid only if 

Fi(cu, p) >0, Vp 6 n, Vw. 

F,(cu, p) = + 26/25 • \{fp]Pl(D^ + 4 • + 

P]klw^ + 26/25 ■ i{fip]kl(o* + 4 • - 

BSO/JfkjO)'* - 1144/125 ■ - 88/25 ■ 

\0'^P]k2CJ* + 242/125 • 10"jS>‘° + 6292/3125 * 

10* 1/5? 0)" + 968/125 • 10*’j8?a)* + 1566/625 • 
IO'PxPW ■*- 39,933/15625 ■ lO^^PiPlo)^ -1- 
3132/625 • W'''PiPl(D^ -K 3949/3125 • lO^p^p^o}'^ + 
440,363/312500 ■ lO^^p^p^o)^ + m,\19ll^,Y25 - 
lO^^P.P^o)^ - 3828/625 • 10^'p^p2Q)* + 1566/625 • 
\0'^P^kla)^ + 39.933/15,625 ■ lO'^p^klo)* + 
3132/625 ■ 10'^/5,k?(i>= - 17.226/15,625 ■ 
10***/3,k2tw‘’ - 439,263/390,625 ■ lO^^pjk^to^ - 
34,452/15,625 ■ + P\kW ^ 26/25 ■ 

]{fPlk](i)* + 4 • \0^^PlkW “ 2Plk,m^a}^ - 52/25 

• \{fiplkym,o)^ - 8 • 10***j9?k,miQ)' - 
28,720j8|/c,ai® - 40,033/12,500 ■ lO'^PlkiW^ - 
10,289/3125 ■ lO^^Plk^w^ + 174/125 • 10*^/9?k,ii>^ 

+ + 26/25 • lO^Plm^w^ + 4 - 10*"j3?m?£u* 

■f 28,720^^m,a)*® 4 40,033/12,500 ■ XO^^Plm^w^ + 
10,289/3125 • XO'^Plm^o)^ - 174/125 ■ 10*^/9?m^ca^ 

- 468,247/56,250 ■ 10^^?cu*" + 
10,452,293/9,375,000 - 10*'*^?w“ + 

30,511,6191X5,625,m ■ + 

54,186,761/5,625,000 • + 7569/6250 • 

X{f''plu)^ + 17,226/15,625 • 10'%k,a>* + 
439,263/390,625 ■ XO'^p^kiW^ + 34,452/15,625 • 
XO^^pjk^w* - 17,226/15,625 ■ 10*“j82m,(W*" - 
439,263/390.625 • 10*%miW*‘ - 34,452/15,625 - 
XXf^pjm , w" + k]klu}^ + 26/25 • X(fk^klw^ + 4 ■ 
10***k?k? - mk]k2(i)^' - 1144/125 • lO^k^k^o)" - 
88/25 ■ lO'^k^k^w'^ + 242/125 ■ 10^k?(a“ -h 
6292/3125 • XO^k^o)^ + 968/125 ■ lO'^k^^to^ - 

2k, - 52/25 ■ 10*‘fc, - 8 ■ 

10'°k28,720k,k?<w*’ - 40,033/12,500 ■ 
10**'kik?w^ - 10,289/3125 • lO'^'kik^cu^ -I-174/125 

• 10*'kik? -h 1760k,kjm.iu** + 1144/625 ■ 

10"k,k2m,w'’ -h 176/25 • 10’"k,k2m,w^ -h 
3949/3125 ■ 10'^k,k2iu“ 4 440,363/312,500- 
lO'^jk^oi^ + 113,179/125 ■ lO'U.kztw^ - 
3828/625 • l(F*k,k2ca^ - 484/125 ■ 10*k,miQi*” - 
12,584/3125 ■ 10**k,m,<y'* - 968/625 ■ 10*''k,m,cu*' 
-I- k|m?a;® + 26/25 ■ Xifiklm^o)^ + 4 ■ 10***k^/7i?ai^ 4 
28.720k?m, w** 40,033/12,500 • 10*‘*kim, + 

10,289/3125 • 10'‘*klm,a>^ - 174/125 • 10‘’k?m,ai^ 

- 468,247/56,250 ■ XO’^kjw^ + 
10,452,293/9.375,000 - 10*^k?cu‘' + 
30,577,679/15,625,000 • l(F"kJw^ + 

54,186,761/5,625.000 - lO^^k^m^ + 7569/6250 - 
lO^k? - 880k2rn?a>*** - 1144/125 - lO^kjmfw'* - 
88/25 ■ 10*^k2m?a>^ - 3949/3125 • XO'^kjm^w'^ - 
440,363/312,500 - lO'^fcamtO)"- 113,179/78,125- 
10*'*k2fn,eu® + 3828/625 ■ l(F*k2m,cy* + 242/125 - 
10®m?ft)'^ + 6292/3125 • 10'*m?(W*“ + 968/125 • 
10'"m?^u^ 


(31) 
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On the Design of Robust Two Degree of 
Freedom Controllers* 


D. J. N. LIMEBEER.t E. M. KASENALLYt and J. D. PERKINSt 

A new design method for two degree of freedom controllers is presented. 

optimization methods are used to ensure a guaranteed level of robust 
stability and robust performance. The techniques are illustrated on a high 
purity distillation system. 


Key M^onU control; two degrees of freedom; normalized coprime factors; loop shaping; robust 
performance; robust stability; model matching; control system design. 


Abatract —The aim of this paper is to introduce two methods 
of designing robust two degree of freedom (TDF) 
controllers. Doth design procedures will be illustrated anti 
evaluated on a high purity distillation column example. In 
the first approach the feedback controller and the prcfilicr 
are designed in a single step using an optimization 
procedure. The second method optimizes the feedback 
controller and prefilter in two separate design stages 
Roughly speaking, the feedback controller i.s designed to 
meet robust stability and disturbance rejection specifications, 
while the prefilter is used to improve the robust model 
matching properties of the closed loop .system. The single 
step approach has the advantages that it is easy to use, and 
that the resulting controller degree is the same as that of the 
plant. The two stage approach offers greater design flexibility 
and it may produce robust stability and robust performance 
margins which are significantly bigger than those achievable 
with the .single stage approach. Set against that, the two stage 
technique is more difficult to use and it produces controllers 
which may have an order which is significantly bigger than 
that of the plant. 


1. INTRODUCTION 

In conventional control systems of the type 
illustrated in Fig. 1, the plant input u is 
generated by processing e = r - Fy which is the 
difference between the reference inputs and a 
fixed function of the outputs. If the input to the 
controller is some fixed function of r and y (such 
ase = r- yore = r- Fy), it is well known that 
the associated design problem has a single 
degree of freedom (SDF) which may be 
described in terms of a stable Q parameter. In 
systems of this type it is also known that there 
are fundamental design compromises which have 
to be made between robust stability and 
sensitivity minimization (for example). When 
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July 1992. This paper was not presented at any IFAC 
meeting. This paper was recommended for publication in 
revised form by Quest Editor R. F, Curtain. 

t Interdisciplinary Research Centre for Process Systems 
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one is faced with demanding performance 
specifications, the restrictions associated with 
SDF control systems may make it difficult or 
even impossible to optimize the trade-offs which 
have to be made. 

In two degree of freedom (TDF) configura¬ 
tions one allows the controller to process the 
references and measurements independently 
subject only to closed loop internal stability 
requirements. For example, one may set 

« = Kp = K,r + K3y. (1.1) 

and then select (the prefilter) and K 2 (the 
feedback controller) arbitrarily with internal 
stability the only restriction. The two degrees of 
freedom in systems of this type may be 
parametrized in terms of two stable but 
otherwise free Q parameters (Vidyasagar, 1985; 
Youla and Bongiorno, 1985). 

Linear quadratic Gaussian (LOG) methods 
and 'JT optimization procedures developed over 
the last decade are now widely used in the design 
of SDF control systems. The aim or this paper is 
to investigate the possibility of expanding the 
role of K" optimization tools in TDF system 
design. Our starting point is the TDF para- 
metrization theory of Youla and Bongiorno 
(1985) and the loop shaping design procedure 
described in McFarlane and Glover (1992). We 
present two optimization procedures for the 
design of such systems. In the first approach the 
entire controller is designed in a single step. In 
this case we optimize the two Q-parameters 
(which we will call Qi and O 2 ) simultaneously. 
The second procedure settles on and 

therefore K 2 first, and then optimizes Qi and 
hence K, in a second optimization step. The Q 2 
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Fig. 1. The archetypal single degree of freedom controller 
configuration 

parameter is used to optimize the loop's robust 
stability and disturbance rejection properties, 
while Qi is used to ensure a prescribed level of 
robust model matching. The first procedure has 
the advantage that it is easy to use and that the 
resulting controller has a low degree because Ki 
and K 2 share the same state-space. The second 
method will always have the capacity to produce 
a higher robust stability margin, and may result 
in an improved level of robust performance (as 
compared with the first approach). Unfortun¬ 
ately the two stage approach requires more 
computation than the first, is more difficult to 
use and results in a higher order controller. Both 
design procedures will be tested and compared 
on a hi^ purity distillation system. 

The theoretical background to the design 
methods is given in Section 2 where we 
summarize the notion of robust stability in the 
context of normalized coprime factors, robust 
stability optimization, a measure of robust model 
matching and TDF parametrization theory. The 
TDF design procedures are given in Section 3. A 
detailed design for the distillation problem 
appears in Section 4, with the conclusions in 
Section S. 

The space of functions with no poles in the 
closed right half plane is denoted Bold face 
letters will be used for transfer functions and 
script characters will be used for sets or spaces. 
The notation 

G = D + (1.2) 

will be used from time to time, and we also need 
G~(j) := G*(-j). Linear fractional maps are 
denoted 

/V(P, K) - Pn -H Pi2K(I - P22K)"^P2„ 

in which P is partitioned as Finally, 

we will make frequent use of one of Redheffer’s 
theorems on the contractive properties of linear 
fractional maps: 


2 THEORETICAL BACKGROUND 

The problem of designing controllers to meet 
robust stability specifications together with 
nominal performance requirements has been 
well studied. One such method, which is based 
on normalized coprime factors, concentrates on 
coprime factor robustness optimization and has 
already been successfully applied to several 
design problems (McFarlane and Glover, 1992). 
The task of designing TDF controllers to 
simultaneously meet robust stability and robust 
performance specifications seems to have re¬ 
ceived less attention. In the remainder of this 
section we will summarize the theory required 
for two design procedures for TDF controllers. 
These procedures make it possible to guarantee 
a prescribed level of robust stability, while 
simultaneously ensuring some minimum level of 
robust model matching. 

2.1. Robust stability 

One way to characterize a set of plants, 
with some nominal centre, G, is via a perturbed 
normalized coprime factorization. If G is a given 
plant model, then 


G = M-'N, (2.1) 


is a normalized left coprime factorization of G if 
M, N 6 are coprime and satisfy 


If 


MM -hNN‘=l. 


( 2 . 2 ) 



A 

B 


C 

D. 


is given 


^ ^ r-^^d R-^^y 


(2.3) 


is a normalized coprime factorization of G where 
H :=-{BD'+ ZC')R-' and R .= I + DD'. The 
matrix Z ^ 0 is the unique stabilizing solution to 
the algebraic Riccati equation 

{A - BS-^D'C)Z + Z{A - BS-'D'C)' 

-ZC'R-*CZ + B5-'R’ = 0, (2.4) 

in which 5 := / + D'D. 

A set of plants ^ may now be characterized by 


Theorem 1.1. If ||P|l„:s y and ||K||.s y~*, then 
llF,(P,K)|Usy. 

Proof. See Redheffer (1960). ■ 


Gp = (M-A„)-'(N + A;v), (2.5) 

in which A*,, A/v 6 satisfy ||Am A/v||.< y“'. 

In order to maximize the robust stability of the 
closed loop system, we require a feedback 
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controller which minimizes 


y = 



( 2 . 6 ) 


as K ranges over all stabilizing controllers. It is 
then immediate from the small gain theorem that 
the perturbed closed loop system will remain 
stable provided 

1|A„ (2.7) 

It is also well known that the lowest achievable 
ST” norm for the quantity in (2.6) is given by 
Glover and McFarlane (1989) 

y„-p'. = Vl-||M Ml?,. (2.8) 

where H-Hh denotes the Hankel norm. 

If ^ 0 is the unique stabilizing solution to 
the Riccati equation 

(A - BS-'D'CyX + XiA - BS-'D'C) 

-XBS-^B'X + C'R-'C = 0, (2.9) 
it can be shown that 


yop, = (i + A„„(Jrz))‘^l (2.10) 

A controller K<) which achieves this bound is 
described by the generalized state-space equa¬ 
tions (Glover and McFarlane, 1989) 

Q'x = [Q'(A -I- BF) + y^ZC'(C -I- DF)]x 


+ Y^ZC'u, (2.11) 

y = B'Xx-D'u, (2.12) 

where 

F^-S-\D'C + B'X), (2.13) 

Q = {\-yy-yXZ. (2.14) 


context of this paper, we will call the system 
robust from a model matching point of view if 

11(1-G_K,)-‘G„K,-Moll. :sy. (2.15) 


for all perturbed plant models G^ e % 


The 


transfer function M<, represents some desired 
closed loop transfer function and is chosen by 
the designer. In our applications, we will use M,, 
to explicitly introduce time domain specifications 
into the design process. In the case of 
multiplicative perturbations at the plant input, 
the set 'S is generated by Gp = G(I + A) with A 
chosen so that G and G. have the same number 


of 


right half 


plane poles and such that 
l|A|| I* ^ y . Oirect calculation gives. 

(I - G(I -I- A)K2)-'G(I 4- A)K, - Mo 
= (I-(I-GK2)-‘GAK2)~' 

X (I - GK2)-'G(I + A)K, - M„ 

= (I-GK 2 ) 'GK,-Mo 
+ (I-GK2)-'GA 
x(I-(I-K2G)-‘K2GA)-' 
x(|-K2G)-'Ki 

(I -GK 2 ) 'GK, -M„J 
(I - K2G)-’K, 

(I-GK2)"'G 




(I - KjG) 'K 




If K. and can be found such that 


^2 

\-i 


(I-GK2)-‘GK,(I-GK2)"'G 
(I - K2G) ‘Ki (I “ K2G)”'K2G- 

sy, (2.16) 


then it is clear that: 


In practical design applications it is usually 
necessary to reshape the plant’s frequency 
response in order to meet the closed loop 
performance requirements. The loop shaping is 
done by premultiplying the plant by a loop 
shaping precompensator W (McFarlane and 
Glover, 1992). The robustness maximizing 
controller is then found for GW and it is 
consequently GW and not G which is optimally 
robustly stabilized. 


(1) The feedback system will be stable for all 
those perturbations A which give G and Gp 
the same number of right half plane poles 
and such that ||A||«,< y'V This follows from 
11(1 - K 2 G)"^K 2 G|U:^ y and the small gain 
theorem. 

(2) The closed loop system will have robust 
model matching properties in the sense that 

||(I-G,K2)-'G^K,-Mo|U<y, (2.17) 


2.2. Robust performance 
The goal of this section is to develop a method 
of guaranteeing closed loop robust performance 
in a model matching sense in the face of nominal 
plant model perturbations. We will analyse the 
case of multiplicative perturbations at the plant 
input in detail, normalized coprime factor 
perturbations, additive perturbations and multi¬ 
plicative perturbations at the plant output may 
then be dealt with in the same way. In the 


for all This follows from 1|A|U< 

Y~\ (2.16) and Redheffer’s theorem on the 
contractive properties of linear fractional 
maps. 

We will now repeat this analysis for perturbed 
normalized coprime factors. In this case each 
element G^ e "S is represented by 

Gp = (M-Am)-‘(N + An), ( 2.18) 
where Am, An e Xt satisfy ||Am AnIU < y“‘- 
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Suppose 

♦ = (I - (M - Am)-'(N + An)K,)-‘ 

x(M-Am)-‘(N + An)K,-M,„ (2.19) 

in which K, and Ka are given, and where Am, 
satisfy ||Am AnIU < y“'. Then a 
rearrangement of the type given earlier yields 

( ~ (l-GKa)~*GK|-Mo | (l-GKa)~'M-' ~ 

G(I~KaG)-‘K, (l-GKa)-'M-' 
(I-K2G)-‘K, K2(I-GK2)-'M->_ 

[Am Aslj. (2.20) 

From this we observe that 

(1-GK2)~‘GK,-M(i | (1-GK2)~'M~' ~| 
G(I-K2G)-'K, (I-GK2)''M-' 

(I-K2G)-'K, K2(I-GK2)“'M-'j , 

<r. (2.21) 

and 

I|Am AN|U<y-', (2.22) 

ensure that: 

(1) The closed loop system in Fig. 2 will be 
stable for every Gp 6 S. This follows from 
the fact that the (2,2) block of (2.21) is 
stable, has infinity norm ^y and the small 
gain theorem. 

(2) The closed loop has the robust model 
matching property 

11(1 - GpK 2 )-'GpK, - H.IU ^ y, (2.23) 

for all Gp 6 'S. This property is immediate 
from Redheffer’s theorem (Redheffer 1960). 

2.3. All stabilizing controllers 

As we will now demonstrate, the TDF 
controller may be parametrized in terms of a 


pair of Q-parameters which we will denote Qi 
and Q 2 . A theory of this type is well known, and 
may be found in Vidyasagar (1985) and Youla 
and Bongiorno (1985). Once all the controllers 
have been parametrized, Qj and Q 2 may be 
optimized either separately or in a single step. 
We will study both these approaches in the 
context of the TDF configuration in Fig. 2. 

It follows from this diagram that 



(2.24) 


We now suppose that 

G = 1V|-'N = ^-*. (2.25) 

are left and right coprime factorizations of G, 
that 



are the corresponding Bezout identities and that 

<C=-(U + MQ)(V-NQ)-', Qe9r:, (2.27) 

is a representation formula for every feedback 
controller which stabilizes G (Glover et al., 
1991; Youla et al., 1976). In the case that Mo is 
stable, K = [K, K 2 ] will stabilize P 22 if and only 

if it stabilizes P. Now 


while the corresponding Bezout identities are 



(2.29) 



Fig. 2. TDF controller and a perturbed normalized coprime factor representation of 
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As a 

consequence, we have that 


[Ki 

Kal*-IMQ, U + MQj] 



><{[; 

r' 


= -[MQ, U + MQi] 



x[ * 

0 1 


I-(V-NQ2)-‘NQ, (V- 

■NQ2)-'J‘ 

From this we conclude that 



K2=-(U + MQ2)(V-NQ2)- 

(2.30) 

and 




K, = -(M-K2N)Q,. 

(2.31) 


Substituting (2.30) and (2.31) into (2.21) gives 


(1-GK2)~^GK,-M(,| (1 - GKj) - 'M- ‘ 


G(I - K2G)-‘K, 


(I 

-K2G) 

r'K, 


■-Mo 

V 

= 

0 

V 


0 

-u 


(I-GK2)”'M-' 

MI - GK,)-'M U 

"n" 

N (0, IQ 2 ], (2.32) 

M 


which is affine in Qi and Q 2 . 


3. DESIGN PROCEDURES FOR TDF 
CONTROLLERS 

We propose two design methods which will be 
tested on a high purity distillation system. The 
first technique uses a single step optimization 
procedure, while the second approach optimizes 
the parameters Qi and Q 2 in ^separate steps. 

3.1. The single step approach 
The single step approach makes use of the 
configuration illustrated in Fig. 3. By updating 


(2.32) to represent Fig. 3, we obtain 


z 

y 

u 


pGCI-K^G) ‘If, 


p(I-K,G) ‘K, 


P(I-GK2) ‘M“‘ 
(I-GK2)'’M'' 

KjCI-GKj) m 


»[;]■ 

in which the scaling factor p is used to weight the 
relative importance of robust stability as 
compared with robust model matching. If 



((1-GK;) ‘GKi-M,.) 
pG(l - K 2 G) 'K, 
p(l-K,G) 'K, 


p(l-GK,) 'M 
(I-GK 2 ) 'M ' 
K,(I-GK,) 'M 


I 


I 


(3.34) 


then: 


(1) The loop will remain stable for all Am, 
An E such that ||Am AN||rc<y“'. This 
follows from the (2,2) partition of (3.33), 
(3.34) and the small gain theorem. 

(2) A direct consequence of Redheffer’s 
Theorem is ||(I - GpK 2 )"'G^K, - Mo|U ^ 
YP~^ for all Gp e 'S generated by Am, An 6 

such that IIAm AN||rx<Y~^- This is the 
guaranteed robust performance property. 

(3) If p is set to zero, the TDF problem reduces 
to the ordinary robust stability problem 
described earlier in Section 2.1. 


There is an alternative scaling procedure which 
may prove more useful in certain applications. 
We can perform all the ‘ p scaling” through M(, 
by setting the (1,1) partition of (3.33) to 
((I - GK 2 )“^GK, - p^M()). Once the optimiza¬ 
tion problem has been solved, we reverse the 
scaling by replacing K, with p"^K,; increasing p 
tends to emphasise the model matching part of 



Fig. 3. The single step TDF design configuration. 
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the problem. In this case it follows that 
11(1 - GpK 2 )-‘G^K, - pXlU V for all Gp e 

As a refinement, the prefilter may be scaled so 
that the closed loop transfer function which 
maps p-*y in Fig. 3 matches the model exactly 
at steady state. To do this we make the 
substitution Ki-» Ki£ where 5 is a scaling matrix 
defined by S ;= R,^(0)~’Mo(0). We have ob¬ 
served that this rescaling has the effect of 
producing better model matching at all fre¬ 
quencies because the dif. optimization process 
tends to give the same magnitude frequency 
response shape as the model Mq. 

To set the problem up in a generalized 
regulator framework for optimization (Glo¬ 
ver et al., 1991), we define the matrix 



(3.35) 

which comes from Fig. 3. Setting 



(3.36) 

which may be passed to standard software 
packages. 

3.1.1. A prescriptive design procedure. As 
with any control system design methodology the 
process is iterative. As a starting point we 
assume that we have a nominal model of the 
plant together with some characterization of 
uncertainty. We then proceed as follows: 

(1) Select a loop shaping weight for the open 
loop plant. This is used to meet certain 
closed loop performance specifications 
(McFarlane and Glover, 1992). 

(2) Select simple target model. Mo, for the 
closed loop system. This is usually a diagonal 
matrix of first or second order lags which 
represent desired closed loop time domain 
properties. The selected target model must 


be realistic, or the resulting closed loop 
system will have poor robust stability 
properties and the controller will produce 
excessive control signals. 

(3) Find the minimal value Yopt in the pure 
robust stabilization problem; this may be 
calculated using equation (2.10). A high 

■ value of /opi indicates that the specified loop 
shapes are inconsistent with robust stability 
requirements, and that IQ, will significantly 
alter the loop shafts. In this case the loop 
shapes should be adjusted. 

(4) Set p for the TDF problem in (3.35). For the 
high purity distillation problem we used 
1 <p:S3. 

(5) Find the optimal value of y. In the 
distillation problem application we found 
that 1.2yop,< y<3yop, gave a good com¬ 
promise between the robust stability and 
robust performance objectives. 

(6) Calculate the optimal controller, post multi¬ 
ply it by W, and rescale the prefilter to 
achieve perfect steady model matching. The 
final controller degree will be ^deg (G) -t- 
deg (M,,) + 2 deg (W). 

3.2. The two stage approach 

There are many ways of utilizing the two 
degrees of freedom offered in a TDF feedback 
configuration. The procedure we will now 
describe is aimed at guaranteeing robust stability 
in the face of coprime factor uncertainty and 
robust model matching in the presence of 
multiplicative perturbations at the plant input. 
This compromise gives us the advantages of the 
McFarlane-Glover loop shifting procedure for 
the design of K 2 , while explicitly taking account 
of the multiplicative form of the modelling 
uncertainty when designing the prefilter. In the 
case of the distillation column application, it is 
easy to show that the uncertain actuator gains 
and input time delays may be represented as 
stable uncertainty at the plant input. Direct 
calculation, using the modelling assumptions 
given in Section 4, shows that = G(IA) 
where 

it,e '<'“'>-1 0 ] 

0 kje'lJ‘ 

In the first step of the two step approach we 
seek a K 2 which ensures some minimum level of 
robust stability. In particular we use K 2 to ensure 
that 

||J(I-GK2)-[I G][sy,. (3.37) 

The second column of this expression appears as 
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the second column of (2.16) which is the linear 
fractional expression for robust model matching 
in the case of multiplicative perturbations at the 
plant input. Since 



A + BF 


F 

5-1/2 

C + DF 



(3.38) 


is a normalized coprime factorization of G, we 
may use the expression for given in (2.11) 
and (2.12) together with K, = -(M - K 2 N)Q, to 
parametrize all TDF controllers which robustly 
stabilize the system in Fig. 2. If 


A, 

B.] 

Ic, 

D2J 


it follows by direct computation that [K, Kj] is 
described by the descriptor equations 

Q'x, = (Q'(A + BF) + y"ZC'(C + DF))x, 

+ (G'fl + y^ZC'D)5-’'^C,X2 
+ {Q'B + y^ZC'D)S-''^D,u, + y^ZC'u^, 

X2 = AyX 2-^ BxUxy 

y = B'Xx^ - - S''^D,U, - D'u^. 

The aim of the second step is to hnd a Q, which 
enhances the robust command following perfor¬ 
mance. Since the uncertainty in the distillation 
column system appears at the plant input, we 
will seek a Q| which minimizes 


yi = 


(I-GK2)'*GK,-M„ 

(I-K2G)-'K, 

M,, -I- NQ, 

MQ, 


(3.40) 


(3.41) 


which is the quantity in the first column of 
(2.16). To summarize, Qj e Sifi and Q 2 e^i 


have been chosen to minimize the first and 
second columns, respectively of (2.16). Our aim 
is to select Q| and Q 2 so that 




P.i 

P 2 . 



^Yi 



(3.42) 


in which f ” ‘^1 represents the expression in 

l-Pzi P 22 J 

equation (2.16). It is now immediate that the 
loop will be robustly stable for all An, An 6 
such that 

||Am ANlU<y2''- (3-43) 

It is also clear that ||(I —GK 2 ) ‘GKi —MolU — 
yi which is a measure of the nominal 
performance. Our next result establishes the 
robust command following performance of the 
closed loop system in the sense that 

11(1 - G^K 2 )-'GpK, - Moll- s Vy? + yf . (3-44) 


for all input multiplicative perturbations A e 
satisfying 


IIAII.s 



(3.45) 


Lemma 3.1. Suppose 



Then 


rp ~| 

^ y, and that 

LP21J I® 



+ yl for any A such that ||A||, 



Proof. To start, we note that ||A|Lg- y .. . 

Vyf + Y2 

ensures that the closed loop is well posed for any 
y,> 0 . If 


1 

■p.i 


LzJ 

'LP 2 , 

P22J1.H’. 


we must have 

11^11! +I|y|l2^(y? + y2)(l|u||^+11^11^). 

since 


(3.46) 

(3.47) 




(3.48) 


As u = Ay and ||A||.^ VyiV conclude 

that llzjljS Vyi + y 2 ||m '||2 which completes the 
proof. ■ 


4 A HIGH PURITY DISTILLATION SYSTEM 
In the remainder of this paper we compare 
and evaluate our TDF design methodologies on 
a high purity distillation system of the type 
described in Limebeer (1991), Morari and 
Zafiriou (1989), Skogestad et al. (1988) and 
Yaniv and Bariev (1990). Our study will be 
based on the following linearized model of the 
column 

1 p.878 - 0.8641 fit,e-*'^ 0 ] 

^'’~75j + 1 Ll.082- 1.096JL 0 

(4.1) 

The input time delays model uncertain flow 
dynamics in the column and lie in the range 
O^Ti, minute. The actuator gains are 

also uncertain due to flow rate measurement 
errors. These variables lie in the range O.Ssfci, 

* 2 =^ 1-2. 

4.1. Design specification 
The aim is to produce a controller which 
meets typical robust performance and robust 
stability specification. In our context robust 


AUTO ^ 9 !l-L 
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stability means guaranteed closed loop stability 
for all OsTi, T 2 £l minute and all 0.8£/ci, 
Ar2^1.2. The robust performance specification is 
given in terms of a closed loop step response 
requirement to changes in the product composi¬ 
tions which must be met for all values of ky, /c 2 , 
Ti and T 2 . 


response reference model was selected to be 


0.12 
s -(- 0.12 


(4.3) 


which is fast enough to meet the time response 
specifications. The loop shaping weighting 
function we decided on was 


(1) For all product compositions where a change 
is demanded, the composition should be 
within ±10% of its desired value within 30 
minutes, and then remain there for all future 
times. 

(2) The final values of all variables should be 
within ±1% of their desired values. 

(3) For product compositions where no change 
is demanded, deviations of less than ±50% 
at all times are required. 


Two cases will be considered. In the first, a step 
demand ^(o|^g ^ applied to the closed loop 


system. This input corresponds to a change in a 
relatively low gain direction of the plant 
(Skogestad et al., 1988) and one would therefore 
expect some difficulty with control. In the second 


case we consider 



which is in a 


relatively high gain direction; no control 
difficulty is expected. 

In conclusion, we mention that although we 
are concentrating on linear design, the final 
controller has to work on highly nonlinear 
hardware. As a consequence, the final controller 
should have a realistic gain and bandwidth. 


4.2. Distillation column design study 
We will tackle the distillation column design 
problem by applying the methods of Sections 3.1 
and 3.2 to the nominal plant model 

, e-^ rO.878 -0.8641 

' 75s + iLi.082 -1.0%J’ 

which corresponds to gain settings of ky = k 2 = 
1.0 and Ti = T 2 = I minute. The model used for 
the design is obtained by approximating the time 
delay by a first order Pad6 approximation. All 
our simulation studies use a sixth order Pad6 
approximation for the delays. Once the control¬ 
ler has been found, it will be evaluated for its 
robust stability and robust performance charac¬ 
teristics. The remainder of this section deals with 
the single step approach, while Section 4.3 
considers the two step approach for the same 
problem specification. Following the prescriptive 
design procedure given in Section 3.1.1, our time 


t H(i) is the unit step applied at <» 0. 


500(j + 0.55) 
j(11j +1) 




(4.4) 


in which the gain, and pole and zeros locations 
were arrived at by considering the required loop 
shape at high- and low-frequency. Integral action 
is used to boost low-frequency gain. The zero at 
—0.55 is used to reduce the roll-off to 
approximately 20 dB decade”* at cross-over, 
while the pole at -0.0909 ensures a low 
controller bandwidth. Notice that because the 
two diagonal elements of W are the same, W is 
well-conditioned. Ill-conditioned compensators 
for ill-conditioned plants can give very poor 
robustness at other points in the loop (Freuden- 
berg, 1990). The corresponding shaped and 
unshaped open loop singular values plots are 
given in Fig. 4(a). When selecting the loop 
shaping function, we were aiming at high 
low-frequency gain and a low bandwidth 
controller. 

The loop shaping function in (4.4) gives 
Xopi = 6.2350 for the pure robustness problem 
associated with GW; see equation (2.10). The 
design may now be completed by selecting a 
value for p. We found that p = 1.1 gave a good 
compromise between acceptable stability pro¬ 
perties and meeting the time domain require¬ 
ments of the design. This value of p leads to 
Y = 8.0105 for the lowest achievable value of y in 
(3.34). For the optimal value of y. Figs 4(b) and 
(c) show the Bode magnitude plots for the 
controller. Both the prehlter and the feedback 
controller roll off at high frequencies. This gives 
an acceptably low controller bandwidth. 

Figures 5(a) and (b) show closed loop step 


responses for the inputs 



and Hit) 


respectively, when Ti = T 2 = 1.0 minute, 

and when ky = k 2 = 1.0, ky = k 2 = 0.8, ky = k 2 = 
1.2, ifci = 0.8 and ^ 2 = 1.2 and ky = \.2 and 
*2 = 0.8. The “+” curves illustrate the step 
responses of the target model given in (4.3). 
These diagrams show that the closed loop system 
has a response which is close to the model 
response for an extreme range of actuator gains, 
and that the time response specifications are met 
for the maximum actuator delays of ry = T 2 = 1.0 
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Fig 4 (a) The shaped and unshaped open loop singular values for the LV configuration The prefilter (b) 
and feedback controller (c) open loop frequency responses 


minute. We complete the design study with Fig. 
6, which shows the closed loop step responses to 

the input for combinations of Ti 

and T2 where T2 = 0.0, 0.5 and 1.0 minutes, 
in steps of 0.5 minutes, and every combination 
of ki and kj in the range 0.8 ^ 2 ^ 1.2 in 

steps of 0.2. These curves indicate that the 
robust performance and robust stability specifi¬ 


cations are met for the entire range of allowable 
time delays and uncertain actuator gains. 

4.3. The two stage design 
The purpose of this section is to evaluate the 
two 6itp design procedure on the distillation 
control problem described in Section 4.2. The 
design study of this section is based on the G 
given in (4.2). As explained earlier and in 




Fio 5 Closed loop step responses with uncertain actuator gams and nominal time delays of =» r 2 = 1.0 

minute. 
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tune - muiutes 

Fig 6 Closed loop step responses with uncertain actuator gams and time delays 


McFarlane and Glover (1992) we begin the 
robust stability optimization by shaping the open 
loop frequency response. In the present case, we 
decided on 

1200 0 1 (5-l-0.55)(0.ay-H) 

0 260J 5(205+ l)(0,lj + l)’ ^ ^ 

as the loop shaping function. In our iterative 
design trials we found that a weight with a 
condition number greater than one could be 


used to improve the performance properties o 

r1200 0 1 

the closed loop. The ^ part of th 

weight was used to marginally reduce th 
condition number of GW. We see from Fig 
7(a), which gives the shaped and unshapec 
frequency responses, that the effect of W is to: 

(1) Reduce the open loop condition number. 

(2) Increase the low-frequency gain. This effec 



10-3 102 10-1 IQO 101 102 103 


(a) frequency - radAmn 



(b) frequency-radAnin (c) frequency-radAnin 

Fio 7 (a) The shaped and unshaped open loop singular values for the LV configuration The prehlter (a) 
and the feedback controller (c) open loop frequency responses. 
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is obviously produced by the integrators in 
the weighting function. 

(3) The zeros are -0,55 rad minute"' and 
-1.2 rad minute"' and are used to decrease 
the unity gain roll off rate. Restricting the 
roll-off rate near unity gain in this way has a 
beneficial effect on the closed loop time 
response. 

(4) The pole at —0.05 is used to ensure a low 
controller bandwidth, while the pole at -10 
is used to increase the controller's high 
frequency roll-off rate. 

It follows from (2 10) that yopt = 5.0435 for GW 
when the two unity time delays are approxim¬ 
ated by the first order Pad6 approximation. The 
feedback controller K2 was evaluated by 
substituting y2=}'opi into equations (2.11) and 
(2,12). Since 5.0435 < 6.23M, the two stage 
design method offers an improved robust 
stability margin (as compared with the single 
step procedure). The prefilter was designed by 
solving for a Q, € which minimizes 


Mo + NQ,] 

MQ, , 


(4.6) 


The target model Mo is given in (4.3). A 
y-iteration (Glover et ai, 1991) shows that the 
minimum achievable value of yi is y, = 0,6523. 
Since 


Vy? + yl = Vo. 6523"' + 5.0435'' 

= 5.0855 < 6.2350 x (1.1)-^ = 5.1529, 

(4.7) 


the two stage design gives an improved margin 
of robust model matching in the face of 
multiplicative input uncertainty. Figures 7(b) 
and (c) show the singular value frequency 
responses for the prefilter and feedback com¬ 
pensators, respectively. The two controllers 
exhibit high gain at low-frequency which is 
provided by the integrators in W. Since the 
controller gain rolls off (at 20 dB decade~') from 
steady state, both compensators have acceptably 
low bandwidths. Figures 8(a) and (b) give closed 
loop responses for the inputs 

«(,)-['] and 

respectively, when t, = = 1.0 minute and 

when fc, = *2 = 1.0, *j = *2==0.8, *, = *2 = 1.2, 
*, = 0.8 and *2 = 1.2, and *i = 1.2 with *2 = 0.8* 
As a final simulation study, we computed a large 
set of step responses corresponding to and T 2 
in the range of 0-^ T|, T2^ 1 minute, and *, and 
*2 in the range 0.8 ^*1, *2^1.2. To conserve 
space, these curves are not illustrated, but they 
are leminiscent of the results in Fig. 6. 

5. CONCLUSIONS 

The aim of this paper is 10 expand the utility 
of optimization procedures in the design of 
TDF control systems. We present two new 
design techniques which are based on the TDF 
paiametrization theory of Youla and Bongiorno 
(1985) and the loop shaping ideas presented in 
McFarlane and Glover (1992). A key requirement 



lime - ininlites tune minutes 

Fig. 8 Closed loop step responses with uncertain actuator gains and nominal time delays of r, = r 2 » 1.0 

minutes 




168 


D. J. N. Limebeer et al. 


of both methods is one of robust stability 
optimization in the face of normalized coprime 
factor perturbations. Georgiou and Smith have 
shown that coprime factor robustness is closely 
related to the idea of robustness in the gap 
metric (Georgiou and Smith, 1990). These 
authors are currently developing a TDF design 
procedure which is based on their gap metric 
ideas (Georgiou and Smith, 1991). 

In order to explicitly address the problem of 
meeting closed loop time response requirements, 
our design methods seek to minimize or at least 
bound quantities of the form ||(I- 
GK 2 )~^GKi -MoIU in which the target Mo has 
idealized time response properties. This gives a 
nominal level of model matching. If the 
minimization process is expanded to include 
other terms, one can bound ||(1 — 
GpK 2 )''G^K, - MolU for all plant models Gp in 
some model set These ideas are explained in 
Section 2.2 of the paper where we consider 
model sets generated by perturbed normalized 
coprime factors and multiplicative perturbations 
at the plant input. There is no difficulty 
associated with modifying these ideas to include 
additive perturbations and multiplicative pertur¬ 
bations at the plant output. Our design 
procedures are given in Section 3. 

Section 4 describes a comprehensive design 
study for a high purity distillation system. This 
work shows that our new design procedures may 
be used to meet a demanding mixture of robust 
stability and robust performance specifications. 
The TDF techniques have proved to be 
particularly useful in problems with explicit time 


response specifications which have to be met for 
a wide range of plant parameters. 
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tracking and disturbance rejection at the nominal parameters of the plant, 
and to maintain the last three items in spite of variations and lor 
uncertainties of some "physical” parameters. 
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Abatract —In this paper the robust asymptotic tracking and 
disturbance rejection problem is solved for linear time- 
invanant systems whose matnees are assumed to depend on 
some parameters, each of which possibly affects all the 
elements of the matrices describing the system, thus playing 
the role of a '‘physical’' parameter It is assumed that 
reference commands exist only for some of the controlled 
outputs (i e. that some scalar outputs must only be 
regulated). For such outputs the row by row decoupling at 
the nominal parameters is also obtained Both the conditions 
for the CKistence of a solution and a design procedure of the 
compensator arc given, the latter enabling us to satisfy some 
performance requirement in some (possibly “large”) subset 
of the parameter space (with the help of the existing robust 
stabilization procedures) 

1 INTRODUCTION 

The problem of the asymptotic tracking and 
disturbance rejection of a linear multivariable 
system subject to unmeasurable disturbances was 
studied by many authors (see e.g. Basile and 
Marro, 1991; Basile et ai, 1987; Davison, 1976; 
Davison and Goldenberg, 1975; Desoer and 
Wang, 1978; Francis, 1977; Francis and Won- 
ham, 1975a, b, 1976; Grasselli and Nicol5, 1971, 
1973a, b; Staats and Pearson, 1977; Wonham, 
1974; Young and Willems, 1972) and contribu¬ 
tions were given even for the case of periodic 
systems (Grasselli et al., 1979; Colaneri, 1990; 
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Grasselli and Longhi, 1991b). In most of these 
contributions parameter uncertainties were taken 
into account, consisting of small (or possibly 
large) independent perturbations of all the 
elements of matrices describing the system: it is 
required the compensator to robustly maintain 
stability, asymptotic tracking and output regula¬ 
tion in spite of them; in most cases it is assumed 
that all the controlled outputs must track 
corresponding reference commands. In the last 
decade, an increasing number of contributions 
appeared on the problem of robustly maintaining 
the asymptotic stability (and possibly, the 
fulfillment of some performance requirements) 
for “large” and possibly prescribed perturbations 
of the parameters of the system, described in 
different forms (Ackermann, 1980; Barmish, 
1985; Bernstein and Haddad, 1990; Doyle and 
Stein, 1981; Dorato, 1987; Francis, 1987; 
Galimidi and Barmish, 1986; Gu et al., 1991; 
Khargonekar et al., 1990; Keel et al., 1988; 
Petersen and Hollot, 1986; Safonov et al., 1981; 
Siljak, 1989; Vidyasagar and Kimura, 1986; Wei 
and Yedavalli, 1989; Willems and Willems, 1983; 
Zhou and Khargonekar, 1988; and the re¬ 
ferences therein). 

More recently, the problem of the output 
regulation under perturbations of “physical” 
parameters affecting the description of the 
system was solved (Grasselli and Longhi, 1991a). 
It was shown that robust solutions may exist 
even when no solution exists for wholly 
independent variations of the entries of matrices 
describing the system, e.g. when the physical 
structure of the plant represented by the system 
prevents Davison’s condition (Davison, 1976; 
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Davison and Goldenberg, 1975) from being 
satisfied. The aim of this contribution is 
two-fold: first to extend the results of Grasselli 
and Longhi (1991a) (with the same kind of 
parameter dependence) to the asymptotic 
tracking of some reference signals, possibly for a 
proper subvector >*(/) of the controlled output 
vector y(t) (i.e. it is assumed here that for some 
components of y{t) there may be no reference 
signal to be tracked) for the case in which the 
above-mentioned condition in Davison (1976) 
and Davison and Goldenberg (1975) is not 
satisfied; second, to add to the above problem a 
requirement of decoupling at the nominal 
parameters (together with some classical SISO 
requirements on the transient behaviour) and 
some global performance requirement about y(l) 
to be robustly satisfied in a (possibly “large”) 
subset of the parameter space, with the help of 
the existing robust stabilization design tech¬ 
niques. In Section 2 the results in Grasselli and 
Longhi (1991a) are extended to the first 
problem, and to the case when some other 
measurable outputs are available in addition to 
y(0. In Section 3 some performance require¬ 
ment is added to the same problem, and a design 
procedure is presented in order to give a solution 
to the second problem. 

2. ROBUST OUTPUT REGULATION AND 
TRACKING UNDER UNCERTAINTIES OF 
PHYSICAL PARAMETERS 

Qmsider the linear time-invariant system S 
described by 

1-1 

( 2 . 1 a) 

y(r) = C(P)x(t) + D(P)u(t) + 2 

1-1 

( 2 . 1 b) 

y(() = C(P)x(t) + D(P)u(t) -F i W)d,(t), 

l-J 

( 2 . 1 c) 

where feR is time, jr(l)gR" = :A’ is the state, 
u(t)eR^=:U is the control input, di(t)e 
R*"'(i = 1 , ..., fi) are the unmeasurable and 
unknown disturbance inputs, >(i)€R‘' = :y is 
the output to be controlled (which is assumed to 
be measurable), y(f) 6 R^ is the vector of the 
additional measurable outouts and A(P), B(P), 
C(p), Dip), CiP), DiP), M,iP). N,iP), 
(' = 1 ) • • ■» m) nre matrices with real 
entries depending on a vector P of parameters, 
which are subject to variations and/or uncertain, 
^ € fi c: R*. liie dependence on P in ( 2 . 1 ) seems 


to be able to describe all kinds of uncertainties 
in modelling a linear time-invariant process 
through state-space equations, e.g. imperfect 
knowledge, or perturbations, of “physical” 
parameters. The nominal value Po of p is 
assumed to be an interior point of the set £2 
which is assumed to be bounded, and the values 
of AiP„), BiPo), CiP„), Dip,), CiP,). DiP,), 
MiiPa). Niipa), W,) (/ = I, ■ • • , iu) will be 
denoted simply by A, B, C, D, C, D, M,, N,, 
(i = 1,... , |i). It is assumed that each of the 
first q components >> 1 ( 0 . ■ • • . yqiO of yit) must 
track the corresponding component of the 
reference vector r(<) 6 R’, q^q. Therefore the 
error signal e(I) e R* for S is defined by 

e(0:=Vr(0-y(f). V:=[I 0 ]'. ( 2 . 2 ) 

It is also assumed that the classes St of 
reference signals r( ) to be asymptotically 
tracked and % of disturbance functions d/(-) (i = 
1 ,. .. , ^) to be asymptotically rejected are of 
sinusoidal-exponential type and defined as 
follows: 

0 9 * 2 ©-•(2.3a) 
g?,. ;= {r(-):r(0 = 6 e“''-t- 6*e“‘'V/s:0, 6 eC’} 

i = (2.3b) 

a, := idii ); diit) = de"" -F 6 *e'"’' Vf > 0, 

deC'”'} i = l....,n, (2.3c) 

for some positive integer and some 

a,eCii = \,...,n), where * means complex 
conjugate. Notice that the triplets (M,( ), N,( ), 
/V;( )) and (A/y( ), Ny( ), N,i-)) may actually 
coincide for some i, j, i ^ j. In order to rule out 
trivialities, the a,s (i = 1, . . . , /x) are assumed to 
be all distinct, and to have all non-negative real 
and imaginary parts, while the triplet 
(M,( ), N,i-), N,i')) is assumed to be non-zero 
for each i = ft + I,... , fi. 

The problem of designing a control system for 
S so that suitable requirements are satisfied, will 
be studied under the following technical 
assumption. 


Assumption 1. There exists a neighbourhood 
c £2 of p, such that all the entries of Aip), 
BiP), CiP), DiP), CiP), Dip) are continuous 
functions of P inand such that 


CiP) DiP)i D} 


CiP) DiP). 

= :rf,y/pe^„, i = l,...,p. 


(2.4) 


Lemma 1. If there exists a neighbourhood 
^0 (= £2 of some ^ ^ such that all the entries 
of AiP), BiP), dP), DiP), dP), DiP) are 
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continuous functions of in then there 
exists an interior point jSo of such that 
Assumption 1 holds. 

The proof of Lemma 1 is reported in the 
Appendix. By Lemma L Assumption 1 is 
implied by the mere continuity of the matrices 
involved, provided that the nominal value of 
P can be properly chosen adequately near some 
initial choice of it. Therefore, Assumption 1 
seems to be reasonable for physical plants. 

Denoting with y(0:=l>''(0 y'iOY the vector 
of all measured outputs, the linear dynamic 
compensator K to be designed for S is 
represented by 

^(0 = Pw{t) + Qr(t) -h RyU)^ (2.5a) 

u(t) = Hw(t)~\-Jr{t), (2.5b) 

with w{t)eW' and P, Q, R, H and J real. 
Define also 

Cifi):=[CiP) C'if))]', 
6iP)=[D'iP) D'(/9)]', 

N'M', 

« = 1 , ( 2 . 6 ) 

and let Cf/3o), be denoted simply 

by C, N, (i = I, , /u), respectively. 

The following robust tracking and regulation 
problem will be considered m this section: find if 
it exists, a linear dynamic compensator K 
described by (2,5), such that: 

(a) the overall control system I! described by 
(2.1) and (2.5), is asymptotically stable at the 
nominal parameters ot system S, i e. for = /Jq; 

(b) the error response e(t) asymptotically goes 
to zero for each disturbance function i/,( ) e 
Q},{i = \,...,p) and for each reference signal 
r( ) e 9?, for P = /a,,; 

(c) properties (a) and (b) are preserved for all 
P in some neighbourhood c Q of jSq (or 
possibly in some “large” subset 'P of U 
containing jSo an interior point). 

In the following, such a problem will be called 
Problem 1, and property (b) will be called 
asymptotic tracking and disturbance rejection. 


Theorem 1. There exists a solution K of 
Problem 1, undei Assumption 1, if and only if: 

(i) the triplet (A, C) is stabilizable and 
detectable; and, 

(ii) there exists a neighbourhood £2 of j3o 
such that 


Im 


LW)J L CiP) DiP) 
VjSeV, 1 = 1, 


] 


(2.7a) 


V/aeVfc i = h...,p. (2.7b) 

Proof. (Necessity) Property (a) trivially implies 
condition (i). By virtue of (a), (b) and (c), the 
steady-state responses of £ to the disturbance 
functions di(-) e Sdi {i = \ , , p) and reference 

signals r( )e3J written for x{t), u{t) and e(t), 
imply condition (ii). 

(Sufficiency) Define 

-n, 1 = 1,...,;!, (2.8) 

where p, ^0 by the stabilizability of (A, B) and 
the assumptions about nr, (i = 1, . . . , p). 

Condition (i) implies that system 5 can be 
stabilized at the nominal parameters by a linear 
dynamic feedback from the measured output 
y{t). Let such a stabilization be obtained through 
a dynamic control law of the following form: 

I(r) = {A- Le)l(f) + (fi - L6)u{t) + l;(0, 

^(OeR”, (2.9a) 

u(f) = f$(f) + «.(0. «c(0eR'’. (2.9b) 

i.e. feeding back through a constant feedback 
gain matrix F the state ^(r) of an estimator of 
x(t) for d/( ) = 0, i = I, . . ., p, in which, by 
virtue of (i), the eigenvalues of (A - LC) are 
obtained to he in the open left half-plane, as well 
a^ those of (A + BF). Then, it is easily seen that 
the closed-loop system 5, described by (2.1) and 
(2.9) of dimension nc :=2n, having 1/^(0 as 
control input and y(t) as the only measured and 
controlled output, and described by equations 
similar to (2.1a) and (2.1b) but involving u^t) as 
control input instead of u{t) and some matrices 

AAP), B,(P), Q(/9), DM N,mii = 

1, . . . , iu) instead of the corresponding matrices 
in (2.1a) and (2.1b), satisfies the following 

condition (where a notation similar to the 
corresponding one defined for S is used for the 
matrices describing at j8 = jSo). 

Condition 1. Assumption 1 and condition (ii) of 
the theorem, rewritten for the matrices describ¬ 
ing Sc, hold for the same neighbourhoods 

the triplet (A^, Bc» CJ is stabilizable and 
detectable, and the following relation holds: 

rA,-a,/ Be] , . 

'“4 c, dJ""-"'’" . 

( 2 . 10 ) 

If, in particular, y(t) vanishes (i.e. no 

additional measurement is available) and/or the 
pair (.4, C) is detectable (i.e. no additional 
measurements y{t) are needed in order to 
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Fto. 1. The series connection Sj of . ../fju, 

and Sc- ' 

Stabilize system S) the dynamic control law (2.9) 
is not yet needed and can vanish, so that Sc can 
coincide with S, while preserving Condition 1. 

It will now be shown the existence of /x 
sub-compensators Km,, ■ ■ ■ , Km„ such that, 
for each j = \,... , n, the series connection Sj of 
• • • . and -Sc. having the input 
Uy(f) e IR'’ of Km, as the control input and y(t) as 
output (see Fig. 1) and described by 

MO = 

+ tM,i(fi)M0, MOe^"’. u/f)elRP, 
(=1 

(2.11a) 


using for the matrices describing Sj-i at jS = /3o a 
notation similar to the corresponding one 
defined for S, 


rank 


j4y_i — aCfl 

C,-x 


(i.e. full column-rank). 


del [E, Ej\ # 0. 


(2.14) 

(2.15) 


If oy ^ R, define through (2.14) and (2.15) the 
real matrices E, and E, in the same way (and 
with the same dimensions) as for o) e R, and let 
XjMT be described by 


wXf) = IV,w,(0 + [0 0,]u,{t), 

H;(f)6R^'"', M^(f)eR^ (2.16a) 
«y-i(0 = E,w,{t) + [E, 0]«,(f), 

M,_^(0eR^ (2.16b) 

where £, is defined by 

£, = [e; 0 cj O-.-ef^ 0], (2.17) 


y(t) = C0)x,it) + DjiP)u,it) 

+ fsN,.W)d.iO, (2.11b) 
(-1 

satisfies the following conditions. 

Condition 2. The above stated Condition 1, 
rewritten for system Sj, its dimension n, and the 
matrices describing it, hold. 


in which ej represents the jth column of 
(i = 1, . . . , Pj), Wj is defined by 

“'ll, 

-(Oj CTyJJ’ 

(2.18) 

in which Oy and (Uy are the real and the imaginary 
parts, respectively, of a, (with (Oj=^0), and 
Oj e IS defined by 


H^ = diag|[ "1..., 
IL-co^ o,} 


Condition 3. There exists a neighbourhood 
c Q of Po such that 



c Im 


lAOT-Vl 
L C,(/5) J 

6,:=! if j ji, dj:=0 if / > ^. 


(2.12a) 

(2.12b) 


The proof of the existence of Km, ,, Km 
with the above stated properties will be carried 
out constructively by induction. Then, assume 
that 5^_i satisfies Condition 2 written for S,_, 
(for ;■ = 1 this is true if Sc is denoted with 51, and 
Mt:(r) with Uo(t)) If O; eR, let Km, be described 
by 


w,{0 = a,M0 + [0 I]ujit), 

Wy(f)6R'’>, My(0eR'’, (2.13a) 

U,_i(0 = £;IVy(0 + [£; 0]M,(t), 

«y_,(f)eR'’, (2.13b) 

where pj<p by (2.4) and (2.8), E,€W’'‘^'’~^i\ 
£y and are real, and £y and E, are such that, 



(2.19) 


Now, for both types (2.13) and (2.16) of 
subcompensator Km,, it can be easily seen that 
(2.14), (2.15) and the assumption that 5^_, 
satisfies Condition 2 (written for it) for the 
neighbourhoods c £2 of Po imply that S, 

satisfies Condition 2 for the same neighbour¬ 
hoods M'a, V*. In addition, for both types (2.13) 
and (2.16) of condition (2.12a) holds if and 
only if 



c Im 


L C,.,(P) 





eV, 


( 2 . 20 ) 


On the other hand, Assumption 1 on 5^_, and 
(2.14) imply the existence of some neighbour¬ 
hood Vq, c ipj of Po in which the dimension of 
the subspace in the right-hand side of (2.20) 
remains constant, whence by (2.4) rewritten for 
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Fig 2 The overall control system 2 for Problem 1 


5,_, and (.2.14), such that 

rv.(/3)-V 


Vfl.U/ 


( 2 . 21 ) 


This, together with (2.7) rewritten for 
proves ( 2 . 20 ) with = W(,y H 'P/,, and completes 
the inductive proof that, for each 7 = 1 , . . . , /i, 
Sj satisfies Conditions 2 and 3. 

Since, by Condition 2 , is stabilizable and 
detectable at the nominal parameters /Jo, there 
exists for it a linear compensator K, described by 


H-,(0 = £vv,(0 + Q,eit). (2.22a) 


= ( 2 . 22 b) 

with Py, Qs and real, such that the overall 
contro’ system S thus obtained, represented in 
Fig. 2, satisfies requirement (a). 

Denote by A^iP) the dynamic matrix of 2. 
Since, by virtue of Assumption 1, the coefficients 
of the characteristic polynomial of are 

continuous functions of P in there certainly 
exists a neighbourhood c Q of /^o such that 
the asymptotic stability of 2 is preserved for all 
This, together with (2.12) for each 
7 = 1 , . . . , jU, implies that asymptotic tracking 
and disturbance rejection are guaranteed for 2 
at p = po and for all e 'F, n W, D ^^2 H ■ • • n 
=: V, by virtue of Theorem 1 in Grasselli and 
Nicol5 (1976). 


Remark 2.1. If the stabilizing compensator Ky 
(and possibly, F and L in (2.9)) is designed 
according to any procedure which is able to 
guarantee the asymptotic stability of 2 not only 
in some neighbourhood of /3o, but in 

some “large”, and possibly prescribed region W, 
of Q (see e.g. Ackermann, 1980; Barmish, 1985; 
Bernstein and Haddad, 1990; Doyle and Stein, 
1981; Dorato, 1987; Francis, 1987; Galimidi and 
Barmish, 1986; Gu et al., 1991; Khargonekar et 
al.y 1990; Keel et al., 1988; Peterson and Hollot, 
1986; Safonov et al., 1981; Siljak, 1989; 
Vidyasagar and Kimura, 1986; Wei and Yeda- 


valli, 1989; Willems and Willems, 1983; Zhou 
and Khargonekar, 1988; and the references 
therein) and if (2.4) and (2.7) hold for some 
“large” subsets and of Q (see Grasselli 
and Longhi, 1991a, for a simple example of a 
physical process S for which = = £2) 

property (b) too can be preserved in some 
“large” subset of Q, and, possibly, in H 'F* H 
W,. In fact, asymptotic tracking and disturbance 
rejection are guaranteed for 2 at each value of p 
such that A^{p) is asymptotically stable and the 
following relations hold for (see Theorem 1 in 
Grasselli and N 1 C 0 I 6 , 1976): 


Im 


NJ/3)J [e.Kj 


cr Im 


which can be seen to be equivalent to the 
following ones, regarding 5, (i = 1 , . . . , p): 


Im 


U/J)J ‘ U/VJ 

\A,iP)-a,n . 

‘"’i c.ip) J' ' = '. 


(which are guaranteed to hold at each 
6 V, c n by the design procedure). 


Remark 2.2 The design procedure of K in the 
sufficiency proof of Theorem 1, valid under 
Assumption 1 and conditions (i) and (ii), can be 
summarized as follows. 


Step 1. If {A, C) is detectable, denote S with 
and use the notations in ( 2 . 11 ) with 7 = 0 for 
(2.1a) and (2.1b). If (A, C) is not detectable, 
choose F and L in (2.9) so that the spectra of 
both (A + BF) and (A - lC) belong to the open 
left half-plane; call 5o the system described by 
(2.1), (2.6) and (2.9) having u,{t) as control 
input and y(r) as output, and use the notations in 
( 2 . 11 ) with 7 = 0 for its description. 

Step 2. For each 7 = 1, .... p, if ay e R, or 
a, ^ R, define through (2.13) or, respectively 
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(2.16) , with pj defined by (2.4) and (2.8), and 
E„ £j, £„ Wj and 0, satisfying (2.14), (2.15), 

(2.17) , (2.18) and (2.19) for system S)-,; define 
the notations in (2.11) for the system Sj obtained 
by connecting and Sf-i- 

Step 3. Find Pj, Q, and H, for the compensator 
K, described by (2.22), so that the overall 
control system Z satisfies requirement (a). 


At Step 1 (2.9a) can be substituted by any 
kind of reduced order state estimator of S\ this 
preserves the validity of Condition 1 for S^. As 
mentioned in Remark 2.1, robust stabilization 
design procedures are obviously preferable at 
both Steps 1 (e.g. the one proposed in Doyle 
and Stein, 1981) and 3. 

It seems worth mentioning that the purpose of 
Step 2 is to choose K^, j = 1,..., p, so that S^, 

for each j9 in some neighbourhood ;= (^ V, c 

1=1 

of Po> is able to generate free output 
responses equal to the output responses of 5, 
(for the same value of P) to all disturbance 
functions di(-) e 3ii(i = 1,... , p) and to all 
signals Vr(-)f r(') ^ ^ (according to a simple 
form of the internal model principle (Grasselli 
and Nicold, 1971)) while maintaining for the 
properties of stabilizability and detectability at 
P = Po> enjoyed by system S^. The former 
property of is achieved by attaining (2.24), 
whence (2.23), to hold for all Thus, the 

meaning of Step 2 is to enlarge (as much as 
consistent with the stability of Z) the (partial) 
internal model of disturbance functions d,{ ) e 
= , p) and signals V'r(-), r( ) e Si, 

possibly contained in (e.g. an integrator if 
Sc = S and S contains a motor, and or, = U for 
some () by means of the series connection of the 
internal models .... Ku^, so that (2.23) 
holds for all in some (possibly large) 


Remark 2.3. Let Va (or Vj,) be the largest 
subset of Q such that Assumption 1 (or (2.7)) 
holds (it is related to the structure of the physical 
plant). Then a procedure for determining a good 
estimate c of Wo can be obtained as 
follows. Call 4'a =3 Vo the subset of Q in which 
all the elements of A{P), B(P), C(P), t>{P) are 
continuous functions of p. For each i = I, ..., p, 
through elementary row and column operations, 
properly modified in order to avoid multiplica¬ 
tions by functions of P which are not continuous 
in or which are null it P = Po, the argument 
matrix in the right-hand sides of (2.7) can be 
transformed into the following form; 


^uiP) 0 1 
0 


(2.25) 


where Ai^O) is a diagonal matnx whose 
diawnal elements are continuous functions of P 
in Va and are not null at ^ = Po, by Assumption 
1, the elements of A2iiP) are continuous 
functions of in and there exists a subset 
of ^'a, containing Po as an interior point, 
such that ^ 2 iiP) = 0 for all 6 (by the same 
Assumption 1). A subset can be obtained 
from 'Fa, by eliminating from 'Fa, all the values 
of p where det Ai,(/9) = 0. Then, 4^0 is obtained 

as 'J'a,. The computation of a good estimate 

/“I 

<=. 'Ffc of 'Ffc can be similarly obtained by 
making exactly the same elementary row 
operations on the argument matrices in the 
left-hand sides of (2.7). 

Remark 2.4. If q = q, it is easily seen that 
conditions (i) and (2.7b), written at P = Po. 
imply p, = q (i = 1, . . . , p), i.e. the well-known 
condition (see e.g. Davison, 1976; Davison and 
Goldenberg, 1975) 

, [A-a,I fll , ^ 

rankj^ C ‘ = 

(2.26) 

which, for p = p, under Assumption 1, is 
stronger than the existence of a neighbourhood 
'Ffc of Po such that (2.7a) holds. However (2.26) 
is not necessary, in general, if q <q, and cannot 
be satisfied if q> p, i.e. if system S has more 
regulated outputs than control inputs. In 
addition, when q^p, although condition (2.26) 
rewritten with p instead of p is generically 
satisfied, i.e. it is satisfied for almost all 
(A, B, C, D)-tuples (Davison and Wang, 1974), 
in practice the physical plant described by S may 
have a physical structure implying the violation 
of such a condition either because of a non-full 
row-rank Rosenbrock system matrix of S at 
P = Po and near Po, or because of one (or more) 
invariant zero it P = po (and, po.ssibly, near Pa) 
equal to one of the ar,s characterizing 3), and SI. 
In these cases, as well as when q>p, (2.26) 
written with p instead of p is violated for some 
or for all is, so that a fully redundant internal 
model of reference signals r(-)eSi and distur¬ 
bance functions d,(-) 6 3, (i = 1, . . . , p), put 
in K (as in Davison, 1976; Davison and 
Goldenberg, 1975), is not compatible with the 
stability of Z. Hence in such cases only a 
solution K or Problem 1 exists, if Assumption 1 
and conditions (i) and (ii) hold, although (2.26) 
written with p instead of p does not hold and the 
above-mentioned fully redundant internal model 
of disturbance functions and reference signals 
cannot be used in K. 
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Remark 2.5, Statements similar to Theorem 1, 
with proper amendments of condition (i), hold 
for the case when requirement (a) is strength¬ 
ened by prescribing the degree of exponential 
stability of 2 at jS =5 )9o, or even the spectrum of 
and, correspondingly, requirement (c) is 
properly tightened, and/or system 5 is substi¬ 
tuted with a discrete-time one, possibly with a 
dead-beat convergence requirement, as in 
Grasselli and Lx)nghi (1991a) where the special 
case q = q = 0 was studied. 

3. ADDITIONAL DECOUPLING AND 
PERFORMANCE REQUIREMENTS 

The design procedure reported in the previous 
section allows to find a solution of Problem 1, 
even when, for q<q, (2.26) is not satisfied, 
since it is able to deal with the kind of parameter 
dependence assumed here for 5, described by 
(2.1), which can really model the actual 
dependence of a physical plant on wSome 
“physical” parameters. Although this can appear 
advantageous with respect to other existing 
methods, it seems not to allow, in general, to 
satisfy performance requirements on the tran¬ 
sient behaviour of 2. 

In the case, usually considered, when (2.26) 
rewritten with instead of fi holds (which is 
necessary if q = q and fi = the so-called row 
by rov/ decoupling of 2 at can be easily 

achieved together with requirements (a), (b) and 

(c) (see e g. Chen, 1984). In such a case, also 
some performance requirement can be satisfied, 
following the approach in Doyle and Stein 
(1981). 

When, on the contrary, q<c! and condition 
(2.26) does not hold, a solution to the problem 
of satisfying all the above mentioned require¬ 
ments is not available, at the authors’ knowl¬ 
edge. In this connection notice that, if q >0, the 
row by row decoupling at the nominal 
parameters of S, as far as the vector y(t) of the 
first q components of y{t) is concerned, can very 
often be seen as a desirable, although not 
unrenunciable, feature 01 2. Hence, it seems 
reasonable to try to satisfy this requirement for 
2 togethei with (a), (b) and (c): 

(d) under the assumption q > 0, the transfer 
function matrix from r(l) to y(l) at the nominal 
parameters of S (i.e. at /3 = j8o) denoted here 
with W(5), is diagonal and nonsingular. 

In addition, the following requirements will be 
considered for 2: 

(e) under the assumption ^ > 0, the diagonal 
elements of W{ja)) and/or the corresponding 
scalar step responses satisfy some classical SISO 
requirements on the transient behaviour, such as 
bandwidth or risetime, overshoot, etc.. 


(f) some performance requirement on 2 , such 
as the one considered in Doyle and Stein (1981) 
and specified later on (see the subsequent 
relation (3.10)) is satisfied for all P in the same 
set as in (c). 

In the following, Problem 1 with the 
additional requirements (d), or (d) and (e), or 
(d), (e) and (f), will be called Problem 2 , or 3, or 
4, respectively. 

Theorem 2. If ^ > 0, there exists a solution K of 
Problem 2, under Assumption 1 , if and only if 
conditions (i) and (ii) of Theorem 1 hold. 

Proof, Conditions (i) and (li) are necessary by 
Theorem 1. As regards sufficiency, firstly apply 
the design procedure described in the sufficiency 
proof of Theorem 1, with the only amendment 
of substituting e{t) in ( 2 . 22 a) with eo{t)eR^, 
defined by 

eoO) •'= - y(t), ro(t) e (3.1) 

so that the scheme of the control system in Fig. 2 
is modified by the substitution of e(t) and r(i) 
with eo(i) and r()(0i respectively. Call 2nj instead 
of 2, the control system thus obtained. Call 
Wo(r) the 9 X ^ transfer matrix of 2o from ro(l) 
to PU) at jS = Po- 

Obviously, if ro(t) = r(t), then eo(0=^(0- In 
this case, by the sufficiency proof of Theorem 1, 
2 o satisfies requirements (a), (b) and (c); call 
Vot instead of V, the neighbourhood of Po in 
which (a) and (b) are preserved. Properties (a) 
and (b) imply that, for each i = 1 , . . . , /i, 
lV(,(ar,) = Wo{a*) = /. Therefore, the rational and 
square matrix 1 V()( 5 ) has (normal) rank 9 and has 
no loss of rank at a, and or*, i.e. no transmission 
zero at U/ and or* (r = 1 , . . . , / 2 ). 

Now, let 

Z(s)T-\s)^Wo{s\ (3.2) 

be an irreducible matrix fraction description of 
Wo(5), i.e. let Z(s) and T{s) be q x q right 
coprime polynomial matrices, with T(s) nonsin¬ 
gular, such that (3.2) holds. By the stability of Zq 
Bt P = /Sq, det T(s) has no root in the closed 
right half-plane, whence the invariant polyno¬ 
mial of T{s) of the smallest degree, denoted 
here with C(j), is a Hurwitz polynomial. Define 
f(j) := r(j)/C(j). By the nonsingularity of 
Ho(5), Z{s) is nonsingular; since Wo(s) has no 
transmission zeros at or^ and orf , Z(s) has no loss 
of rank at or, and cr* (i = 1 , .. . , /i). For each 
i = 1 , . .. , q, call ei(s) the monic greatest 
common divisor of all the elements of the ith 
row of Z(5), and denote with Z{s) the 
nonsingular polynomial matrix obtained from 
Z{s) by dividing by E/(j) all the elements of its 
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ith row (i = 1,..., fl) so that 

.^}2Wf-V), 

(3.3) 

where Z(s) and f(s) are right coprime by the 
right coprimeness of Z(s) and T(s), Z(s) has no 
loss of rank at or, and a* (i = 1,..., fi) and 
det f (s) is a Hurwitz polynomial. 

If detZ(s) is not a Hurwitz polynomial, 
denote with yi(s) the monic least common 
denominator of the poles of the ith column of 
which lie in the closed right half-plane 
(i = 1,..., q). Denote with <p(i') the monic 
polynomial having a, and a*, i = 1, . . . , /i, as its 
only roots, all simple ones; and consider the 
rational matrix 

fW2-V)di»8{^. 


If such a matrix is not strictly proper, choose q 
monic Hurwitz polynomials 9>i(i), • • ■ , q>q{s) of 
suitable degrees so that the rational matrix 

Gi^o{s):=ns)Z-\s) 

A- f y»(^) 1 

l(p(s)g),(j) ’ " ■ ’ <p(j)<p^(j)r 

is strictly proper. Let K^o be a reachable and 
observable compensator having Gmd{s) as 
transfer matrix and TmoO) e R’ as input. By 
construction, the only unstable poles of Gmd{s) 
are the roots of «p(s), which are not transmission 
zeros of Hence, by a straightforward 

extension of Corollary 9.4 in Chen (1984) to 
detectability and stabilizability, the series con¬ 
nection Zo of Kmo followed by Zo is stabilizable 
and detectable from y{t) at ^ = jSo. 

By (3.3) and (3.4), the transfer matrix Go{s) 
of Zp from r^off) to y{t) at p = /3„ is diagonal 
and nonsingular, and expressed by 


;d(j) = diag { 


_fi(s)yi(j) 

C(j)(p(j)<p,(s) 

’ ■ ■ ■ ’ ^(j)(p(j)<p, (j) 


(3.5) 


Therefore, consider the control system de¬ 
picted in Fig. 3, and choose a compensator K,d 
with a diagonal transfer matrix G,o(s), such that 
the overall control system Z is asymptotically 
stable at j8 = Po- Th® existence of such a K,p is 
guaranteed by the detectability and stabilizability 
of Zp at |3 = Pa and by the diagonality of Gpis) 
dit P = Po- Notice that the ^ numerators 
XiC'*). • • • < of f**® diagonal elements of 
G^d(s) must be nonzero at a, and a*(i = 

1.. ... p), otherwise a cancellation with (p(s) 
should prevent Z to be asymptotically stable at 
P = Pa. With this choice of K,d, requirements 
(a) and (d) are met. By the same reason as in the 
proof of Theorem 1, (a) is preserved in some 
neighbourhood W, c £2 of ^o- 

Since, for each i = 1,.... Yii^) *nd q)(s) 
are coprime, ;(i('*) '"^nd <p(j) are coprime, and the 
CT/S (/ = 1, ..., jii) are neither roots of det f (^) 
nor of detZ(j), it is readily seen that (p(s) 
divides all q denominator polynomials in the 
Smith-McMillan form of G(i) := Gmo(j)Gjd(j) 
(e.g. use the valuations of G(i) at ary and 
a*, y = 1, . . . , A (Kailath, 1980)). By well-known 
results (e.g. Francis and Wonham, 1975a; 
Grasselli and Nicol6, 1971) this proves that 
e(t) := r(t)-y(t) asymptotically goes to zero for 
each reference signal r( ) 6 98, for each d,( ) e 

2)((f = 1, ... , A) and for d,( ) = 0, i = A + 
1, at P = Po and for all In 

addition, for each PeV^, for each reference 
signal r(-) e 98, for each d,(-) e Si/ (i = 

1.. .. , A), and for dj(') = 0, i = A + 1. • • ■ > 

the response r(,( ) is the sum of a signal roa( ) 6 98 
and a free response rob( ) of Z corresponding to 
the same value of p. Therefore, the property of 
Zo to satisfy requirements (a), (b), and (c) in 
some neighbourhood of Pa, implies that for 
each r(-) e 98, for each d/(') £ 2, (i = 1, , A)> 

for d,( ) = 0(i = p + l, . . . , /i) and for each 

the response Co(0 asymptotically 
goes to zero, as well as the response e(t) does. 
Hence e(t) asymptotically goes to zero for each 
r( )e98, for each di( ) e 2i, (i = 1, . . . , A), for 
d,(.) = 0(j = A +f. ■ ■ • • fr) and for each /3 e 
Wonv,. 



Fig. 3. The overall control system I for Problems 2, 3 and 4. 
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Lastly, notice that for each /3 € the 

steady-state response of Z to any r( ) e and 
di{') e 0, (i — 1, . . , , /i) is unique. Therefore, 
the property of So to satisfy requirements (a), 
(b) and (c) in Vo implies that for each 
d,{‘) e 3, (i == jU + 1,. . . , for r( ) = 0 and for 
each /SfiVonVp, ro(r) and e^it) asymptotically 
go to zero, whence >>(0 does. 

This proves that requirement (b) is satisfied at 
j9 = /3o, and preserved, together with (a), in the 
neighbourhood V : = Vo H V, of A,. 

Remark 3.1 Requirements (a) and (b), as far as 
the only y(0 and the only reference signals 
r( ) e and disturbance functions dX ) 6 21,(1 = 
1, .... (i) are concerned, are preserved for all 
P e V,-, by virtue of the, say, “^-dimensional'’ 
fully redundant internal model of such d,{‘) and 
r() contained in through (p{s). 

Requirement (b) for ■ ■ » ></(0 and/or 

for disturbances dX ) € % (i = fi 1, . . , ju) is 
preserved only for all ^SeVonV^. If, although 
^ < /i, (2.7b) holds with pL instead of jU, then the 
same design procedure of ensures W^)(j) to 
have no transmission zeros at a, and a* for each 
1 = 1,...,^, thus allowing factors (.y - a,) and 
(a’ - a*) to be contained in (;p( 5) also for 
i = /i + 1, . . . , In such a case, (b) is preserved 
for y{t) for all p 6 V^, and it is preserved for 
3',-+i(0.-for all /3 e Wo n V,. 

The approach here followed in the design of 
Gmp{s) for decoupling 2^, at = P^ is the same 
as in Chen (1984). Notice also that the design of 
the diagonal elements of G,p{s) can be 
performed with the help of classical SISO 
techniques (Bode plots, etc.) in order to satisfy 
for Z also the additional requirement (e), thus 
providing a solution not only of Problem 2, but 
also of Problem 3. Notice also that the role of 
f(j) and c,(5) (i = 1, . . . , ^) and the use of the 
choice of (p,(j) (i = 1, . . . , ^) can be that of 
reducing the control effort of “cancelling” in 
part Zo- 

As regards Problem 4, assume that, in the 
design of Zp, V^ has been widened to some 
“large” closed (and possibly, prescribed) subset 
Vj of Q through the use for K, of some robust 
stabilization design procedure, as mentioned in 
Remark 2.1. The transfer matrix Gp{p, j) of Xp 
from rMp{t) to y(r) is square and nonsingular at 
P = Po (see Go(*s) in (3.5)); since Gp{p, s) is 
strictly proper (by the choice of gPi(^),.. . , 
()P^(j)) and has the same number of unstable 
modes for all /8eVj, then for all /JeV^ the 
perturbations Gp(Pj s) of the nominal Gp{s) 
satisfy the assumptions in Doyle and Stein 
(1981). They can actually be expressed in the 
form of “unstructured” uncertainties at the 


output y(r), namely 

Gp{P.s)^[I-^A{p.s)]Gu(sl (3.6) 

for some A()3, s) which can be computed. If the 
elements of A{P), P(p)j C(p) and ^(P) are 
continuous functions of p in V,, then A(P,s) 
satisfies a bound of the form 

a[A(/Jwco)]</,„(ai), V/SeV,, 

(3.7) 

for some /,„((u) which can be computed, where d 
means maximum singular value. Therefore, the 
design of the diagonal elements of G,f,(s) can be 
performed on the basis of the diagonal Gp{s) 
expressed by (3.5) and of /^(tw), with the help of 
SISO techniques, not only for stabilizing Z at 
P = Pi) and for satisfying requirement (e), but 
also in order to satisfy the following conditions 
(Doyle and Stein, 1981), whose fulfilment is 
made easy by the diagonality of G/;,(5) and 

Gsp(s): 

a[Gnija))G,o{ja))(I + Go(/a>)G,i,(/a»))"'] 

<i/L(o)) Va)e[0,=o], (3.8) 

frlGD(/to)C,o(/a))] s/7j(w)/(l - /„(<«)) 

Vw: l„{(o) < 1, a[GD(>w)G,D(/w)]»1, 

(3.9) 

(where g means minimum singular value) for 
some ps(a)) characterizing a performance re¬ 
quirement on y{t) of the form 

a((/ + Gd(P, jw)G,oij(o)) ■'] 1//7a(w) 

Vco € [0, Wo], for some Wp > 0. 

(3.10) 

Thus, relation (3.8) will guarantee the 
asymptotic stability of Z for all jS e V, (i.e. 
V, = V,), and relation (3.9) will guarantee the 
fulfillment of the performance requirement 

(3.10) for all p e V,. In this way all requirements 
(a), (b), (c), (d), (e) and (f) are met; in 
particular, requirement (b) remains valid in 
VonV, = Vo, while, for dX) = 0{i = fi + 
I, . . . , fi), as far as y(t) is concerned, it is 
preserved in all V^. 

The main loop in Fig. 3 could be called the 
tracking loop, while the inner loop could be 
called the regulation loop. The order of K can be 
reduced if (2.26) holds. 

Remark 3.2. Taking into account the discussion 
of Problem 4 contained in the previous remark, 
together with the sufficiency proof of Theorem 2, 
if 9 > 0 and Assumption 1 and conditions (i) and 
(ii) hold, a solution K of Problem 4 can be 
obtained through the following design 
procedure. 
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5n^ 1 and 2. As in Remark 2.2. 

Step 3. Find a compensator K, so that the 
system Xo in Fig. 3 is asymptotically stable in 
some “large” and closed subset 'Fj of Q 
containing Po as an interior point (e.g. Doyle 
and Stein, 1981; Bernstein and Haddad, 1990). 

Step 4. Find an irreducible right polynomial 
factorization Z(s)r~*(.r) of the transfer matrix 
of Xo from ro(0 to y(t) at P = Po (Chen, 
1984; Kailath, 1980). 

Step 5. Compute the g.c.d. C(‘^) of all elements 
of r(j), and, for each i = l,..., q. the g.c.d. 
£,(f) of all the elements of the (th row of Z{s). 
Define f(j);= r(j)/^(j), and Z(s) through 
(3.3). 

Step 6 . For each J = 1,. . . , q, compute the least 
common denominator y)(r) of the poles of the 
y'th column of Z~^(s) which lie in the closed right 
half-plane. Compute the monic polynomial q)(s) 
having a/ and a* (i = 1,.. . , p if (2.7b) holds 
with p instead of p; i = 1,p otherwise) as 
its only roots, all simple ones. Choose q Hurwitz 
polynomials q?i(f),.... (p^(s) of minimal de¬ 
grees so that matrix Gmi,(s) in (3.4) is strictly 
proper. Find a minimal realization Amo 

Step 7. Compute the transfer matrix G^iP, .r) of 
Xd from to y{t), and the bound l„{a)) in 

(3.7) for ii(P,jo)) with the help of (3.5) and 
(3.6). 

Step 8. Find the diagonal elements of the 
diagonal matrix (7,^(5) with the help of SISO 
techniques so that the single loop requirements 
in (e) are met together with (a) and conditions 

(3.8) and (3.9) (rewritten in terms of the 
diagonal elements of Gd(/<u) and G,o(ytij)). 

Step 5 only has the purpose of reducing the 
cancellations needed between Wo(.r) and G^ois)- 
It can be omitted and substituted with the 
following one. 

Step S'. Put Z(j) = Z(s) and f(j) = T(s). 


4. CONCLUSIONS 

The method here presented allows us to 
obtain decoupling at the nominal parameters 
and a robust asymptotic tracking and disturbance 
rejection and to robustly satisfy a performance 
requirement in the case when it is known the 
actual dependence of the plant to be controlled 


on some “physical” parameters, and the scalar 
outputs which must track corresponding re¬ 
ference signals are a proper subset of the set of 
the controlled outputs (in which case no solution 
may exist with the existing methods). 
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APPENDIX. PROOF OF LEMMA 1 

Define 



A{P)-sl B(py 

cm omr 


(A.l) 


r(i5) diag {5(0,. P), sm, P) .5(cr,. p)). 

(A.2) 


The lemma will be proved by showing that the hv^thesis 
implies the existence of an intcnor point p^ of and a 
neighbourhood of /3u> ^ such that 

rank r(/3) = rank r(/5o) VjSeV,. (A.3) 

Now, by the continuity of A(P). fl(/5), C(P) and D(j9) in 
if (A.3) is violated for ®ud for all 

neighbourhoods of ^there exists /9, 

arbitrarily near such that 

rank r(^,) > rank r(^o). (A.4) 

and for which there exists a neighbourhood 'P, of 
/3,,'P, c^Pfl, in which the elements of A(j0), B(P), C(/3), 
D{p)> ^iP) ^iP) continuous. 

The proof is completed by induction, by noting that the 
rank of r(^) is bounded for all /£! c since Vm has finite 
dimensions. 
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Robust Stabilization of Feedback Linearizable 
Time-varying Uncertain Nonlinear Systems *t 

RICCARDO MARINOt and PATRIZIO TOMEI* 

A global robust stabilizing state feedback controller is provided for a class 
of single-input nonlinear systems with bounded unmodelled time-varying 
disturbances entering nonlinearly in the state equations. 


Key Wordo—Robust stabilization, nonlinear systems, feedback linearization, self-tuning stabilization 


Abatnct—We consider single-input nonlinear systems with 
unknown unmodelled time-varying parameters or distur¬ 
bances whose bounds are known Assuming that the 
undisturbed system is globally feedback lineanzable and that 
a tnangularity condition holds for the uncertain terms we 
design a robust global stabilizing state feedback control 
Disturbances are not required to enter linearly in the state 
equations When they do enter linearly, the stabilization 
problem can be solved without knowing bounds on 
disturbances by using a self-tuning version of the robust 
control In particular, any linear system in controller 
canonical form perturbed by unknown global Lipschitz 
nonline an tics satisfying triangularity conditions is shown to 
be globally stabilizable by a fixed dynamic state feedback 
compensator whose order equals the state space dimensions 

1 INTRODUCTION 

We consider single-input nonlinear systems 
x=f{x) + q(x, e{t)) + g{x)u 
= f{x,e{t))+g{x)u, (1) 

X e R'*, mg/?, 61 g Q cz R^, 

where 6(t) is a vector of unknown, time-varying, 
piecewise continuous parameters or disturbances 
which takes values in the compact set Q c 
gi^) IS a smooth vector field with g(jc) 

0, Vjc G/?”,/(jc) IS the nominal or undisturbed 
smooth vector field 

f(x)=f(x, (2) 

with djsj the nominal constant value of 6. The 
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function 


q(x, 6(0) =f(x, 6(0) -f(x, 6s), (3) 

is assumed to be smooth and contains all the 
uncertainties in the system, including distur¬ 
bances and uncertain nonlinear terms. We 
assume that there exists an isolated equilibrium 
point (taken without loss of generality to be the 
origin) which is not affected by the vector d{t), 
i.e. /(0) = 0 and ^(0, 0) = 0, e We ad¬ 
dress the local (global) robust stabilization 
problem, i.e. the design of a smooth state 
feedback control which makes the origin locally 
(globally) asymptotically stable for every 6{t). 

When 0(r) is a constant and known parameter 
vector, the stabilization problem, which is a 
fundamental issue in nonlinear control theory, 
has been extensively studied in the literature 
(see Sontag (1990) and Bacciotti (1992) for an 
extensive bibliography and a recent monograph). 
The class of locally (globally) feedback lineariz¬ 
able systems, i e those systems 

x=f{x)^g{x)u, /(0) = 0, XG/?'’, (4) 

which are transformable by a local (global) 
diffeomorphism 

z = r(x), r(0) = 0, zeU\ (5) 

and by a nonsingular state feedback 


u = a{x) + P{x)v, af(0) = 0, (6) 

with ^(0)^0 (for global feedback linearization 
P{x) 0, Vx G /?”) into a linear controllable 
system 


with 


z =Az + bv, 



o 

o 

0“ 


"0" 

A = 

0 0 1 

• 0 

, b = 

0 


_0 0 0 

■ 0_ 


_ 1 . 


(7) 

( 8 ) 
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is locally (globally) stabilizable. Necessary and 
sufficient conditions for local feedback linear- 
lization have been obtained in Jakubczyk and 
Respondek (1980) and Su (1982), while the 
problem of global feedback linearization is 
addressed in Boothby (1984,1986), Dayawansa 
et al. (1985) and Respondek (1986). 

Since feedback linearization techniques in¬ 
volve nonlinearity cancellation, the problem of 
extending those techniques to uncertain systems 
containing a vector of constant unknown 
parameters d was addressed in Nam and 
Arapostathis (1988), Taylor et al. (1989), Sastry 
and Isidori (1989) and Kanellakopoulos et al. 
(1991a, b) where adaptive versions of feedback 
linearizing techniques were proposed under the 
linear parametrization assumption which restricts 
the vector 6 to enter linearly in equations (1), 
namely 

6 )= ^ Oiq.ix). 

1 “ 1 

In Taylor et al. (1989) and Kanellakopoulos et al. 
(1991a, b) the nominal system is assumed to be 
globally feedback linearizable and the vector 
fields q,(x) are required to satisfy structural 
conditions: matching conditions were introduced 
in Taylor et al. (1989) and weakened by ex¬ 
tended matching conditions in Kanellakopoulos 
et al. (1991a) and, recently, in Kanellakopoulos 
et al. (1991b) by the strict triangularity 
conditions which are the least restrictive ones. 
Those conditions do not impose any growth 
requirement on the nonlinearities, such as 
boundedness^ sector or Lipschitz assumptions 
(see Nam and Arapostathis, 1988; Sastry and 
Isidori, 1989). 

The main result of this paper, given in Section 
3, leads to the design of a local (global) robust 
stabilizing state feedback control under three 
assumptions: the nominal system (/, g) is locally 
(globally) feedback linearizable; the uncertain 
vector q(x, 0(l)) satishes coordinate-free trian¬ 
gularity conditions; Q is a known compact set. 
This extends the stabilization results presented in 
Kanellakopoulos et al. (1991b) since the crucial 
assumption of linear parametrization is removed 
and bounded time-varying parameters and 
uncertainties on non-linearities (such as look up 
tables) are allowed. The technique of proof is 
based on a recursive, constructive algorithm 
which simultaneously builds a Lyapunov func¬ 
tion and a fixed static nonlinear state feedback 
control that makes the origin locally (globally) 
asymptotically stable for any d{t). When the 
components of q(x, d(t)) satisfy, in suitable 
coordinates, more restrictive global Lipschitz 
conditions then the recursive algorithm provides 


a globally stabilizing linear controller in those 
coordinates, thus recovering earlier high-gain 
robust control schemes (Thorp and Barmish, 
1981; Barmish etal., 1983). 

When the linear parametrization assumption is 
met, the robust stabilizing control can be made 
adaptive by the state feedback self-tuning 
algorithm given in Section 4 consisting of a 
dynamic compensator of order at most n which 
tunes the control parameters. This self-tuning 
algorithm allows us to remove the hypothesis 
that the compact set Q is known. When global 
Lipschitz conditions are satisfied, Lipschitz 
constants need not to be known. In Section 4 it 
is shown that any linear system in controller 
canonical form perturbed by global Lipschitz 
nonlinearities with unknown Lipschitz constants 
can be globally stabilized by a fixed state 
feedback dynamic compensator of order n. 

The reader is referred to Isidori (1989) and 
Nijmeijer and van der Schaft (1990) for basic 
notations and results in geometric nonlinear 
control theory. 

2 PRELIMINARIES AND BASIC RESULTS 
We first recall the necessary and sufficient 
conditions which identify those single-input 
systems which are locally (globally) feedback 
linearizable. 

Theorem 2.1 (Jakubczyk and Respondek, 1980; 
Su, 1982). The system (4) is locally feedback 
linearizable, i.e. there exist a local 
diffeomorphism (5) and a nonsingular state 
feedback (6) such that the closed loop system 
(4)-(6) in z-coordinates becomes (7) if, and only 

if, 

• G' = span {g,. .. , ad'ig) is involutive of con¬ 
stant rank i + \ for 0 s f < n - 1. 

A global version of Theorem 2.1 is given in 
Boothby (1986) (see also Boothby, 1984; 
Dayawansa etal., 1985; Respondek, 1986). 

Theorem 2.2 (Boothby, 1986). The system (4) is 
globally feedback linearizable, i.e. there exist a 
global diffeomorphism (5) and a nonsingular 
state feedback (6) such that the closed loop 
system (4)-(6) in z-coordinates becomes (7) if, 
and only if: 

• it is locally feedback linearizable; 

• there exists a function h e C°°(/?") such that 
dh^O. VxE/?", and (dh, G""") =0, Vxe 
R", 

• the vector fields ad'fg, 0 s i s « - l, with 
/ =/ - (L,L7-'A)->L;h, g = (L,L;-‘A)-'g 
are complete. 
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In what follows we shall always assume that 
the nominal system (/, g) is locally (globally) 
feedback linearizable. As far as the uncertainties 
are concerned we shall make the following 
structural coordinate-free assumption. 

Strict trUmgularity assumption. 

• ad^G‘ciG‘, Os/sn-2, Vfl € Q. 


. 

.^}- 

Since ad^G' c-G', 0 s i ^ n - 2, it follows that 

yL,k±^.-.*L,Lr'l,±.G'yc\ 


Remark 2.1. The strict triangularity assumption 
was introduced in Kanellakopoulos et ai (1991b) 
(where it is called strict feedback) generalizing 
earlier structural conditions (called pure feed¬ 
back in Kanellakopoulos et ai, 1991b) which 
were obtained in Akhrif and Blankenship 
(1988). In Kanellakopoulos et ai (1991b) and 
Akhrif and Blankenship (1988) the parameter 
vector is assumed to be constant and to enter 
linearly. 

Lemma 2.1. If the nominal system (/, g) is 
locally (globally) feedback linearizable and the 
strict triangularity assumption is satisfied, then 
there exist a local (global) diffeomorphism (5) 
and a nonsingular state feedback (6) such that 
the closed loop system (1)~(6) in z-coordinates 
becomes 

z, = z,+i + (pjizu... ,z,, e{t)), 1 </<«- 1 , 

Z„ = V ■¥ ^„{zu . . . , Z„, e(t)). (9) 


i.e. 

L^h = <Pi(z„ e(t)), 

= , 2,+,. e(f)), ls/Sn-1. 

In conclusion, setting Lfh + (LgLf~'h)u = v we 
have (9). 

3 ROBUST STABILIZATION 
We now state and prove the main theorem 
which provides the design of robust state 
feedback stabilizing controls. 

Theorem 3.1. Consider the system (1). If Q is a 
known compact set, the nominal system (/, g) is 
locally (globally) feedback linearizable and the 
strict triangularity assumption is satisfied, then 
there exists a local (global) static state feedback 
stabilizing controller. 

Proof. Since the assumptions of Lemma 2.1 are 
satisfied, system (1) can be locally (globally) 
transformed into 


Proof. By assumption the distribution is 

involutive and of constant rank n — 1. By 
Frobenius Theorem there exists a function h 
such that in L/q, a neighborhood of the origin, (in 
R" if Theorem 2.2 applies) 

{dh, ^^0, 

{dh, G"-^) =0. 

It follows that dh, . . . , d{L'}~'h) are linearly 
independent and that 

2 , = h{x) 
z, = Ly^h{x), 

is a local diffeomorphism in f/o (or a global 
diffeomorphism if Theorem 2.2 applies) and 
(zi,. . . , z„) are local (global) coordinates. In 
new coordinates system (1) becomes 

ii = 22 + Lgh, 


z„ = L;/i + (L,L;-‘/i)u + L„Uf-'h, 
and 

C” = span{^}, 


2) = 22 + <^i(2,, e{t)), 

22 = 23+02(^1.22.^(0). 

2„ = 0„(Z|. . . ■ .z„, 0(0)+ n. 

The proof proceeds by induction. We first prove 
the following claim. 

Claim. Assume that for a given index i, 1 s / < 
n, for the system 

2, =22+ 0i(2,. 0(0). 

( 11 ) 

2, = U, + 0,(2,.Z„ 0(0), 

there exist i smooth functions 

2*(2i), 2 ^( 2 ,, Z 2 ), . - • . 2,+ 1 ( 2 ,, . . . , Z/), 

z;(0....,0) = 0, 2£><i + l. (12) 

such that for the closed loop system with the 
control V, = zf+i in the new coordinates 


2| = 2l. 

2z=Z2-zJ(2i), 


(13) 


2(— 2, 2, (Zi, .... Zi-i), 
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the function 

(14) 

/-t 

has a time derivative satisfying the inequality 

(15) 

/-I 

with - / > 0, 1 ^^ i. Then, for the system 

ii = Z2+ 0i(z,, 0(0), 

(16) 

^#4-1 = ^1 + 1 0f-n(^l, ■ • ■ , 0(t)), 

there exists a smooth control 

Vi^Y “ ^i+2(^l, ■ • - , ^f + l), ■2 j+2(1^j ■ • • , 1^) 

(17) 

such that for the closed loop system in new 
coordinates 

fy, Z/^i = Zy+i-z;^.i(z,,. .., zO, 

(18) 

the function 

= (19) 

/-I 

has time derivative satisfying the inequality 

( 20 ) 

y-i 

with kj+i -i>0. 

Proof of the claim. Consider the system (16). 
Make the change of coordinates given by (13) 
and (18). According to (15) and (19), the 
function V,+i is such that 

v,>. ^ - i + l)2y 

/-I 

+ f,f.+,+ f,+,2j+,. (21) 

From (12) and (16), we have 

^1 + 1 = ^(4-2 <^l4l(^l, • ■ ■ , + 6>(0) 

(22) 

By virtue of (11), (13) and (18), we can write 

f 

0,+i(zi, . . . , Z/+,, 0(0)- 2 

/=! 

^ ^ 2y — y>i+i(ii+i, o(0)> (23) 

where |, = [zi,.. ., z,]^ and 

i/;,+,(0. 0) = 0. VOeO. (24) 

Since V't-t-i >s a smooth function of it+i and (24) 


holds, we can write (see Nijmeijer and van der 
Schaft, 1990) 

m=I V'/d/.i. 0(0)2/, (25) 

with 1^/, 1 < / < i + 1, continuous functions. 

’ Since 0 e £2, a known compact set, we can find a 
smooth function cr,4i(|,+i) such that 

1 < r -f-1, V0 e £2. 

Define z *.^2 

zt+2 = -2, - {ki+i - i + 1)Z/+, - z,+,£r?+i(|,+i), 

(27) 

with A:,+, -, > 0. Taking (21)-(23), (26) and (27) 
into account, with Z/4.2 = zf+j in (16), we obtain 

V,>1 ^ - S (*/ - i + 1)2/ - f?+icrf+i 
+ |z, + ,| II I, +,11 a, + , 

1 

X ^ ["!'•'*! 

2 L |z,.|.i| J 

L““ J 

^-lik,-i)zj, (28) 

jrr.Y 

which concludes the proof of the claim. 

We now show that the claim is true for i = 1. 
Consider the system 

Z, =Z2+ 0,(Zi, 0(0). (29) 

Since 0i is a smooth function of z^ and 
0i(O, 0) = O, V0e£2, we can write (see Nij¬ 
meijer and van der Schaft, 1990) 

0i(z,, 0(0) = z,V'i(z,, 0(0), (30) 

where 0i is a continuous function. Since 

d{t) e £2, we can find a smooth function cri(zi) 
such that 

V^,(z,,0(O)^<^,(z,). V0en. (31) 

Let Z 2 be defined as 

z* = -*,z, - z,ar,(z,), (32) 

with Jk, > 0. Consider the function 

Vr = izl (33) 
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The time derivative of (33) computed with 
Z 2 = Zi in (29), is given by 

K, * -*izf - z?(ar,(z,) - V>i(z„ 8(0), (34) 

and, taking (30), (31) into account, is such that 



(35) 

Applying the claim (n - l)-times 
construct a smooth function z*+i(z, 
which determines the control 

we can 
> ■ • • > z„) 

w = w„ = z*+,(z„ . .. , z„), 

(36) 

and a global change of coordinates 


^i = z,, 

Z2 = Z7-ZJ(Z,), 

(37) 

Zn Z„ Z„ (Z|, . . . , Z„ _ i), 


such that for the closed loop system (10), (36) 
the radially unbounded, positive definite 
function 

<=i 

(38) 


has time derivative satisfying the inequality 

£ -c ||^-||^ (39) 


is transformed into (10) where now 

4>/(0, .... 0, 8(0) ^ 0, for some 8 e Q. 

We rewrite system (10) as 
Z, = Z^+, + [0y(Zi....,Zy, 8(0) 

-MO .0. 8(0)1+ 0y(O....,O, 0(0) 

= Zy+, + 0/(z,,...,Z;, 8(0) 

+ ^/O,...,0, 8(0), l£/sn-l, 

2n = [0»(2|. 0(0) “ 4>n(0, • . ■ , 0, 8(0)] 

+ 0„(o.0, 8(0)+ w 

^ .0(0) + MO -0, 8(0) + V, 

and design the robust stabilizing controller 
considering only the terms 0,(zi,. . ., z,, 0(f)) 
which vanish in the origin. The terms 
0^(0, . . - , 0, 0(f)) which act on the system as 
additive bounded (since 0(f) is bounded and (pj 
is smooth) disturbances are ignored. Since the 
final controlled system is exponentially stable in 
z-coordinates (both V and V are quadratic) the 
additive disturbances which have been neglected 
in the controller design will produce bounded 
perturbations on the state in z-coordinates and, 
therefore, on the state in z-coordinate$ when the 
diffeomorphism z = r(x) is global. 


with c>0. As a result, the equilibrium point 
f = 0 is globally uniformly asymptotically stable 
(Hahn, 1967). Since z is related to x by the 
transformations (5) and (37) it follows that Jt = 0 
is a locally (globally) uniformly asymptotically 
stable equilibrium point for system (1), (36). 


Remark 3.1. As far as the stabilization problem 
is concerned, under the assumption that Q is 
known, Theorem 3.1 generalizes a result 
obtained in Kanellakopoulos ef al. (1991b). In 
fact, while the parameter vector is supposed to 
be constant and to enter linearly in Kanellako¬ 
poulos ef al. (1991b), this is not required in 
Theorem 3.1. Moreover, as shown in (26) and 
(31), uncertainties in the nonlinear terms (e.g. 
look up tables) are allowed in Theorem 3.1 as 
long as functional bounds are known. On the 
other hand in Kanellakopoulos ef al. (1991b) the 
tracking problem is solved as well while 
Theorem 3.1 only addresses the stabilization 
problem. 

Remark 3.2. Assume that the origin is not an 
equilibrium point independent of 0(f), i.e. 
assume that 


Corollary 3.1. Consider the nonlinear system 

0l(z. 0(0) 


z =Az + bu + 


. zeR", (40) 


L(/>„(z, 8 ( 0 ) J 

with (.4, h) in controller canonical form (8). If 

I^,(z, 8 ( 0)1 ll[zi • . . , 2/1^11, IsiSfi, 

(41) 


with known Lipschitz constants then there 
exists a static linear globally stabilizing state 
feedback controller. 


Proof. It is a direct implication of the proof of 
Theorem 3.1 since the functions ori, . . . , 
which satisfy (26) and (31) can be chosen as the 
constants 

Example 3.1. Consider the system 
i, = Z2 + i,sinf^ 

(42) 

Z2= U. 

Since |jCi sin f^| ^ |jti| Corollary 3.1 applies with 
^] = 1, = 0 ^nd therefore we can design a 

linear stabilizing controller. Following the proof 
of Theorem (3.1) (see (32)) we define 


^(0, 0(f)) i=- 0, for some 0 e £2. 
Following the proof of Theorem 3.1, system (1) 


xl = -AiXi-Zi. 

The control u, according to (27) and (36), is 
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chosen as 

Xi - ki{ki + l);ti - *2*2. (43) 

4. SELF-TUNING STABIUZATION 
In this section we analyse the special case in 
which the parameter vector d appears linearly. 

Linear parametrization assumption. 

• The vector held q(x, 0(t)) is expressed as 

q(x, 0(f)) = 0,(f)9/(Jc)- 

/-I 

If the linear parametrization assumption is 
satisfied we can construct a dynamic controller 
without requiring the compact set £2 to be 
known. 

Theorem 4.1. Consider the system (1). Assume 
that the nominal system (/, g) is locally 
(globally) feedback linearizable and that the 
strict triangularity and the linear parametrization 
assumptions are satisfied; then there exists a 
local (global) dynamic state feedback stabilizing 
controller. 

Proof. Lemma 2.1 applies and system (1) can be 
locally (globally) transformed into the form (10) 
which by virtue of the linear parametrization 
assumption becomes 

1 = 1 

(43) 

in = U + 0,(f)«p„,,(2,, . . . , Z„). 

i-1 

Now, the proof proceeds by induction. 

Claim. Assume that for a given index i, 1 £ i < 
n, for the system 

l = Z 2 +^ 0y(f)0,.y(z,), 

(44) 

i/= V, -t- ^ 0y(f)0,,y(z,, . . . , Z,), 

y-i 

there exists a filtered transformation 

Zi = Z,, 

Zy= Zy - Zy*(Zl, .... Zy_l, fit,. 

z*(0, .. ., 0, ^ 1 , . • .) A/-i) ~ 2 Sy £ i, 

Ay=»;y(Z,,...,2y. Al. --.Ay-l). 

Tjy(o,..., 0, Ai. - - - > A/-i) *^ I — 1, 

(45) 


and a self-tuning control 

ty/“z,+i(Z], . . . , Zi, Ai. - • • f Ay). 

^y*+i(0> - - ■ . 0. Ai. - ■ - I Ay) “0. 
Ay~^y(^ii - - • » 2y. Ai. - • - » Ay-i). 

tji(0, .... 0, At. - - - . Ay-i) ~ 0, 

(46) 

such that for the closed loop system (44), (46) 
the time derivative of the function 

V, = ii(z'? + Ay). (47) 

y-i 

with Ay = My ~ Ay. My suitable constants, satisfies 
the inequality 

+ (48) 

y-i 

with *y - I > 0, 1 ^ i. Then, for the system 

ii = -22 + ^ 0y(O0i.y(^i). 

y-l 

(49) 

Z,+, = V.+i -I- ^ 0y(f)0y+l,y(z,, .... Z,+ ,), 
y-l 

there exists a self-tuning control 
V/^.i = zf+ 2 ( 2 |. - - . 2y+i. Ai-- Ay+i). 

z;+2(0, . . , , 0, , ^ii+,) = 0, 

Ai-n = »J.-n(2i. - - - . ^y-t-i. Ai. - - . Ay). 

Tfi+tiO, . . . , 0, Ai. - - - . A.) = o, 

(50) 

such that for the closed loop system (49), (50) in 
the new coordinates given by the filtered 
transformation 

Zy=Zi, 

Zj = Zj — Zj (zj, ...» Zy_i, Ai. • - - . Ay-i)> 

(51) 

the time derivative of the function 

y+i 

Vy+i = i 2 (^y+ My) 
y-l 

= V^ + K2f.i + A?.i). (52) 

with Ai>i = Mi>i ~ Ai-m. My-fi ^ suitable constant, 
satisfies the inequality 

Vy.i^-’f(*y-'>'y. (53) 

y-l 

with *j+, - i > 0. 


Proof of the claim. Consider the function 
V^y+i = V,-fif?^,. 


( 54 ) 
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In view of (44), (47), (48) and (51), we have 

2 (*/-*■+ l)fy 

/-I 

+ + (55) 

From (45) and (49), we obtain 

^.-n = W<+1 + J) ®y(09>/+i,/(^i, • • • . Z(+i) 


/-I 


(«) 

Since 0y.*(O,.... 0) = 0, 1 s) s i + l, 1 s sp, 
we can write (see Nijmeijer and van der Scheft 
(1990)) 

0y,*(^l, ■ ■ ,Z/) = ^ . Z,)Zh (57) 


(-1 


which by (51) implies that we also can write 

4>,,k(Zl, ■■■ ,2,)='^ 0/.*(^i, ■ • • , z,)z,. (58) 

/=! 

Therefore, we have 

^ e^(O0,+i.y(zi. ■ •., z,)-2 


;-l 


X ^ e*(t)0^,*(z.. z,) 


;=l /=l 

/=^1 O'^y /-I 

p 1 + 1 

- L ®/(0 2 V’/.((l.+i. Ai. ■ ■ •. A.)2/- 

;-l y=l 

Since ^ e £2, a compact set, we can find a 
smooth function cr,^.i(|,^.,, jQi, . . . ,/i.) and an 
unknown positive constant such that 

k/.Kl j + lr fki, . . . , At)l ^ Ai. ■ ■ - r Ai). 

(59) 




with 

/^i+i ” ^?+i^+i» (62) 

and i(cy+i-l>0. Consider the function (A/+i“ 

f^i+i ”■ A^+l) 

Vf+i*^^. + iA?+i. (63) 

Taking (55), (56), (61) and (62) into account, we 
obtain 

/+! 

7-1 

- /ii+,zf+,aff+, + |Z(+,| |||,+,(| af,+,V/v]7 

1 y/f^t-n *■(+! "1 


Vful «( + ! 




riil-..ir 

I i^,+ii - 


S - 2 (*; - 1)^?- 

/-I 


(64) 


To prove that the claim holds for i = 1, we 
consider the function 

Vi = -(*1 - l)r, - AiZia,(z,) ^ zj(z,, Ai), (65) 

in which A:i-'1>0 and ciri(zi) is a smooth 
function such that 


/=1 


so that 


^^ia,(zi), (66) 

with a suitable unknown positive real and 

(^ 1 , is continuous since 0,y(O) = O). The 
dynamics of Ai are chosen as 

A, = z?a,(z,). (67) 

It is easy to verify that the function 

V, = izf + i/if, (68) 


|i0;(o'i'v',,«(l,+i.A..A,) 

ly-1 /=i 

<V/iuTa/+i(l/+i. Ai. • ■ ■ . A.)- ( 60 ) 

Define as 

l^f+l ^ ~~ (^1 + 1 “ 1 “1" l)-Zy + i ~ Ai+l^f + l^?+l 



— Z*,4.i( 2]| . . . , -Zi + l, Al> • • • » Ai + l)» (61) 


with Ai = 1*^1 “ An has time derivative such that 

Vi^-k,zi (69) 

Applying (n - l)-times the claim we can 
construct a function v„ = zU+i(Zi,.. . , z„, 
Ai,.... An) which determines the control 

v = v„, (70) 

and a change of coordinates 

z = iz, .z„)-»f = (z,.f„), (71) 
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such that the function 

= + (72) 

has time derivative satisfying the inequality 

V^-c\\z\\\ (73) 

with c>0. Consequently, ||z(t)|| and the 
estimates jil,(t) are bounded, which imply that 
||z(t)|| is bounded. The control variable u which 
depends on z and on the estimates (k, is also 
bounded and, in turn, ||z(/)|| is bounded (see 
(43)). According to (56) and (61), ||f(f)|| is 
bounded and therefore the function z^z is 
uniformly continuous. This fact along with the 
following inequality 

lime f ||z(T)||^dTSlim- f V(r)dT 

I-.® Jo 

= V'(0)-lim V(0<®, (74) 
imply by Barbalat’s Lemma (Popov, 1973) that 
liml|z-(OII=0. (75) 

and, by (5) and (71) that 

lim||jr(OII=0. (76) 


Remark 4.1. Note that the dynamic compen¬ 
sator in Theorem 4.1 is of order at most n and 
therefore independent of the dimension of the 
vector 6. 


with k 2>0 and 

fi2 = xla(2ixu Ai)- 

The function az has to satisfy, for some 
unknown value of H 2 , the inequality (60) which 
for system (77) becomes 

(0,(f)jc, + 62 it)x]) £ V{i2 a2ixi, Ai)- 

A possible choice for 0^2 is 

tr2(;r„ A.) = 1 + (1 + A?Vt 


The proof of Theorem 4.1 allows us to 
generalize Corollary 3.1 and to determine a fixed 
stabilizing controller for a family of linear 
systems in controller canonical form perturbed 
by global Lipschitz nonlinearities with unknown 
Lipschitz constants. 


Theorem 4.2. Consider the system 


z = Az + bu + 


®(0) 


\_4>„{z, 0(0) J 


z 6 R", (78) 


with {A, b) in controller canonical form (8). If 

\4>,{z, 0(0)1 Sju, ||[zi, . . . ,z,]’^||, l<i<n, 

(79) 


with unknown Lipschitz constants n„ then there 
exists a globally stabilizing dynamic state 
feedback compensator of order n. 


Example i.\. Consider the system 

X^=X2+ 6iit)x] + 02(OJC| . 

X2=U, 

where 0,(0 and 02 (f) arc bounded disturbances. 
System (77) satisfies the conditions of Theorem 
4.1. Following the proof of Theorem 4.1 we 
define (see (65)) 

u, = -(k,-l)jc,-Ai-<Ciai(-ri). 

where k, > 1 and, according to (66), (ir,(ji;,) can 
be chosen as 

ai{x^) = l + x]. 

The dynamics of Ai are chosen as in (67) 

A, = JC?a,(jCi). 

Define, as in (51), 

X2 = X2-Vi. 

According to (61) the final control u is 

M == 1^2 = “JCl - *2^2 - (^2i2(xliXu Al) 

- (iyXxO(x{xx)- — X2. 


Proof. Even though the strict triangularity and 
the linear parametrization assumptions are not 
satisfied they are satisfied by the bounding 
functions in (79) so that we can follow the 
arguments used in Theorems 4.1. The assump¬ 
tion that £2 is known, which is required in 
Theorem 3.1, is not needed. 

Example 4.2. Consider the system 
ii = jr 2 + 0{t)xx sin jr 3 , 

X2=X^2 
= W; 

with 0{t) belonging to the unknown interval 
[Bm. ^m]- Since 

|0(r)jri sin x^\ < max 

Theorem 4.2 applies. Note that the strict 
triangularity assumption is not satisfied. 

5 CONCLUSIONS 

We have addressed the robust state feedback 
stabilization problem for single-input, time- 
varying uncertain nonlinear systems. Assuming 
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that the nominal time-invariant system is 
globally feedback linearizable, that the time- 
varying nonlinear uncertainties satisfy structural 
coordinate-free triangularity conditions and that 
the time-varying unknown parameter vector 
belongs to a known compact set, we construct m 
Theorem 3.1 a fixed static state feedback control 
which globally stabilizes each system in the 
family. 

In Theorem 4.1 we show that if the unknown 
parameter vector enters linearly, the knowledge 
of the compact set in which it varies is not 
required. Theorem 4.2 shows that the techniques 
developed in this paper lead to the construction 
of a fixed self-tuning control which globally 
stabilizes any linear system in controller 
canonical form perturbed by time-varying 
unknown nonlineaiities as long as they are 
bounded by globally Lipschitz functions satisfy¬ 
ing the strict triangularity condition, without 
requiring the knowledge of the Lipschitz 
constants. 
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Fading-memory Feedback Systems and Robust 

Stability*t 

JEFF S SHAMMA$<^ and RONGZE ZHAO|l 

A sniall-guin condition is necessary for robust stability of fading-memory 
closed-loop feedback systems with unstructured model uncertainty Linear 
plants stabilized by fading-memory nonlinear compensators lead to closed- 
loop fading-memory systems 


Key Woril§ Nonlinear systems, time varying systems robust control robustness system theory 


Abitnct—This paper considers fading memory for nonlinear 
time-varying systems and associated problems of robust 
stability 

We define two notions of fading memory for stable 
dynamical systems uniform and pointwise We then provide 
conditions under which stable linear or nonlinear systems 
exhibit uniform or pointwise fading memory In particular 
we show that (1) all stable discrete time linear time varying 
(LFV) systems have uniform fading memory (2) all stable 
continuous time LTV systems have pointwise fading 
memory and (1) stable finile-dimensional continuous time 
1 TV systems have uniform fading memory 

We then show that a version of the small gam theorem 
which employs the asymptotic gam of a fading memory 
system is necessary for the stable invcrtibility of certain 
feedback operators These results are presented for bo»h 
continuous lime and discrete time systems using general ^ 
or notions of inpul/oulput stability and generalize 
existing results for stability We further investigate fading 
memory in a closed loop context For linear plants wt 
parametrize all nonlinear controllers which lead to 
closed loop pointwise fading memory 


1 INTRODUCTION 

Thf problem of robust stability analysis (cf 
Dorato (1987) and references therein) is to 
determine under what conditions a given 
controller stabilizes a prescribed family of 
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possible plants Typically, this plant family anses 
from various approximations, simplifications, 
and limitations in the plant modeling process 
One framework for plant family representations 
is that of unstructured uncertainty More 
precisely, the plant family is represented as a 
nominal plant combined with a norm-bounded 
perturbation The theory for robust stability 
analysis for linear time-invariant systems subject 
to unstructured uncertainty is well developed 
(e g Chen and Desoer (1982), Dahlch and Ohta 
(1988), Doyle (1982), Doyle and Stem (1981), 
Duylt et al (1982), Georgiou and Smith (1990), 
Khammash and Pearson (1991)) 

Linear systems typically arise as linearizations 
of nonlinear systems Furthermore, adaptive 
control laws for linear systems are typically 
nonlinear Thus, robust stability analysis tools 
for nonlinear systems are desirable For 
nonlinear systems, a standard tool for stability 
analysis is the small gain theorem (Desoer and 
Vidyasagar (1975), Sandberg (1965), Zames 
(1966)) A limitation of the small gam theorem is 
that It can often be a conservative sufficient 
condition for stability Recent work by the 
author (Shamma (1991)) has shown that the 
small gain theorem is m fact necessary when 
considering nonlinear plant families charac¬ 
terized by a norm-bounded perturbation This 
work was developed for (i e finite-energy) 
stability of discrete-time systems The nonlinear 
systems considered in Shamma (1991) are those 
with fading memory (e g Boyd and Chua 
(1985)) Intuitively, a fading memory property 
means that the current output depends on the 
recent inputs and not the remote past Thus, 
fading-memory is a reasonable assumption for 
many physical systems 

The objective of this paper is to further 
develop the analysis of fading memory systems 
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in the context of robust stability. The results 
here are presented in an essentially norm- 
invariant manner. We consider stability over 
arbitrary JZ), or fp spaces with p e [I, We also 
consider a form of bounded-input/bounded- 
output ^ ox stability with asymptotic decay. 
Given the norm-invariant nature of the presen¬ 
tation, it is the fading memory property which is 
isolated and exploited to lead to the desired 
results. 

In this paper, we define two notions of fading 
memory for stable dynamical systems: uniform 
and pointwise. In uniform fading memory, the 
effects of a finite-duration of input eventually 
vanish depending on the length of the duration 
only. In pointwise fading memory, the effects of 
a finite duration of input also eventually vanish 
but now depending on the length of the duration 
and the input itself. 

The distinction between uniform and po¬ 
intwise fading memory turns out to be 
important. In particular, we will see that all 
stable discrete-time linear systems exhibit uni¬ 
form fading memory, all stable continuous-time 
linear systems exhibit pointwise fading memory, 
and stable finite-dimensional continuous-time 
linear systems exhibit uniform fading memory. 
The pointwise fading memory property also 
allows us to weaken the conditions under which 
the small gain theorem is necessary. This leads 
to a larger class of nonlinear systems for which 
these results are applicable. 

The fading memory property is also con¬ 
sidered in a closed-loop context. More precisely, 
we consider under what conditions a stabilizing 
compensator leads to a closed-loop system with 
pointwise fading memory. A key tool in this 
analysis is the factorization representation tor 
nonlinear operators (cf. Verma (1988) and 
references therein). This allows us to para¬ 
metrize all nonlinear compensators for linear 
plants which lead to closed-loop stability with 
pointwise fading memory. This parametrization 
takes the form of the familiar linear fractional 
parametrization (e.g. Francis (1987), Youla et al. 
(1976)) with the free parameter having fading 
memory. 

The remainder of this paper is organized as 
follows. In Section 2, we CvStablish notation and 
state some preliminary results. In Section 3, we 
consider the fading memory property. In Section 
3.1, we define uniform and pointwise fading 
memory. In Section 3.2, wc concentrate on the 
differences between pointwise and uniform 
fading memory and provide conditions for an 
operator to have pointwise fading memory. In 
Section 4, we generalize previous results 
(Shamma (1991)) which show that the small gain 


theorem is necessary for robust stability of 
fading memory nonlinear systems. These results 
are applied to a robust stabilization problem in 
Section 5. Section 5 also presents conditions 
under which a closed-loop system exhibits fading 
memory. Finally, Section 6 contains some 
concluding remarks. 


2 MATHEMATICAL PRELIMINARIES 
In this paper, we will consider both discrete- 
time and continuous-time systems with several 
notions of finite-gain stability (cf. Desoer and 
Vidyasagar (1973); Willems (1971)). Towards 
this end, the symbol T is used to denote any one 
of the following normed signal spaces: or 

with p 6 fl, oo), yj;, and c,,, where 

If el/-Am sup |/(t)|=o}, 


C(i= ]/ e : lim |/(n)| =()[. 


The spaces y' and are equipped with the 
usual norms, all denoted HH. The spaces //^ 
and Cl) are equipped with the usual “supremum ’ 
norms, also denoted || • ||. The symbol S' is used 
to denote either or Occasionally, it will 
be necessary to specify or restrict the particular 
definition of V and J. 

Let /: .T—► The support of / is denoted 

supp(/). The restriction of / to the interval 
[a, h] is denoted J For T £ .T, denotes 

the T-shift (time-delay) operator; 


Sr fit)- 


10 , 


t<T- 


and P) denotes the truncation operator: 



t^T\ 

t>T. 


The extended space, is defined as 

'K '= if Pff e r, WTeST}. 

The set of all f e Tj. with f ^ V is denoted by 

Y,\r. 

Let H \ Tp—> Y,. Then H is called causal if 
PThif = PiHPrf, VT e .T, 
time-invariant if 


HS, = SrH, VT 6 J, 
stable if / 6 T implies Hf e Y with 


l|W|l = su^ 




ll/ll 

/#() 


< 00 , 
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and incrementally stable if it is stable with 




sup 

f\^h 


ii/i -m 


< X. 


The definitions of stability over and r„ are 
somewhat non-standard. For causal operators, 
stability over implies stability over with 
’.he same induced norm. The main difference is 
that .stability over implies a notion of 
asymptotic stability in a boiinded- 
input/bounded-outpui setting. Similar arguments 
hold for stability over c„. 

Henceforth, all operators arc assumed to he 
causal. The .set of all stable NLTV operators 
is denoted .9^,. The subset ol 
operators in 19^, which are linear (and possibly 
time varying) is denoted y,. 

The following definition is adapted from 
Willems (1971). 


the Holder inequality (e.g. Desoer and Vid- 
yasagar (197.S)). 

Now let y denote Since supp(v) = 
[T,, TjJ, we can con.sider n as an element of 
y"[7'|, Tj). From the Hahn-Banach theorem 
(Rudin (1987)) there exists a ze(y^[7',, T^])* 
such that ||z|| = 1 and 2 (i') = ||o||. Then define G 
by 

(<^^/)(f) = i~»V-(/)z(/|,7,/:|)- 

Note that since K'(t) = 0 for t <^1^, this operator 
is causal. CA)nditions 1-4 follow immediately 
from this definition. In fact, the construction in 
case y denotes follows these lines with the 
dual element z slated explicitly. 

The construction lor discrete-time { V denotes 
or C(,, and if denotes z^) follows from similar 
arguments. 


Definition 2.1. The operator I - G: 1] is 

said to be causally invertible if 

(1) / - C IS one-to-one and onto 

(2) {! - G) ’ is causal. 

A sufficient condition for causal mvertibility of 
I~G (cf. Willems (1971)) is that G can be 
factored as G = S, G or C7 - GS, , where f ' 0. 
Finally a preliminary lemma is presented. 

Lemma 2.1. Let i;, c-) with supp(iJ) = 
[T;, Tj] and supp (iv) -{G, 74 ] with G -- 1 2 
There exists a G e Jj such that: 

(1) w = Gv. 

(2) supp(G/)c=[7:„ Gl \/fer. 

(3) supp(/)cz5r~[7L 7;]4>Gy =0. 

(4) ||G|| = ||iv| 

Proof. For simplicity, the proof is given for the 
singlc-input/single-output case only. 

It will be necessary to distinguish between the 
various definitions of Y ann 5". First, let Y 
denote for some /? e [1, ^), and let denote 
Define C7(., .):7 x by 

g{t, ■^) |w(T)r'' sign (n(T)). 

Now define G € by 

(G/)(0 =' f git. t)/(T)dT. 

-'O 

Condition 1 follows immediately from this 
definition. Using that g(r, t) = 0 for [7,, T 2 ] 
or ^ i [7,, 74 ] leads to Conditions 2-3. Condition 
4 follows from a straightforward application of 


^ rADlNG-MEMORY 

3.1. Definitions and conditions for fading 
memory 

A notion of fading-memory is the central focus 
of this paper. Intuitively, fading-memory implies 
that the effects of a finite-duration of input 
eventually vanish. A definition of fading- 
memory tailored to stable time-tnvananl systems 
over 7' or y signals was given in Boyd and 
Chua (1985). An alternate definition for stable 
time-varying systems over was given in 
Shamma (1991). The following definitions 
generalize those in Shamma (1991) to stable 
time-varying systems over Y. 

Definition 3.1. An operator //e '^^id to 

have pointwise finite-memory if there exists a 
function FM{ , ,//); Lx ifif such that for 

all / e y and f f f 

( 1 ) FM{}.LH)^t. 

(2) FM{fD\H) = FM{PJJ\Hl 

(3) (/- ,.„,)/// = (7 - 

Note that FM{f, l\ H) is causally dependent on 

f 

This definition of pointwise finite-memory is 
.somewhat weaker than that in Shamma (1991). 
The definition in Shamma (1991) requires that 
inputs over a given finite-duration are forgotten 
uniformly as follows. 

Definition 3.2. An operator H e 1/^1. •*> said to 
have uniform finite-memory if there exists a 
function FM{ •; 77): 3" —».T such that for all 
fey and t e S' 

(1) FMit-H)^t, 

(2) (7 - = (7 - Ff*,(,,«))77(7 - P,)f. 
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The notation FM{-) is used both for uniform 
fading-memory and pointwise fading-memory. 
From the different number of arguments, this 
relaxed notation should not lead to any 
confusion. 

Fading-memory op)erators are now defined as 
follows. 


See Boyd and Chua (1985) for examples of 
linear time-invariant op>erators on €°° and iP" 
which do not exhibit fading-memory. 

Example 3.2. A nonlinear time-invariant opera¬ 
tor without pointwise fading memory. 

Define 


Definition 3.3. An oj)erator He^^L is said to 
have uniform (resp. pointwise) fading-memory if 
it can be approximated arbitrarily closely in 
norm by uniform (resp. pointwise) finite- 
memory operators. 

Some useful consequences of these definitions 
are now presented. The proofs are slight 
modifications of similar propositions in Shamma 
(1991) and are omitted. 

The following proposition states that pointwise 
finite-memory nonlinear operators are right- 
distributive over signals with sufficient time- 
separation. 

Proposition 3.1. Let H have pointwise finite- 
memory. Let /, e T have supp (f) c [0, T,]. Let 
Ti = FM{fy,TuH). Then for all with 

supp (/z) '=(72,“). 

Hifi+f2) = Hf + Hh. 


(///)(n) = sat(l|P„/||)/(n), 

where sat (•) denotes the standard unity- 
saturation function. 


3.2. Pointwise fading-memory 
In this section, we will highlight the difference 
between pointwise and uniform fading memory. 


Example 3.3. A pointwise but non-uniform 
fading-memory operator. 

Let M denote the set of / e 1) such that 

f^/(^)e.^"[l,®). Define by 


(/7/)(0 = 


0 , 0 :£t<l; 

0, ^ /Ifo, I) ^ 


The following theorem characterizes how the 
effects of a finite-duration input eventually decay 
in uniform fading-memory operators. 

Theorem 3.1. For He&’sLy the following 
statements are equivalent: 

(a) The operator H has uniform fading-memory. 

(b) Given any c > 0, there exists a function 
FMi{ - ■, H): S'-* S' with FMf{t\H)^t such 
that 

ll(/- 

-(/-P^-A,,„.«,)H(/-WI1^ Ell/ll, 

for all / 6 T and t e 

The following theorem states that all discrete¬ 
time linear operators have uniform fading- 
memory. 

Theorem 3.2. Let V denote with p e [1, ») or 
Co- Then every H e has uniform fading- 
memory. 

Some examples illustrating fading-memory are 
the following. 


In words, the operator H takes the input over 
the time-interval [0,1) and stretches it over the 
time-interval [1, 

From Theorem 3.1, it is clear that H does not 
have uniform fading-memory. However, H does 
have pointwise fading-memory. To see this, we 
will construct a pointwise finite-memory ap- 
proximant, /?, of H as follows. Given any c:>0, 
define H by 

:mu)= 

'o, 0 <f<l; 

0, r ^ 1. /l[o,i) ( 

0. IIV.)//ll>(l-f)ll/l|o.i)ll- 

From this definition, it is easy to see that 
\\H - H\\:^ E. The main idea is that for any 
signal /, H truncates the portion in the interval 
[0,1) which will be stretched to [1, “). However, 
the portion of [0,1) to be truncated is 

signal-dependent, hence the restriction to po¬ 
intwise fading-memory only. 


Example 3. L A linear operator over without 
pointwise fading-memory. 

Define r as (///)(«) =/(0). 


The above construction of a pointwise 
fading-memory approximant provides the main 
insight for the following theorem. 
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Theorem 3.3. For H s , the following 
statements are equivalent, 

(a) The operator H has pointwise fading- 
memory. 

(b) Given any e>0 the following condition 
holds. There exists a function 

■; W): T* X ,T—► such that for atl 

/ e r and f e 3" 

( 1 ) 

(2)/ 

( 3 ) 11(7 - Pfm,u ~ 

H{I-P,)J\,^E\\f\\. 

Proof, {a^h) Given any f > 0, choose H e 
with pointwise haite-memory such that \\Pl - H 
II < e/ 2. Then set the desired FM^{ ‘, ■\H) = 
FM{ ', •;/5). To see this, for any / e T and 
T 6 

U “ F 

~ U ~~ m))FIU ~ Ff)f 

= — P FM{f,T.H))^^f 

However, 

-(/-/W;,e/))H(/-/^)/ = 0. 

Standard norm bounding leads to the desired 
result. 

(b => a j Given e > 0, we will explicitly 
construct a pointwise finite-memory ap- 
proximant, H, to H. Let ar„ = E( 2 )”^\ and let 
FM^^(-, ■;//) be functions as in condition (b). 
For any / e Y, define the sequence {r, } c: ,T as 

= 0 

t2 = FM^Xf^t,;H) + l 


Note that for a given t, this sequence is 
dependent on the particular input, /, but in a 
causal manner. Then set 


{F!f){t) = 


{H{l-PJf)ir\ 


etc. 


ti^t< t2\ 
t2^t<t2,\ 


It follows that He^)siL has pointwise finite- 
memory. Furthermore, 


Theorem 3.3 leads to the following corollary 
that dll continuous-time linear systems exhibit 
pointwise fading-memory. 

Corollary 3.1. Let denote with p e[\, 
or Then eveiy H e Sf has pointwise 

fading-memory. 

Finite-dimensional linear systems, however, 
always exhibit uniform fading-memory. 

Theorem 3.4. The input to-statc mapping of a 
continuous-time linear system has uiiilorm 
fading-memory. 

Proof Let H denote the inpui-lo-state mapping 
of a continuous-time finitc-dimensional linear 
system stable over F, 

g(t) = A{t)git)~^f(t). 

Let d>(/, r) denote the associated state-transition 
matrix. Since H is finite-dimensional, d> admits 
the decomposition r) = M{t)M '(t) for an 
appropriate M. 

It suffices to construct a function FM, (•;//) as 
in Theorem 3.1. Since H is linear, the desired 
condition from Theorem 3.1 becomes 

Towards this end, let / e Y have supp (/) c: 
[0. 7;], and let g = Hf Then for / > 7’,, 

M '(r)/(r)dT. 

-^(1 

Thus 

|g(0l=^|M(0ll|Af-’||,,„|ll/'' 

where 

‘ + Ui. 

p ^ 

in case Y denotes !f'\ q — \ in case Y denotes 
|Af(r)| denotes the pertinent induced matrix 
norm, and 

. l/r/ 

(T)rdrj . 

The above relationship can be used to define the 
desired function FM^\ namely set 7Af,(Ti;77) to 
be the smallest time Tn ^ 7, such that 

ll^|[7, -)ll F ||A/ ^|(0.7,lll^' - 



y=() 

ll/ll ll/ll. 

y-0 


4 CONDITIONS FOR ROBUST INVERllBILITY 
The main results of Shamma (1991) deal with 
uniform fading-memory systems over In this 
section, these results are extended to encompass 
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pointwise fading-memory continuous-time and 
discrete-time systems over any signal-space in V. 
The main results are the following. 

Theorem 4.1. Let V denote any one of the or 
Tp spaces with p e [1, *). For H e let Q e 
be called “admissible” if the operator I - QH is 
causally invertible and 1|0||<1. Then the 
operator I - QH has a stable inverse for all 
admissible Q e if and only if 

inf \\HS,\\^\. 

ft.'y 

Definition 4.1. Let H e satisfy 
inf WHS.W = y. 

tzJ 

Then H is said to have the uniformity in 
norm-excitation (UINE) property if the follow¬ 
ing condition holds. Given any e e (0, y), there 
exists a sequence {fk} c V such that 

(1) l|HS*/*||>(y-f)llAI|, 

(2) sup||/*||<-. 

k 

(3) inf II/* II >0. 

k 

The UINE property states that approaching 
the operator gain, ||//,V*||, does not require 
injecting signals that become arbitrarily large or 
small as Note that by the homogeneity 

property of linear systems, any H e .Vi has the 
UINE property. Furthermore, any nonlinear 
fi = which is time-invariant also has the 
UINE property. 

Theorem 4.2. Let T denote any one of the or 
(p spaces with p6[l,oc). Let H e have 
pointwise fading-memory and the UINE pro¬ 
perty. Let Q e he called “admissible” if the 
operator {I - QH) is causally invertible, Q has 
pointwise fading-memory, and ||0ll < Then 
the operator / - QH has a stable inverse for all 
admissible Q e and 

sup \\(I - QHy l<ac. 

Q admissihlc 

only if 

inf i|/yi;il<i. 

/e T 

A key difference between Theorem 4.1 and 
Theorem 4.2 is that Theorem 4.2 provides 
necessary conditions foi uniform robust inver- 
libility. The reason for this is that in case H is 
nonlinear (as in Theorem 4.2) a destabilizing Q 
is constructed for H with finite memory but not 
fading memory. Rather, when H has fading 
memory, it is the uniformity in stability which is 
violated. 


It is important to note that the asymptotic 
small gain condition of Theorem 4.2, while 
sufficient for robust invertibility, is not sufficient 
for uniform robust invertibility. It is easy to 
show (using simply time-varying gains) that the 
uniform bound in norm can be violated in 
finite-time. A claim without proof of uniform 
robust invertibility was made in Shamma (1991). 
Hence, the statement of Theorem 3.2 in 
Shamma (1991) should be modified accordingly. 

Proof. The following is the necessity proof of 
Theorem 4.2 for H with pointwise finite- 
memory. This will sufficiently demonstrate the 
main ideas so that one may modify the proofs in 
Shamma (1991) appropriately. The main 
difference between the present results and those 
of Shamma (1991) is that pointwise finite- 
memory is assumed rather than the stronger 
uniform fading-memory. 

First, suppose that 

inf ||//5J|-l4-e>l. 

We will construct a signal / e for which 

there exists an admissible Q such that (/ - 
QH)f e y, hence {I - QH) ’ is not stable. 
Towards this end, there exists an /J, e T and time 
r,, e S such that 

(1) supp(/,) = |0, fi)l- 

(2) ll(///.)l|0,Mll^(> + f/2)ll/;.ll; 

Similarly, since \\HSf\\^l uniformly in time, 
there exists an /, e F and time e 'J such that 

(1) supp(/i) = [FA/(/,. f«;A/)+ 1. 1,]. 

(2) ||(H/,)|,*M,/,,,,,,v,)..,,,|ll^(l + f/2)||/;^ 

This sequence of /„ and has the following 
recursive form. 

(1) supp(/„) 

= [FAf(/,-t- • ■ +/„ 1. Ll- 

(2) \\Hf„\\f.\nh{ ifn i-L ,«)+i/„|ll 

^(1 + f/2)11/„||. 

Note that via Proposition 3.1, 

"(Ey«) = S"/n. 

Since // has the UINE property, we may 
assume that 

lim ||/„|| = ae(0, =c). 

If this were not the case, the UINE property 
assures that ||/„|| may be selected so that 

Thus, we may select a subset of the f„ such that 
||/„|| is convergent. 

JL 

Now let / = 2 Then / e F^\F. In order to 

n - 0 

construct the destabilizing Q, let /„ denote the 
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(th interval 


/„ - [FM(/o+ r„.W) + 1 , 

Then 

+ f/2) ||/|,J|. 

Since ||/„|| —a-»0. for n ^ N* with N* 
sufficiently large. 


II /... II 

il/.ll 


^ 1 -t- t/4. 


The destabilizing C' is now constructed as 
follows. For n< N*, set Q„ = 0. For n^N*, set 
Q„ to be the LTV operator which maps 
as m Lemma 2.1. It follows that 


iieji- 


1 -I- e/4 
l-l-f/2 


< 1 , 


n Si N*. 


Now set ^ = V Q^, With this construction, Q 

n 

exhibits pure delay hence is admissible. Further¬ 
more, the summation Q = '^Q„ admits a "block 

n 

diagonal” representation (cf Lemma 2.1), hence 
WQW < 1. Finally, 


presented for time-invanant H with V — In 
the time-varying case, the UINE property can be 
exploited as in the proof of Theorem 4,2. The 
proof for T = Co follows from similar arguments. 

Since ||/y||>i, th;:re exists an /□ € and 
f > 0 such that 

(1) supp (/;,) = (0, fn). 

(2) lj(w/:.)U,,l|2i(l + 2t•)||^||. 

As in the proof of Theorem 4.2, we will 
construct a signal / e and an admissible 

Q such that (/ - QH)f e This signal / will 
take the foim 

/ - ‘■S'/./o + SjJ.i + + . . . , 

where the T, are chosen such that the effects of 
the previous inputs have sufficiently decayed. 
Towards this end, let 

r = o. 

= /,;//) + !, 

= 7',+ro;//) + !, 

T, = FM, (/o + SjJ,, + Srju. r, + 77) -f 1, 

. . . etc. 


(/ - QH)f = 2 /„ e r 

n 0 

Since {I-QH)fe T and f e f.M'', the stable 
inverlibility of (/ - QH) is violated 

The C'csv inf \\HS,\\ = 1 follows from continuity 

fey 

arguments as in Shamma (1991) ■ 

Note that only linear Qs are used for 
destabilization. For a time-invanant H, the 
above construction can always lead to a linear Q 
which IS also periodic. 

The main idea in the proof for finite-memory 
it to exploit the right-distributivity property of 
Proposition 3.1. In fact, one could use this 
property alone to define an even weaker notion 
of pointwise fading-memory tor which Theorem 
4.2 still holds. 

It turns out that for T = y,^ or C(,, it is possible 
to construct a destabilizing Q for fading-memory 
77 as follows. 

Theorem 4.3. Let V denote either or Co- Let 
H E have pointwise fading-memory and the 
UINE property. Let Q e y)vi called '‘admis- 
sible” if the operator 7 — QH is causally 
invertible and ||Q|| < F Then the operator 
7 - QH has a stable inverse for all admissible 
Q E y/ only if 

inf II7/5JI-1. 

/c y 

Proof. For clarity of presentaiion, the proof is 


Let /„ = |7^, Tn+i)- follows that 

||(/7/)|,J|^(l + r)||/|,„J|. 

Thus, we can construct admissible Q„ via 
Lemma 2.1 which map77/|/'-^/|/ ,. Finally, set 

U “ X Qn- Since the Q^^ constructed via Lemma 

n 

2.1 do not "‘interact”, it follows that ||(^||<1. 
Since the Q^ exhibit pure delay, (/ QH) is 
causally invertible. Furthermore, (/ - QH)f = 
^ 6 y;; while / e Tims {I - QH) ' is not 

stable over yji ® 

This proof breaks down for V denoting other 
than yj) ^0 The reason is that norm of the 
exciting signal 

/ ~ T S H S + , . . , 

does not remain bounded for 'V other than y^ or 
Co- In fact, we have not shown that (7 - (?77) ' is 
an unbounded operator. Rather, it is the 
asymptotic stability which was violated. 

At a glance, it would seem that the condition 
||77|| < 1 is also sufficient for robust invertibility 
over y^. However, standard small gain argu¬ 
ments would only assure that the operator 
(7- QH)~^ is stable over y* and not necessarily 
y^. Some additional work is required to 
guarantee asymptotic stability. 

Theorem 4.4. Let T denote either or c^^. Let 
G e y/v/ have pointwise fading-memory and 
||G|| < 1. Let (7 - G) be causally invertible with 
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(/ - G) ‘ continuous, i.e. 

Then (/ - C)'' is stable over T. 

Proof. The proof is stated for Y = The case 
where Y = Co follows from similar arguments. 

Standard small gain arguments show that 
(/ —G)“‘ is finite-gain stable over if”. We will 
show stability over if^ as follows. First, we will 
show that if / e if^ has finite duration, then 
(/—G)“*/ will eventually decay, and hence 
belongs to if^. The remainder of the proof 
follows from continuity of (/ — G)~'. 

Towards this end, given any c>0, let G be a 
pointwise finite-memory approximant of G such 
that ||G - G|| ^ e and 1|G|| < 1. Let / e if^ have 
supp (/) = [0, TijJ. Thus g = (/ - G)-'/ e iT. Let 
7’i = FM(g, To’, G) + 1. Then since 

g =f + Gg + iG - G)g, 

it follows that 

(/-P,> = (/-Pr,)Gg 

+ (I-PT,)(G-G)g. 
since G has finite-memory, 

iI-PT)Gg = iI-PT,)G(I-P,,)g. 

It follows that 

|l(/-P-„)gll^l|G|| ||(/-Pr„)g|| + f||g|l- 
Now let Tj = FM{g, T,; G) -(-1. Then similarly 
lia-/’r.)gl|S||G|| ||(/-Pr,)gll + E|lg|| 
^l|G|ni(/-P7;,)g|| 

+ l|G||f IlgIKf ||g||. 

Proceeding recursively with T„+i = 
FM{g, r„; G) + 1 leads to 

ll(/-Pr,)g||^(l+--' + l|G||"-)f||g|| 

+ ||G|r ll(/-P7i,)g||. 

Thus 

limsup ||(/-P7;)gll^:r4?^"ll^ll' 

» 1 — IIOII 

Since e is arbitrary and ||G||—»1|G|| as f—»0, it 
follows that g e if^. 

Thus for any / of finite duration (/ - G)~'f e 
if^. Since any / e can be approximated by a 
finite duration signal, it follows from continuity 
that (/ - G)“‘ is finite-gain stable over if^. 

5. CLOSED-LOOP FADING-MEMORY AND 
ROBUST STABILIZATION 

In this section, we consider the block diagram 
of Fig. 1. Some preliminary assumptions and 
definitions are as follows. The plant, P, and 



Fig. 1 Block diagram for rc/iust stabilization. 

compensator, K, are operators on This 
feedback system is said to be well-posed (cf. 
Willems (1971)) if given any (ui, U 2 ) e K ^ 
there exist unique , € 2 ) which satisfy 

Cl = Ui + Ke2, 
e2 = U2^ Pey. 

such that the mapping <1>(P, is 

causal. Assuming well-posedness, the compen¬ 
sator, K, is said to (incrementally) stabilize the 
plant, P, if 4>(F, K) is (incrementally) stable. 
The compensator K is said to pointwise fading- 
memory (incrementally) stabilize P if 4>(P, K) is 
(incrementally) stable and has pointwise fading- 
memory. 

The robust invertibility conditions of Section 4 
can be used to give necessary conditions for 
robust stability as follows (cf. Shamma (1991)). 
Define the following/am/Zy of plants: 

9“.dd = {P:P = ^o + AW}, 

where A e ^hh ||A|1 < 1, and W e • We 
assume that Padu is such that any causal 
compensator results in a well-posed feedback 
system for every P € 

The problem of robust stabilization is under 
what conditions does a comp)ensator, K, which 
stabilizes P,, also stabilize every P e ^^dd- In case 

sup ||CI>(P, /C)||<oo. 

the compensator, K, is said to uniformly robustly 
stabilize the family 

Theorem 5.1. Let Y denote any one of the or 
€p spaces with pe[l,^). Consider the plant 
family Let the compensator, K, pointwise 
fading-memory stabilize Po such that WK(I - 
PqK)~^ satisfies the UINE property. Then K 
uniformly robustly stabilizes the family only 
if 

inf ||W/C(/-Po/C)''5J|<L 

In case V denotes ^ or Cq, then K robustly 
stabilizes the family only if 

inf||WA:(/-P„/^)-*5J|<l. 
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Proof. The proof follows from the results of 
Section 4 and slight modifications of the proof of 
Theorem 4.1 in Shamma (1991). 

A key assumption in Theorem 5.1 is that the 
closed-loop operator WK{l-Fy^K)~^ exhibits 
pointwise fading-memory. In case P is linear 
(possibly time-varying), it is possible to para¬ 
metrize all fading-memory incrementally stabi¬ 
lizing compensators as follows. We will use 
factorization representations of P and K as in 
Verma (1988) to develop conditions for clo.sed- 
loop fading-memory. 


Definition 5.1. Let H : F,. Then H = ND ' 

is said to be a right-coprime fractional repre¬ 
sentation (r.c.f.r.) of H if 
( 1 ) 

(2) D is causally invertible. 

(3) There exists an F e such that 



i.e. the matrix operatoi 



has an stable 


left inverse. 


Theorem 5.2. Let Ki - XY ' be a linear 
compensator which stabilizes the linear P = 
where XY~^ and NM ' arc r.c.f.r.s with 
N, M, Xy and Y all linear. Then all pointwise 
fading-memory incrementally stabilizing non¬ 
linear compensators are parametrized by 

K = {X F MQ){Y FNQ)-\ 

with Q incrementally stable with pointv^isc 
fading-memory. 

Proof. Let ^(P, K) be incrementally stable with 
pointwise fading-memory. From Verma (1988), 
Khargonekar and Poolla (198b), all incremen¬ 
tally stabilizing controllers take the given 
parametrized form. Since P and Ki are linear, 

/M -(X + MQ)\ 

\-N {YfNQ) J 
_ / M -^\ /O 
~ V_/v T / VO NQ I 

/ M -XX-'/O -Me\\ 

\-N y / VO NQ )) 

-Cs -X X 


Thus 


a:) = (_p 



-1 


0 

” \ 0 y 4 

II QUM 

Vo //V-N 

This implies 


<!>(/’. a:)( 

M -A-X 
~N Y ) 

(M 

® V 

” Vo 

Y^-NQl\ 


MQ \ 

“VO 

Y ^Nq) 


) 





Since the left-hand-side has pointwise fading- 
memory, it follows that the operator MQ has 
pointwise fading-memory. Similarly, Y NQ has 
pointwise fading-memory, and hence so does 
NQ. 

Since both MQ and NQ have pointwise 
fading-memory, the operator 



has pointwise fading-memory. Since P was 

(N\ 

.staoilized by the linear Ki, \ has a stable 

\M' 

linear left inverse, which implies Q has pointwise 
fading-memory. 

To show the converse, let Q have pointwise 
fading-memory. That d>(P, K) has pointwise 
fading-memory follows from 


/ / -K\ ' 

/M MQ \/ M - A'\ ‘ 

"Vo y) ■ 

That is, d>(P, K) is the composition of 
continuous fading-memory operators. 


b CONCLUDING REMARKS 
In this paper, we have investigated the 
fading-memory property primarily in the context 
of robust stability. Some possible directions are 
the following. One direction is the extension of 
these results to the case of structured dynamic 
uncertainty. Another direction is determining 
what classes of nonlinear systems have fading- 
memory. Finally, there is the issue of norm- 
compulation of nonlinear systems (Nikolaou and 
Manousiouthakis (1989)). 
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Bibliography on Robust Conti ol* 

P. DORATO.t R. TEMPO* and G. MIISCAT05 

I his bibliography includes a compilation of selected books and journal 
articles written on the subject of Robust Control during the period 

1987-1991. 


Key Words—Bibliogritphy, conlrol systems, robust control, uncertdinty 


Abstract —This bibliography includes a compilation of 
selected books and journal articles written on the suiijecl of 
Robust Control during the period 1987-1991 Papers arc 
further divided into theory and applications The theoretical 
subject matter of each article is identified by appropriate 
acronyms and application papers are separated into specific 
areas 

1 INTRODUCTION 

iNTHEVOiuMt “Robust Control” (Doralo, 1987) 
there is a fairly comprehensive bibliography of 
papers and books written on the topic of robiist 
control up to the year 1986. The purpose of this 
compilation is to update that bibliography to 
cover the period 1987-1991. The only exception 
we have made to citations prior to 1987 is the 
seminal paper of Kharitonov (Kharitonov, 1978) 
on stability of interval polynomials. Tor earlier 
citations on this subject, The interested reader 
may consult the reference list in the book of 
Bhattacharyya (1987). 

In order to keep the bibliography to a 
manageable size, a number of topics related to 
the problem of analysis and design of control 
systems in the presence of uncertainty have been 
omitted. In particular, we did not include papers 
and books on the following subjects: (i) 
Adaptive control; (ii) robust-adaptive control; 
(lii) fuzzy control; (iv) variable structure control; 
(v) stochastic control; (vi) robust estimation. For 
the same reason, we have also omitted 
conference papers, very brief papers and 
surveyed only the English-language literature on 
the subject. 

* Received 7 October 1991, icvised 6 April 1992, received 
in final form 22 June 1992 This paper was not presented at 
any IFAC meeting This paper was recommended for 
publication in revised form by Guest Editor R F Curtain 

t Department of Electrical and Computer Engineering, 
University of New Mexico, Albuquerque, New Mexico, 
USA 

*CENS-CNR, Pohtccnico di Torino, Corso Duca degli 
Abruzzi 24, 10129 Torino, Italy 

Dipartimento Elettneo, Elettronico e Sisiemistico, 
University di Catania, Viale A Dona 6, 95125 Catania, 
Italy 


The bibliography is divided into two main 
parts; books and journal articles (see the list of 
journals in Section 3). Journal articles are 
further divided into theory and applications. 
Each paper is listed just once in the Reference 
section and is coded by acronyms (sec the list in 
Section 3.1) to help the reader identify the 
theoretical subject matter of the article. 
Application papers are broken down into various 
areas so that readers may have easy access to 
applications of specific interest. 

2 BOOKS 

Under the book section of our reference list, 
wt have compiled texts, lecture notes, reprint 
volumes, software userguides and conference 
proceedings on the subject of robust control 
published in book form during the period 
1987-1991. We also include a few books printed 
at the beginning of 1992. 

1 JOURNAL ARTICLES 

Journal articles were compiled from the 
following journals. 

ASME J. of Dynamic Systems, Measurement and 
Control, Automatica, I EE Proceedings—Part D, 
IEEE Control Systems Magazine, IEEE Trans¬ 
actions on Automatic Control, IEEE Transac¬ 
tions on Circuits and Systems, International J. of 
Control, International J. of Robust and Non¬ 
linear Control, J. of Guidance, Control and 
Dynamics, J. of the Franklin Institute, Mathema¬ 
tics of Control, Signals and Systems, SIAM J. on 
Control and Optimization, Systems and Control 
Letters. 

3.1. Theory 

The theoretical subject matter of each paper 
is identified by appropriate acronyms (see the 
list below) written at the end of each citation. 
Many of the acronyms correspond to standard 
usage in the literature, for example [LOG] 
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for Linear-Quadratic-Gaussian. Others were 
created expressly for this bibliography, for 
example [KRF] for Kharitonov methods for 
rational functions. We have elected not to 
explicitly divide papers by the classification of 
structured/unstructured perturbations, although 
this could be a useful separation of robust 
control papers. Instead, we have separated our 
categories in terms of nominal models, theoreti¬ 
cal approaches and performance measures. 
Thus, for example, state space and matrix 
methods [SSM] may include papers with 
structured or unstructured perturbations, but 
with state space models subsumed. Also, since 
the literature on //* is vast, we have attempted 
to be more selective in this area and to include 
only papers directly related to robust control 
problems or theoretical papers of special 
significance. 

List of acronyms 

DPS—Distributed parameters/time-del ay sys¬ 
tems; DTS—Discrete-time systems/sampled- 
data; FDM—Frequency-domain methods; 
GCC—Guaranteed cost control; 
methods for robust control; HINF—//“ methods 
for robust control; KHAR—Kharitonov theory 
and polynomial methods; KRF—Kharitonov 
methods for rational functions; LONE— 
methods for robust control; LOG—Robust 
linear quadratic Gaussian; LSS—Large scale 
systems; LTR—Loop transfer recovery; 
MIMO—Multi-input-multi-output; MOD— 
Uncertainty modelling; MSN—Mixed sensitivity 
methods; MSU—Mixed structured/unstructured 
perturbations; NLC—Nonlinear vs linear con¬ 
troller, NTV—Nonlinear/time-varying systems; 
OBD—Observer based design; OFT— 
Ouantitative feedback theory; OSM—Ouadratic 
stabilization methods; RCO—Robust control¬ 
lability and observability; RMM—Robust model 
matching; RMO—Robust multiobjective design; 
ROD—Reduced order design; RPA—Robust 
pole assignment; RPM—robust performance 
methods; RSP—Robust stabilization problem; 
RST—Robust servo/tracking problems; SPR— 
Strictly positive real methods; SSM—State space 
and matrix methods; SSV—Structured singular 
values; SURVEY—Survey papers; TUTORIAL 
—Tutorial papers. 

3.2. Applications 

The application papers are separated into 
various areas so that the reader can easily 
identify the subject of interest. The appropriate 
acronym identifies the theory used in the work. 
It should be noted that most applications found 
in journal articles might be called more properly 


“feasibility studies'', since they seldom deal with 
hardware implementation on a physical system. 
The application areas surveyed here include the 
following: DC-motors, electromechanical, en¬ 
gine control, flexible structures, flight control, 
mechanical systems, process control, robotics, 
thermal systems. 

It should be noted that many application 
papers are reported at conferences, rather than 
published in journals and hence do not appear in 
this bibliography. The following conferences 
typically include papers on applications of robust 
control theory: 

AIAA Conference on Guidance, Navigation and 
Control, AlChE Annual Meeting, American 
Control Conference, IEEE Conference on 
Decision and Control, IEEE Int. Conference on 
Robotics and Automation, IFAC Symposia and 
World Congress, Int. Symposium on Industrial 
Robots, Int. Symposium on Robotics Research. 

4 CONCLUSIONS 

The 15 year period 1972-1987 could properly 
be called the developmental period for what we 
now call robust control theory. Included in the 
theoretical developments of this period we have: 

methods [HINF], guaranteed-cost control 
[GCC], Kharitonov methods [KHAR], loop- 
transfer-recovery theory [LTR], mixed- 
sensitivity methods [MSN], quadratic stabi¬ 
lization methods [OSM], robust pole assignment 
[RPA], robust servo/tracking methods [RST], 
state-space and matrix methods [SSM] and 
structured-singular-value methods [SSV]. How¬ 
ever, the theory developed during this period 
was largely divided between structured and 
unstructured methods, with good analysis results 
for the former and good synthesis results for the 
latter; and between robust stability and nominal 
performance. This bibliography indicates a 
continued interest in these topics; however it 
also indicates a number of new directions 
including: robust multiobjective design methods 
[RMO], the most notable example being //^///* 
methods [HHM]; mixed structured/unstructured 
perturbation methods [MSU]; /j methods 
[LONE]; robust control of nonlinear and 
time-varying systems [NTV]; robust control of 
distributed parameter systems [DPS]; robust 
performance methods [RPM]; and passivity 
methods [SPR]. 

A major post-1986 theoretical result was the 
two-Riccati solution to the //* control problem 
(Doyle et al., 1989). This work reduced the f/* 
problem to the solution of two uncoupled Riccati 
equations and provided an upper bound on the 
complexity of the //“ controller, equal to the 
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McMillan degree of the “augmented’’ plant. 
Other notable theoretical developments include 
new results on the mixed H"IH" problem and 
associated coupled Riccati equations; the design 
of reduced order compensators and the use of 
Kharitonov methods for mixed structured/ 
unstructured problems. 

In recent years, the major applications of 
robust control theory appear to be in the 
following areas: flexible structures, flight control, 
process control and robotics. In addition, a 
number of robust control software packages 
have been developed, including “/i 7’oolbox”, 
“Robust-Control Toolbox \ “Matrix,” and “Pro¬ 
gram CC’'. 
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Abstract —We consider real polynomials whose coefficients 
depend polynomially on the elements of an uncertain 
parameter vector. The size of perturbation is characterized 
by the weighted norm of the parameter vector. The smallest 
destabilizing perturbation defines the stability radius of the 
set of uncertain polynomials. 

It is shown that determining this radius is equivalent to 
solving a finite set of systems of algebraic equation.*, and 
picking out the real solution with the smallest norm. The 
number of systems of equations depends crucially on the 
dimension of the parameter vector, whereas the complexity 
of systems of equations increases mainly with the kind of 
polynomial dependency and the degree of the polynomial. 
This method also yields the smallest destabilizing parameter 
combination and the corres|X)nding critical frequency For 
two or three parameters this transformed problem can be 
solved using symbolic and numeric computations 

1. Introduction 

Consider a control system with uncertain real parameters. 
Assume that the controller was designed for some nominal 
value of the parameters. How far can the parameters be 
changed until the system becomes unstable? A crucial role 
in solving this problem is the type of dependency of the 
coefficients of the characteristic polynomial. For linear 
(affine) dependency there exist several methods to determine 
the stability radius. 'Hiere are both graphical and analytical 
procedures by Barmish (1989), Chopellat and Bhattachaiyya 
(1988), Kacsbaucr and Ackermann (1990) and Polyak and 
Tsypkin (1990). A much more difficult problem arises when 
we assume multilinear or polynomial dependency. Vicino ef 
fl/. (1990) transform the problem into an optimization 
problem and present a numencal algorithm to find the 
solution. Murdock and Schmittendorf (1991) solve the same 
problem using a genetic algorithm. We will show that the 
problem can also be solved in an analytical-numerical way 
that is we have to solve sy.stems of algebraic equations. We 
first demonstrate how to derive these systems of equations 
and then how to find the solutions. 

2. Problem formulation and basic results' 

Given a family of polynomials 

P{s, q) = fl„(q)j'' + ■ • * + fli(q).y + M*!)' (^) 

with q e and n/(q) e ^[qi, 92 - • » 9/1 (* = 0, 1, - , n). 
The nominal polynomial P{s, 0) is assumed stable. Find the 
maximal p s.t. P{s,q) is stable for all ||q||;,<P P ‘S called 
the stability radius. 


‘Received 3 September 1991; received in final form 17 
March 1992. This paper was not presented at any IFAC 
meeting. This paper was recommended for publication in 
revised form by Guest Editor P. Dorato. 
tDLR, Oberpfaffenhofen, Germany. 


Concerning the choice of the norm, i.e. p, there arc three 
important possibilities. For p = qo the set of admissible q 
describes an /-dimensional cube. Dual to this norm is p = 1 , 
which corresponds to a diamond, p = 2 yields an 
/-dimensional sphere in ^-spacc. From the practical point of 
view the case p -= «' is the most important one, because there 
the uncertain parameters vary independently. We will handle 
the case p = » in detail. For the other cases it is not difficult 
to derive the corresponding results. 

It is known that the stable set of q is bounded by three 
hypcrsurfaces, namely flo(q) 0, fl„(q) = 0 and H„ ,(q; ^ 0. 
The last equation is the last but one Hurwitz determinant this 
1 $ the critical one. It results from the elimination of o from 
the two equations RcP(;ai) = 0 and Im P(y(o) = 0. Here 
linear or nonlinear dependency make a big difference. For 
fixed w Re F = 0, Im P = 0 represent in case of linear 
dependency a linear manifold; this means for example that 
for / = 3 W 2 (q) = 0 is generated by the continuoUwS movement 
of a ..traight line. In the case of nonlinear dependency wc 
have a set of curves in which generate = 0 . Evaluating 
H„ ,(q) must in general already be done by .symbolic 
computations. 

All three equations of the hypcrsurfaces will be treated in 
the same way The only difference is that the third one will 
be the complicated one with respect to the number of terms 
and degree of the terms. In the sequel we use F{q) = 0 for 
each of these equations. 


3. Two or three parameters 

We consider first the case of two parameters, where we 
can demonstrate the basic idea. The polynomial family with 
llqll^ ^ e (e sufficient .small) is stable and can be described by 
a square of side length 2c We enlarge this square 
continuously until there is a pioint of intersection with the 
curve F(^j, 1 / 2 ) = ^ point may he on a vertex or on an 
edge of the square. The first situation is characterized by the 
fact that qi — qj 9 i — which results in the two 

polynomials 

/•'(-/>. <7,) = 0, f(9,,-9,) = 0. (2) 


The second case is an intersection point on an edge. This 
means that Fiqy,q 2 )^^ 3 horizontal or a vertical 
tangent, lliis necessary condition leads to the two systems of 
equations in two unknowns 


F ( q ,. <72) = 0, 


f(< 7 i. <72) = 0, 


* 72 ) _o 
dq, 

Ql) ^ Q 

dq^ 


( 3 ) 


It may be possible that in the intersection point the curve 
cannot be differentiated, i.e. the curve has for example a 
cusp and both partial derivatives vanish, but this is already 
covered by (3). Solving the two polynomials and the two 
systems ot equations give us a set of points (ijf,, qj). The 
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point with the smallest norm determines the stability radius. 
For the first case we have ||(±^, ±q)\\^~\q\ and for the 
second case ^ 2 ) 11 )* ” (kiM 92 l)- The critical 

parameter combinations {q^, qj) also Axes the critical 
frequency. The polynomial P(s,qi,q 2 )~^ has a root at 
j = 0 or at j = 00 or a root pair at j = ±jaj. All other roots are 
not in the right half plane. 

Consider now the case of three parameters. Instead of a 
curve we now have a surface qj* Qn) = ^ which bounds 
the stable polynomials and a cube is enlarged now. The 
intersection points lie either on a vertex, on an edge or on an 
surface of the cube. Corresponding to the eight vertices of 
the cube we have the four polynomials 

F(q, +q, +q) = Q, 

F(q. +9. -9) = f*. 

F{q, -q, +(?) = 0 . 

F(q. -q, -q) = (). 

In the case of 12 edges, two intersection points must 
coincide, i.e. the partial derivatives must vanish and we have 
to solve the six systems of equations 


+</. +^?) = o. 




+</) = 0, 


q2> 


F{-^q, +q, ^3) = 0, 


F(+(/, -q, = 


dF^ 

i/2~ 

dF^ 

dF 

OF 

11 . 

^q^ </i«4<y,,y2*“ ^ Q 

IL 

^q^ q\^-Fq,q2^-^ 


-0. 
= 0 . 


= 0, 
= 0 , 


in two unknowns. 

If the intersection is on one of the six surfaces then the 
normal vector of the surface F{q^, qj. q^) = 0 is parallel to 
one of the coordinate axes which means that two of the three 
partial derivatives vanish simultaneously and we have the 
three systems 


constructs a so-called Grbbner base. Starting from the 
equations F|(q) = 0, F 2 (q) = 0, . . , F,(q) = 0 another system 
of equations is derived, which has the same set of .solutions 
as the original one. For a zero-dimensional solution set (that 
means that the system has a finite number of solutions) this 
may look like 

F,(q) = 0 C,(?,) = 0, 

f=(q) = 0 Gj((?„</j) = 0, 

: : 

F,(q) = 0 G,{q,, q 2 , ■ ■ ■ , qi) = 0. 

Tliis new system has a similar form like the triangular form in 
the case of a system of linear equations. This algorithm is 
implemented in algebraic and symbolic computation 
packages like REDUCE or MATHEMATICA. Unfortun¬ 
ately there is until now a severe restriction. Because of time 
(and storage) limitations it is only possible to attack 
problems with, at most, five or six parameters. 

The second step is the numerical solution of the 
transformed ''tridiagonal" .system. Finding the roots of the 
first equation (a polynomial) gives the first component of the 
solution vectors. Substituting this value into the second 
equation yields again a polynomial in one variable. Thus the 
complete solution vector is obtained. It is not necessary to 
always find the complete solution vectors. We can omit for 
example the non-real solutions. Additionally if one or more 
components would lead to a norm greater than the norm of 
already known solutions then this candidate cannot lead to 
the stability radius and its computation can be stopped. 

5. Example 

Given the family oi polynomials 

P{s, q^, q 2 , q^)- + 03 ^^-+ a^s" -I- fl,A’ + a^^, 


with 

fl j = 20 -K 2qy + 3i/,, 

^2 “ ^24 -t- 32 q y -I- 42^^ -I- 

fl, = 1040 -h 800q, + 120^72 + \2{)q, + 60^293. 

fln= 1600+ 1600i/,. 

The bounding surfaces are 

a^^{q^, ^2- qd" J600 + IfiOOt^i ~ 


aF dF 

/^(9..9z,93) = 0, -=0. --=0, 

f(9i.92.9,) = 0. ^=0, ^ = 0, 

dq, dqj 
dF 3F 

F(9..93.90 = 0, j—O. 


in three unknowns. 

For />3 parameters the number of polynomials and 
systems of equations obviously depends on the number of 
subpolytopes of an /-dimensional cube. The derivation of the 
polynomials and systems of equations for arbitrary / is 
straightforward. But now occurs what is called the 
combinatorial explosion. The number of systems grow 
exponentially with the number of parameters. 

Using other norms (p — 2 or p - °°) gives similar 
equations. In any case we have always equations which are 
linear combinations of F and their partial derivatives. For 
p = 2 the number of systems is smaller (no subpolytopes), 
but the equations are more complicated In that case at the 
intersection point the vector q mu.st be parallel to the 
gradient of F, i.e. we have only one system of equations: 

4. Solving a system of algebraic equations 

Finding all solutions of a system of algebraic equations is 
split into two parts. First the algorithm of Buchberger (1985) 


and 


qif q^) ~ 


^21qlq]+- ■ +21,440-0. 

The Hurwitz determinant, not reported explicitly for space 
limitations, has 24 terms. The stability radius with respect to 
the real root boundary is = 1. The complex root boundary 
yields p = 0.342 with the critical frequency aj* = 8.23. This 
value is a root of the polynomial 

H^iq, -q, -q) = 0. 

The intersection point lies in a vertex of the cube. For 
comparison also the results for p = 2 and p = 1 are given. For 
these norms the radius with respiect to the real root boundary 
is also pp— 1 . For the Euclidian norm the radius with respect 
to the complex root boundary is p = 0.584 with the 
parameters = 0.383, q 2 = —0.255 and ^ 3 =-0.358 at the 
frequency uj* - 8.33. Using the || ||i-norm the critical system 
is 

H,(q„i), </ 3 ) = 0 , 

^ .0, 

and the solution is 0.545, q 2 — 0, ^^ 3 =—0.371 with 
p = 0.916 and a>^ = 8.71. The intersection point lies on an 
edge of the hexaeder. 
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6 . Conclusion 

We have shown that in case of polynomial dependency the 
stability radius can be found by solving systems of algebraic 
equations. These equations arc derived from the bounding 
surfaces. The difficult problem of finding all solutions of 
systems is carried out with symbolic and numeric methods. 
The restriction to few parameters seems to be severe, but a 
quotation of Lazard (1991) on software for the solution of 
algebraic equations gives hope for the future. ‘Five years 
ago, problems with four or five unknowns were outside of 
the capabilities of most available softwares. Recent 
progresses make or will make accessible problems with six or 
seven unknowns.” 
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Abftract—In this paper we present a new algorithm for 
solving robust performance problems by A^-synthesis. 
Convergence properties of the new algorithm arc considered 
and demonstrated by examples. The algorithm called fi-K 
iteration, works by flattening the structured singulai value ft 
over frequency. 

1. Introduction 

Robust performance is said to be achieved if the design 
specifications of a controlled system arc satisfied in the 
presence of disturbance signals and model uncertainties. A 
general framework for analysing robust performance using 
the structured singular value ^ as a measure of performance 
was introduced by Doyle (1982) who later proposed a 
controller synthesis procedure, called ^-synthc-LS In this 
procedure a controller is sought which minimizes , or which 
achieves level of performance arbitranly close to the 
optimum fi. This minimization problem has not yet been 
solved, but in Doyle (1985) an approximate solution is given 
involving a sequence of minimizations, called D-K iteration 
The purpose of this paper is to introduce an alternative to 
D-K iteration, called fi-K iteration. 

The robust performance problem under consideration can 
be described with the help of the now standard feedback 
configuration shown in Fig. 1. 

P is the standard plant which defines the interconnection 
between the actual plant and the controller, the lucalion.s of 
pertubations representing uncertainty, and the performance 
function to be minimized; it also includes weighting functions 
used to describe the uncertainty models and the performance 
specifications 

A is an n x n block diagonal matrix o( perturbations 
representing uncertainty except for one block which is used 
to characterize performance Mathematically A is an element 
of the set 

A= {diag[6,/^j, . . . , <5,7^ , A,, . . , Af ]'t C, 

( 1 ) 

where s is the number of repeated scalar blocks, / is the 
number of full blocks, and 

2 r, + V m, = n. (2) 

If, as is usual, o(A) ^ 1, then A is an clement of the bounded 
set 

BA= {Ae A:c7(A)^ 1}. (3) 

K is the controller to be designed, and M is the 
interconnection matrix between P and K given by the linear 

* Received 4 September 1991; received in final form 1 June 
1992. This paper was not presented at any IFAC meeting. 
This paper was recommended for publication in revised form 
by Guest Editor B. A. Franci.s. 

t Control Systems Research, Department of Engineering, 
Leicester University, Leicester LEI 7RH, U.K. 


fractional transformation 

M = F;(F, AT) := P,, + P„Ar(/ - P^,K) 'P^,, (4) 

with P partitioned according to its inputs and outputs as 
shown in Fig. 1. 

The structured singular value of M with respect to A is 
defined as 

mm (f»(A):A6 A. det (/-M A) = 0) ' 

= 0 if del (/ - MA) # 0, VA 6 A (5) 

The robust perlormancc design problem (/u-synthcMi) can 
now be stated as 

inf sun nJM), (6) 

A. siablliiing ek ^ ' 

where ^ denotes the set of real number. This problem has 
proved difficult to solve and a solution i.s still not available. 
However, an approximate solution has been given by Doyle 
(^985) based on the following bound 

M^(Af)= inf aiDMD (7) 

DitD 

where the set D is defined by 

D = {diag [D,, . . . , d,/„„.. . ] :D,e 

D, = D;>0. d,>0}. (8) 

The idea is lo look for a solution to 

inf sup inf d\DMD ']. (9) 

K Mubill/ing OlI> 

even though the upper bound is not always equal lo An 
approximate method for doing this is to solve (9) first for K 
keeping D constant, and then for D keeping K constant, and 
so on. For a fixed D (9) is an ^ optimization problem, and 
can be solved by various methods. For fixed K (9) can be 
solved at each frequency by solving a convex optimization 
problem m D By taking a sufficient number of frequencies 
over a sufficiently wide frequency range the scaling matrix D 
can be approximated by a stable matrix of real rational 
functions with a stable inverse. This sequence of 
minimizations, known as D-K iteration, is not guaranteed to 
converge lo the minimum, but it nevertheless offers a 
systematic procedure for addressing the im(>ortant problem 
of robusi performance. 

The new procedure. fi-K iteration, which will be 
presented here is motivated by the following. 

• In Helton (1985), it is stated that many optimization 
problems have the property that an optimum solution must 
make the objective function constant in w almost 
everywhere. 

• In many examples, using D-K iteration it can be 
ob.served that the “^-optimar’ controller appears to flatten 

at least over the bandwidth of the system; constraints 
on M, for example at high frequencies, can cause a changi in 
the general level of 
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Fig, 1. Standard feedback configuration. 


The idea then in the new algorithm is to determine a 
sequence of controllers which yield a flat structured singular 
value. This after all is what happens in optimization 
where the ^-optimal controller results in a cost function 
with a flat maximum singular value. 

To get some insight into how this might be done we will 
consider a specific robust performance problem for a 
single-input-single-oulput plant. This is covered in Section 2. 
Then in Section 3 the fi-K iteration algorithm is presented 
for the general robust performance problem. Convergence of 
the algorithm is considered in Section 4, and two illustrative 
examples are described in Section S. Conclusions arc given in 
Section 6. 


2. A SfSO robust performance problem 

Consider the control system configuration of Fig. 2, with 
the following nomenclature 

(7,) is the nominal plant, with multiplicative input 
uncertainty; A 2 is the normalized model error, HA^IU 1; W. 
is the (model) error bounding function; W, is the 
performance weighting function; A, is the normalized 
fictitious uncertainly to characterize performance, 1|A,|C 1. 

The configuration can be rearranged into the standard 
M-A structure of Fig. I, by setting 

A-diagfA., A,) (10) 


In which case, the interconnection matrix M is given by 


M 




1 - 


( 11 ) 


where 


T„:^KC„(l + KC„)'. ( 12 ) 

is the nominal complementary sensitivity function, and 

\ (13) 


is the nominal sensitivity function. 

The robust performance problem is to find a stabilizing 
ccmirollcr K such that the ^ norm of the transfer function 
from d to e is less than 1 for all perturbations A 2 , HAjlC ^ 1 
This is equivalent to finding a stabilizing controller K such 
that Pa(M) 1. and therefore it makes sense to try to solve 
( 6 ). 

For this relatively simple interconnection matrix M the 


following facts can be shown. 


Fact 1. 

Voj e 

(14) 

Fact 2. 

d{M) - IIMil,., the Frobenius norm of M. 

( 15 ) 

Fact 3. 




where | ■ | denotes modulus. 

(16) 



Fig. 2. A SISO robust performance problem. 


Fact 4. If at some frequency 

HV,5„G„| = |W2 7;,G,7‘| (17) 

then 

(18) 

at the same frequency w,,. 

Suppose now that an ^if^-optimal controller is found for 
M, i.e. we solve 

inf ||M(/C)|U. (19) 

It is well known that (J\M(K^^)] is flat over frequency, and 
from the above facts (and our observations) fiA[Af(/Cf,)l will 
often have a bandpass-like characteristic as illustrated in Fig. 
3. A little thought suggests that a controller which forces 
p^iM) to be flat will result in a convex a(M). Suppose then 
that we multiply M by a bandpass-like rational function r(j) 
similar to the shape of jU^[Af(Akn)] and calculate the 
optimal controller K^ for the product rM. One might then 
expect to be convex with p^[M{K^)] flatter than 

This leads us into the p~K iteration algorithm 
presented for a general multivariable problem in the next 
section. 


3 . p~K iteration 

The above discussion motivates the algorithm now 
proposed for finding an approximate solution to the general 
robust performance problem 

inf sup p^[F((P, ^C)|. (20) 

k slahili/iiiK •'> ^ ^ 

The basic strategy is to ‘‘flatten” the ^-curve. 

Step 1. 

Find the stabilizing ^^-optimal controller (a variety oi 
methods exist) 

AC„:= arginf ||F,(P, A:)|U. (21) 


The optimization is over stabilizing K, but for notational 
convenience in (21) and much of what follows the word 
“stabilizing” ha.s been omitted. 

Step 2. 

Find the / 4 -curvc corresponding to K^^ (the Matlab toolbox, 
/i-'Tbols, could be used for this) 

( 22 ) 


over a suitable range of frequencies. 

Step 3 

Normalize ^i)(/fn) by its maximum value, i.e. determine 






(23) 


Step 4. 

Find a scalar stable minimum phase real rational function 
^„(a') by fitting to the /i,,(>(u)-curve obtained in Step 3. 

Step 5. 

Multiply the interconnection matrix F/(P, AT) by ^o(i). In 
the specific example of ^Section 2 this would correspond to 
multiplying each of the weights W, and W 2 by Pn(s). 



Fig. 3. flic maximum and structured singular values of 











Step 6 

Find the iHT'-optinial controller 
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K,(s) = arg inf /c)||^ ^24) 

Step 1 

Find the (i-curve corresponding to it, 

f'lO") =fiA[/v(F /f,)! (2S, 

over the frequency range of interest 

Step 8 

Normalize Pi(ja)) and denote the result by 
Step 9 

Curve fit piijco) to gel p^{s) 

Step 10 

Find the ^-optimal controller 

= arg inf AC)IU (26) 

Step 11 

Find the /i-curve corresponding to Kj 

M 2 OU 1 ) = /fj)] (27) 

Subsequent steps of the algorithm should now bi clear 
and in practice would be continued until the p curvi was 
sufficiently flat over the frequency range of interest or until 
the desired level of f)crformance (as measured b> the peak 
value of p) had been reached 


4 Convergence 

In this section we consider the convergence properties of 
the proposed p-K algorithm The algorithm generates the 
following sequences 

ATn = arg inf ||F;(P K)\U Mn - /^o)l 


/C, = arginf||^(,F;(P f()\U K^)] 

K^= Mg inf Wfi,pufiiP ^)IL M2 = Ma(/'i(P A^2)I 


Sup|X)sc that we normalise each ol the p functions by 
dividing each curve by its maximum value 


That IS 

-0 (14) 

Fhe sequence is therefore monotonically decreas¬ 

ing and bounded, and by the Bol/ano-Weierstrass theorem 
(Bartlc 1%6) it has a limit point That is 

limit point, as n—(^5) 

We now present a reasoned argument for believing that the 
sequence {p^) will converge to a frequency independent 
function equal to 1 

First, a Lemma which follows from Helton (1985) 

Lemma I et y( ) be a well posed’ cost function, ic it 
satisfies Helton s assumptions 1 hen if for a given controller 
K, a|y(AJ] is frequenev dependent, then there exists 
another controller A, such that tT[y(A^)] is frequency 
independent and 1|7(A,)|U < ||y(A )|U ■ 

Next let 

J„{K)^p„ , p,LiF f^) (^b) 

and assume that Heltons assumptions are satisfied Then 
with this notation we have from (13) that 

ThwCLlore if i fiequency dependent we have by the 
Lemma that 


•ntl|/i,y.(A)|U \\pMK„)\U (18) 

or equivalently 

L. ('^9) 

But wc have already shown that the sequence {f„) converges 
and therefore the sequence (jU^) must also converge to a 
frequency independent function (which must be one by 
normalization) otherwise (f„) may well decrease below the 
positive limit 

The above argument is clearly laiking in rigour but it docs 
offer support to the observed effectiveness of the algorithm 
One can also imagine the appioach hading to a local 
minimum where p is flat but where a controller exists which 
achicvf's a lower p curve which may or may not Ih flat Like 
D-K iteration therefore p A iteration is not guaranteed to 
converge to the minimum If it wcic suspected that p K 
Iteration had nol given the minimum then it might be 
possible to gel a belter result after first rescaling M using a 
D matrix 


I hen. It IS easy to sec that 

0 < 1 , Vm e and |||u„||, = 1 n -0 I 2 

( 10 ) 

Consider the infinite sequence defined by 

c, =\\f,(P /Co)IU = inf||F,(P A)|U 


5 txamptes 

I wo examples arc given to illustrate the application of the 
p-K iteration algorithm Lxample 1 is SISO Lxample 2 is 
MIMC) 

Example 1 In this example we solve the following robusl 
performance problem 


C, = UMP. /(:,)iu = inf ||>1„F(P, /(:)iu 

C2 =\\iiilt«F,{P. K^)\L-inf \\ii,p„F,{P 

(II) 

Now because a\li„ i ftiiF/iP ^(„)] i' constanl in oj it 
follows from (30) that 


\\ii„ , KJIU 

= IIM„ . ftoF,(P, F„)\L (32) 


and hence 


c„ = IIA, I ii„Fi(P. K„)\\- 

= \\Pi,Pn 1 Ao^V(^’ 

sinfllM, I fiofi(P'F)\\- 
K 

= C„ + i 


where 


and 


inf \u^p^{M) 


A SI ihili/mv 


M 


W,7„G„' -W,W,7„0,/ 

S„ W,i„ 

MW)~HV,S„| + |W'7;,6„'| 




(j+2)(i+0 5) 


1 - 1 - 


W,(j)-50- 


1 245 


^ 0 007 


1 + 


W:,U) = 0 1256- 


0 502 


1 + ; 


(33) 
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Fig. 4. System with additive uncertainty 


The design problem corresponds to meeting disturbance 
rejection requirements m the presence of plant uncertainty 
modelled by an additive perturbation; see Fig. 4 
Bode magnitude diagrams of the weighting functions and 
the open-loop gam are shown in Fig. 5. The p-curve is 
approximately flat after three fi-K iterations as shown in Fig 
6. The controller sequence (after model reduction using 
balanced truncation) was 


^o(^) = 1.923 


_ (5 + 2)(s 0 5) 

(j + 6 007)(j -H 2 723) ’ 


i:,(j) = 2.032 


(j -H 2 053)(j -t- 0.498) 
(j^-0.007)(j +3.175)’ 


^:2(J)-2.031 


(J-^2.114)(J^-Q5) 
(s -I- 0.007)(j -I- 3 293) 


The Bode magnitude diagram of /l 2 (j) is shown in Fig. 7 
The order of the fits used to model /i„(j) and /ii(j) were ' 
and 4, respectively 

The problem was repeated using D-K iteration, and 
similar level of accuracy was achieved after just twe 
Iterations, with a second order D scahng The “optimal' 
controller was 




(j-h 2 119)(j-l-0 5) 

(s + 0.007)(j + 3 307) ■ 


Example 2. This MIMO example is taken from the 
MATLAB toolbox manual, p-TOOLS (Balas et al , 1991 
where it is used to demonstrate p-synthesis. The problem is 
to meet disturbance rejection requirements in the presence 
of plant uncertainty modelled as a multiplicative perturbatio 
at the plant input The plant model is known as HIMAT anc 
represents a scaled version of a remotely piloted aircraft The 
nominal state-space model is 


A = 


-0 0226 -36 6 -18 9 -32 1' 

0 -1 9 0 983 0 

0 0123 -11 7 -2 63 0 ' 

0 0 1 0 _ 



[rail/sec] 

Fig 5 Bode magnitude diagrams of the weighting functions and the open-loop gam (Example 1) 



[rwdjtcc] 

Fig 6 Bode magnitude diagrams of the p curves of the three p-AT iterations (Example 1) 
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[rad/soc] 

Fig 10 Bode diagrams of the singular values of the ^''‘optimal" controller (Example 2) 


“ 0 0 “ 

-0 414 0 

-77 8 22 4 

0 0 _ 


ro 57 1 0 0 1 
Lo 0 0 57 ij’ 

^ = [o oj’ 


and the weighting functions for the multivanable version of 
the interconnection matrix M as shown in (11) are 


* 5 + 0 03 


W'2(0 = 


50(j + 100) 

5 + 10000 ^ 


where I 2 is the 2x2 identity matrix 
Bode magnitude diagrams of the weighting functions and 
the singular values of the open-loop gam arc shown in Fig 8 
The /x-curves for 11 ^-K iterations are shown in Fig 9 llie 
orders of the models used to fit the /i-curves range from 2 to 
9, and the final controller was adequately represented by a 
lOth order system The Bode diagrams of the singular values 
of this controller arc shown in Fig 10 
A 13th order controller of similar performance was 
obtained for the same problem using D-K iteration (and 
balanced truncation) after just four iterations with all 
Z>-scalings of order 3 


6 Conclusions 

A new algorithm, fi-K iteration, has been presented for 
^-synthesis The accuracy of the algorithm depends on the 
curve fitting of the fiijto) curves In the examples tested so 
far the algonthm compares well with D-K iteration and only 
requires a single scalar function to be fitted over frequency at 
each Iteration Each iteration does, however, require the 
calculation of /i over a range of frequencies, and this 
computation is known to be difficult in general 
Convergence of D-K iteration was more rapid than ^-K in 
the examples shown 

Acknowledgements —The authors arc grateful to the U K 
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Abstract This paper proposes a new approach to determine 
^ optimal control for discrete-time linear systems, based on 
convex programming It is shown that all stabilizing state 
feedback control gams belong to a certain convex set 
well-defined in a special parameter space Ihc Linear 
Ouadrahe Problem can be then lormulated the 
minimization of a linear objective over a convex set The 
optimal solution of this convex problem furnishes under 
certain conditions, the same feedback control gam which is 
obtained from the classical discrete time Riccati equation 
solution Furthermore, the method proposed can also handle 
additional constraints, for instance the oiks needed to 
assure asymptotical stability of discrete lime svsU ms under 
actuators failure Some examples illustrate the theory 

1 Introduction 

Hie theory Oh discretc-time linear systems has been 
developed, historically, as an extension of previous results 
concerning continuous-time systems In the IMbOs the I inear 
Quadratic Pioblem—LQP has been exhaustive!) studied for 
both continuous-time and discrete time systems bcLOnimg 
a well-known technique for control design (Anderson and 
Moore, 1971) The discretc-lime optimal state feedhaik 
solution can be calculated from the solution ol an associated 
Riccati equation It exhibits many good robust properties as 
for the continuous-time cave However the control gam 
depends directly on the matrices model This tact is one of 
the mam reasons for the difficulties to generalize the discrete 
LQP to take into account additional requirements as for 
instance, parameter uncertainties, outpul feedback de 
centralized control or sensors/actuators lailurc Other 
difficulties stem from the fact that neither the set ol 
stabilizing controllers nor the objective function arc convex 

This paper presents a convex approach to solve ihe 
discrete-time LQP It is formulated as a J (2 optimal state 
feedback control problem First, we define a parameter space 
into which a convex set generates all the stabilizing state 
feedback control gams An oplimiziti m pioblem is then 
formulated whose solution furnishes the state feedback gam 
that minimizes the norm of a closed-loop transfer 
function Under certain conditions (to be detined in the 
sequel) this control gain equals the one provided by the 
classical LOP approach, thal is, the associated Riccati 
equation solution 

Furthermore, additional requirements can be easily 
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incorporated in ihc algoiilhm lo support this chum, we 
solve completely the discrete-time linear system slabi- 
lizability problem subject to actuators lailure In this case, 
since only one feedback gam must be stabilizing for all 
prespecified contingencies, we provide an upper bound for 
each closed-loop transfer lunction norm 

2 t^reliminanes 

Let us consider a discrete time linear system whose 
dynamic behavior is given by the following difference 
equations 

r* , I - Ax^ + ^1 

U^=-/IAa (1 ) 

+ Du^, 

where e 'H'’ is the slate variable t is the control 
variable e 'H' is the external disturbance and is 

the outpul variable Matncais A, By /?, C and D have 
apprt priale dimensions and are supposed lo be known 
Without loss of generality, the usual orthogonality hypothesis 
is also made, that is C 7^ = 0 and D'l) ^0 Defining the 
closed-loop matrices A^, - A — B K and C ,, C ~ DK and 
supposing that a state feedback gain K is calculated in such a 
way thal is asymptotically stable the closed-loop transfer 
function from k to z is given bv 

H(2)^r„|zl-A,,| 'B, (2) 

The norm for a stable transfer matrix //(z) can be 
defined as 

lr{//(e '■)//(£'")) d<« 0) 

and can also be calculated from the discrele-timc associated 
(jramians Let be the controllability Gramian of 0,) 
and / J the observability Gramian ol (C ,,, A^,) Then, 

v4„/|Ah-/, + ~ 0 (4) 

A„L.,A^, W,. 4 C.,(„-() (S) 

The M norm is given by 

11/711^ =-1r(C„/,C„)=-lr(R;L.,5,) (fi) 

If we denote by I( the set of all stabilizing state feedback 
control gams /C e the problem of optimal control 

can be stated as follows 

(PI) m.n(||H||= ATeTT) (7) 

Examined in the state feedback parameter space, this 

problem may have a very complicated geometi) In fact, 

neither the objective function p'jr JC are convex with respect 
lo the elements of the control gam K However, it is 

well known (Anderson and Moore, 1971, Dorato and Levis, 
1971) thal the optimal solution of (PI) is given bv 

A {B^PBi + D'D) 'B’^PA. (8) 
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where, under controllability and observability assumptions, 
p g unique symmetric positive definite solution of 

the discrete-time Riccati equation 

A^PA-P-APB^iBiPB.-^DD) 'B^PA^rC^il (9) 

From (8), we note that the optimal feedback gain K depends 
directly on the system matrices A and 82 - Obviously 
(Kwakernaak and Sivan, 1972) this solution can also be 
regarded as the optimal one for the following discrete-time 
LQP: 

2 WC’Cx* + ulD'Dut), 

(P2) “t »_(i (10) 

+ given. 

It is important to remark that, although the optimal control 
gain does not depend on the initial condition jr„, the minimal 
value of the quadratic criterion is such that 

J' = xi,Px„. (11) 

If we rearrange equation (9) in a closed-loop form, keeping 
in mind that CD - 0, we have 

{A - B 2 K)’P(A - B 2 K) - P + (C - DKy(C - DK) - 0. 

( 12 ) 

and we conclude from (5) that P = and ||/f|l2-7* 
provided we choose Note that the necessary 

optimality conditions for (Pi) are given by equations (4)-(5) 
and 


+ D'D)K - B 2 L„A\L, - 0, (13) 

being thus related to the solution of (P2). Indeed, from (13), 
if is non singular then the unique solution is given by (8), 
which is the optimal solution for (P2) and does not depend 
on the initial condition jcq. This fact will always occur in case 
B\) is observable or rank (R,)-n (a sufficient 
condition). This is assumed throughout this paper. 


as well as a symmetric positive dehnite matrix 
L^. /-i->0, satisfying 

= (18) 
As a consequence, there also exists W ^ L^. such that 


+ (19) 


ICeeping in mind that A^.^ = A - BjK and developing (19), it 
gives, for any jt e JK" 

jt'lAWA ~V/ - B^KWA’ -AWK’B^ 

+ B.KWK B.^-^ (20) 


11 ' = {x‘ ■: 0|. 

Since Vu 0 e : G'u = 0, 1 ; — [jt' ; 0], it is easy to verify 
from (20) that the W matrix 


r- 


W 

KW 


WK' 

KWK 


]»„. 


( 21 ) 


belongs to proving thus the necessity. 

Now, the sufficiency. From the fact that every = 5 ^ 
0 e iK'": G'l; = 0 has the form i 0], with Jt=^0ey)l", 

taking W e partitioned as in (17), we have 


iy[FWF'- W-\'Q]v 

= x'lAW^A' ~ - BjW^A' 

-AWjB;,^ B^B[]x 

= j'|(A- ■)VV,(A - ')' 

- W^^B^B[]x 

-Hjr'[R.,(W,- 'W.)R;)x. (22) 


Since W e wc have 

v'lFWF'- >/vEiW'¥^():O'v = 0 

iT'iro^ w;Wi 

Consequently, from (22) and (23), wc conclude that 


3. Main results 

In this section, we propose a new convex programming 
problem which is equivalent to (PI) in a sense to be defined 
in the sequel. From the above discussion, this is necessary in 
order to circumvent the non convexity of First of all, let 
us introduce the following extended matrices F 
p — m rt and C e 


A 

-B, 

G=n 

.0 

0 J’ 

LiJ 


as well as the symmetric matrices Q e R 


B,Bl 0 


0 ■ 

_ 0 oj’ 

L 0 

D'D^ 


and the set %: 

{W=W's:l)-.v'\FWF' - 'W + Q]vrs(), 

Vv6!N'’#0:C;'i/ = 0}, (16) 

where W g is symmetric and is partitioned as 



jc'|(A - ')W,(A ~ B.W;W^ ')' - W, + B^B\]x^{). 

(24) 

The last inequality evidences that the control gain 

K =■ W 2 WJ' is a stabilizing one for the pair {A, Bj), Tlie 

proof of part (b) then follows. Part (c) is obvious, since '^^2 I** 
a convex set (Luenberger, 1973). 

At this point, it is important to notice, from the previous 
theorem, that there exists a one-to-one relationship between 
the elements of (the set of all stabilizing control gams) and 
the ones of the convex set ^ 2 - Indeed, for each W e %, 

fC = ‘ e 'X and, conversely, for each K e there 

exists a matrix W given by (21) which belong.s to % (see 
Fig. 1). 

Now, with the results of Theorem 1, it is possible to handle 
the convex set ^2 order to achieve stabilization, avoiding 
the use of set J(. The next theorem guarantees the optimal 
characteristics to a particular W e ‘€ 2 . which furnishes the 
desired optimal control gain. 

Theorem 2. The optimal solution of (PI) is obtained by 
solving the following convex problem; 

(P3) mm {Tr (RW) :W e ^€ 2 ]. (25) 


with W, 6 being positive delinite. 

Theorem 1. The following statements hold; 

(a) ^2 is a convex set. 

(b) The pair (A, B^ is stabilizable by a linear state feedback 
if and only if ‘€2=^0. Tn the affirmative case, e ^ 2 ^ a 
stabilizing control gain is given by K = W 2 W 1 V 

(c) For a W* ^ there exists an hypcrplane that separates 
ir* from 

Proof. Part (a) follows directly from the fact that is 
defined as the intersection of an uncountable number of 
linear constraints. Let us prove part (b). Suppose the pair 
(A, ^ 2 ) is stabilizable; then there exists a control gain 


Being W* its optimal solution, then K*-W 2 W^^ solves 
(PI) (and consequently solves also (P2)) and Tt (RW*) =J*. 

Proof. From Theorem 1, — being so a 



Fig. 1 . Relationship between sets 5 if and ^ 2 - 
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stabUizing control gain. We just have to prove that this state 
eedback^ mdeed the optimal one. F.rst, let us prove that 
for A-iTjIVi . we have ||//||^ s Xr 'i' 

this end, note that for any W belonging to W, is positive 
definite and " * 


(A + (26) 

then we conclude from (4) that W, Now usinv 

equation ( 6 ), the fact that IT at) implies WiarV^tV 'W and 
the orthogonality condition C D ^ 0, we get * ^ 


||W|li = Tr {(C - DK)LXC - OK)‘\ 
=.Tr{(f - DK)W,{C-l)Kr} 
sTr {C'H,C' + DW;w, 'W,D'} 


The above inequality is a very important result; actually it 
holds in particular foi the optimal solution of (PI), implyinB 

that J Vff ^^2 and, as can be verified, the 

equality holds for 


other situation can be taken into account by simple including 
the case in d 8 , and defining properly the parameter M (see 
the example in the next section). 

Af 

Theorem 3. Define the convex sSet Pi where 

given by (16) with F replaced by F, (defined as in (14)), 
i = 1 • A/. The optimal solution Wf of 

(P4) min (Tr(/?M (31) 

is such that ACy - VV^W, ' and Tr ) solve the 
actuator’s failure problem stated before 

Proof. sSince "U, belongs to ir also belongs to each one 
of the sets ^. 3 ,, jvf- then, we have Vv e # 

0. G'u = 0, 

v'IF/W^F;- ‘H} + Q]v 0, Vf =-1 ■ • Af. (32) 

implying (sec the proof of Theorem 1), with partitioned 
as in (17), that 


W* 


KL, 


(2H) 


{A - B^W'W, ^)W,{A - ')' - W, + ^ 0. 

V/ - 1 • • Af (33) 


where K = {BnL^^B^->r D'D) ^B^L^^A, being P - T„ the 
definite positive solution of the discrcte-time Riccati 
equation (9) and being the solution t)f the controllability 
Gramian (equation (4)). Now, it is simple to sec that 
K* - W 2 ^^ ^ ' provides the optimal gam and 

mm {TT(RW): Ure% 2 }^TT(R U''*) 

= Tr(C,,L,r')--r (29) 

The proof of Theorem 2 is completed. 

The above results deserve some remarks The first one is 
related to the uniqueness of solution of (P3) Being convex 
(not strictly), it would be possible that it.s optimal solution 
was not unique in the sense that different stale feedback 
gains could be generated Fortunately, this fact docs not 
occur. To prove that, suppose W* ^ W e %% gcnerale two 
different state feedback gains, being such lhat Tr{P()T- 
= 0 . With K associated to , deiernuning the transfer 
function H(z), using (27) and taking into account that (PI) 
admits only one solution, we obtain 

/■' = min (||//||?:A:£^}. 

<||//l|;:<Tr(P1MO, (-■'’>) 

which IS an impossibility, since by assumption ./* - 
Tr(Pir*). The second remark concerns the geometry of 
(P3). Indeed, we have proved that the discrcte-time LQP is 
equivalent to a convex problem defined on a special 
parameter space (the elements of matrix W € 
of great importance since (P3) can be solved by means of 
efficient numerical procedures available in the literature (see 
for instance a convex-based algorithm proposed in Geromcl 

eta/., mi). 

Another important characteristic of (P3) is that further 
convex constraints can be easily added to it For instance, 
suppose we want to solve the problem of finding a control 
gain lhat guarantees cln.sed-loop stability and minimal 
suhoptimality in case of actualor failure. This problem can 
be stated as follows; suppose the model (1) is such that the 
input matrix B 2 is not exactly known but belongs to the set 
302 ;i =:= 1 • ■ A/}. Find (it one exists) a stale feedback 

matrix gain and a positive parameter (as small as 
possible) such that; 

• A - ^ 2 ^/ asymptotically stable VB 2 ^ 

• VB2edd2. 

Note that, in the above problem formulation, we are 
representing a particular actuator failure (the tth component 
of the control variable u is reduced to zero) by the equality 
= B 2 jU. In this sense, the set .302 is composed by all 
matrices obtained from Bj by zeroing its ith co umn, 
t = 1 • • • m. In fact, this is the case when only one actualor is 
supposed to fail at a time, However, it is obvious that any 


It IS then clear that Kj = WjW^ ‘ guarantees asymptotical 
stability for all 9 & 2 , taking into account all feasible 
actuator’s failure On the other hand, since W^ is unique for 
all the above t •= I ■ ■ M inequalities, using (27) wc gel 

■\t{RW,) = ttfZ\\H\\l VBj,, ,= 1-- M. (34) 

consequently, ||/Y||^, c 30. and the proof is 

completed. □ 

4 Numerical procedure and example.s 
This section is in part devoted to solve the convex problem 
(P4), that is 

min{Tr(A?'ir):^rfc'€ 2 ,}. (35) 

Note that this problem is convex and reduces to (FV3) in cast* 
Af - 1 . Using the previous results, for each i “ 

simple task (Geromcl et ai, 1991) to determine a matrix 
(xilTf) and a scalar P{%), which define a separating 
hyperplanc from to %,/ Then, it is possible to apply the 
following outer linearization algorithm which converges to 
the global optimal solution (Bernussou et a/., 1989; 

Luenberger, 1973); 

, = arg min (Tr (RW): W t ‘, (36) 

(37) 

given, 

where e > 0 is a sufficiently small parameter used to 
approximate ‘^ 2 /^ by a closed convex set and 
To illustrate the proposed method, let us consider the 
following discrete-time system; 



'02113 

0 (K)87 

0.4524' 


'0.6135 

0 6538' 

A = 

0.0824 

0.8096 

0.8075 

. 82 = 

0.2749 

0.4899 


J).7599 

0.8474 

0.4B32_ 


_0.8807 

0.7741 _ 


"1 

0 

0 “ 


"0 

0 “ 

0 

1 

0 


0 

0 

0 

0 

1 

, D = 

0 

0 

0 

0 

0 


1 

0 

_0 

0 

0 _ 


_0 

1 _ 


and B^ = I. Since the eigenvalues of matrix A are 
(0.3827, -0.4919, 1.6133), it is open-loop unstable Using the 
discrete-lime Riccati equation, wc get 

^ro.2968 0.3758 0.31141 
~ Lo.2302 0 4953 0.4997 J' 

and IIf/II 2 - 5.2448 With the proposed algorithm, the 


AUTO 29:1-P 
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optimal solution of (P3) is calculated to be 


1.4582 

-0.3930 

-0.0926 

0.2597 

0.0918 

-0,3930 

1.6316 

-0.1254 

0.4629 

0.6612 

1 -0.0926 

-0.1254 

1.1972 

0.3013 

0.5151 

0.2597 

0.4629 

0.3013 

0.3489 

0.4411 

L 0.0918 

0.6612 

0.5151 

0.4411 

0.6084 


providing the control gain 

0.3006 0.3803 0.31481 
0.2292 0.4989 0.5002J' 

and II//II2 = 5.2452, Comparing both numerical solutions, we 
verify relative errors of about 0.8% in the control gain norm 
and about 0.007% in the norm. 

Now, let us suppose that one of the actuators may fail. 
Obviously, the optimal solution of the discrete-time LOP 
provides no guarantee for stability in this case. However, 
Theorem 3 can easily handle this possibility by defining 
9^2 — (^ 2 \f ^ 22 > ^23} where 

'0.6135 0.6538] [0.6135 o" 

^21= 0.2749 0.4899 . ^.2 = 0.2749 0 . 

Lo.8807 0.7741 J [0.8807 0_ 

'0 0.6538“ 

021= 0 0.4899 
[0 0.7741 _ 

Considering the set ^ and solving (P4), wc have found 
ju^ = 11.8127 and the control gain 

ro.2966 0.6,356 0.74051 
'^'Lo.4823 0.6592 0.4925J’ 

that guarantees the closed-loop asymptotical stability under 
the actuators failure previously defined. Funhermore, for 
each matrix ^3^, i = 1, 2, 3, the associated '^'2 norm are given 
by 

^21 Ml^lli-* 7.0560, 

^22:||/^H2=S.6285. 
fl2,:||H||? = 8.4779, 

making evident that is a ^-norm upper bound indeed. 
Table 1 shows the closed-kxip eigenvalues in two situations, 
namely 82 = flji (the nominal one) and Bj = 822 (actuator-2 
failure). It is easy to see that under the last contingency the 
closed-loop system with becomes unstable, and the 

same does not occur for the closed-loop system with K^. In 
Figs 2 and 3, we show the impulse response (and unitary 
impulse has been applied to the second component of w) nf 
the closed-loop system with and K^, respectively, 

supposing that actuator 2 (corresponding to the second 
component of u) fails at fc - 6 and remains inactive for k >6, 
The unstable behavior is obvious when the Riccati control 
gain is used. 

5. Conclusion 

In this paper wc have proved that the optimal .state 
feedback solution of the discrete-time LQP can be 
determined by means of a convex problem. This is an 
important result, mainly due to two facts. First, for a given 


Tabu- 1 . C^i osbd loop r igenvaluh.s 


Contingencies 




-0.4826 

0.4153 

Both actuators 

0.2085 

-0.5326 


0.4385 

-0.4069 


-0.4861 

0.4047 

Only first actuator 

0.3347 

0.5973 

1.0959 

-0.5068 


LQR with rollura at 4r=6 



FKi. 2. Impulse response—A^Riccm, 


Robust control with rallure at ^=6 



Hr 


Fib. 3. Impulse respemse— Kf. 


pair (A, 82 ), all stabilizing state feedback gams can be 
parametrized over a convex set. Second, the convexity of the 
^ discrete-time control problem allows to solve it by using 
the most powerful mathematical programming methods. As a 
by-product of this fact, we claim that additional structural 
constraints can be easily handled. For instance, we defined 
and solved a problem involving actuator’s failure whose 
solution, to our knowledge, was not available until this time 
in the literature. In this sense wc want to emphasize that the 
numerical procedure proposed here is specially addressed for 
solving “non-classical” optimal control problems, 

including additional convex constraints (actuators/sensors 
failure, robustness, uncertain systems control, . . .) which 
cannot be solved by means of the algebraic Riccati equation. 
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A Monte Carlo Approach to the Analysis of 
Control System Robustness* 

LAURA RYAN RAYt and ROBERT F. STENGELJ 

muluvariable control syMe.n.; control system analysts; Monte Carlo 


Abstnct—Slochastic rohuslnes<>, a simple technique used to 
estimate the stability and perlormancc robustness of linear 
Ume-mvanant systems, is described. The scalar probability of 
instability is introduced as a measure of stability rnbuslncss 
Examples are given of stochastic performance robustnes.s 
measures based on classical time-domain specifications. The 
relationship between stochastic robustness measures and 
control system design parameters is discussed. The technique 
is demonstrated by analysing an LQCi/LTR system designed 
for a flexible robot arm. It is concluded that the analyMS of 
stochastic robustness offers a good alternalive fo existing 
robustness metrics. It has direct bearing on engineering 
objectives, and it is appropriate for evaluating robust control 
system synthesis methods currently practised 

introduction 

Standard linkar controi system design methods rely on 
accurate models of the system to he controlled. Because 
models are never perfect, robustness analysis is necessary to 
determine the possibility of instability or inadequate 
performance in the face of uncertainty. Synthesis of robust 
control system is predicted on a good measure of robustnc.ss 
Consequently, much research activity during the past two 
decades has been devoted to developing adequate measuies 
of robustness for linear, timc-invariant systems that can iii 
turn be used for robust control system synthesis. In most 
instances, robustness is treated deterministically, using 
singular-value analysis (e g. Lehtomaki et al., 1981; Doyle, 
1982) or parameter-space methods (e g. Siljak, 1989, Vicino 
et al., 1990). These methods can be applied without regard ti> 
actual bounds on system parameters, hence, the relationship 
of the metric to uncertainties in the physical system often is 
weak. Furthermore, deterministic metrics can be conserva 
live and/or difficult to determine for systems with many 
uncertain parameters. Consequently, overconservalive con 
trol system designs or designs that arc insufficiently robust in 
the face of real world uncertainties arc a danger. 

Stochastic Robustness Analysts (SKA) uses statistical 
descriptions of parameter uncertainly^ determine whether 
slability/performance robustness criteria arc met. Stengel 
(1980) introduced Monte C'arlo analysis of the scalar 
probability of instability, which is central to the analysis of 
stability robustness. SRA is described m more detail in 
Stengel and Ray (1991); exact confidence intervals for the 
scalar probability of instability are presented, and the 
stochastic root locus, nr probability density of the 
closed-loop eigenvalues, is shown to portray robustness 
properties graphically. Ray and Stengel (1991) illustrates the 
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use of SRA to compare control system designs for full-state 
feedback aircraft control systems and to analyse systems with 
finite-dimensional uncertain dynamics. Because it provides a 
stati.stical measure ol robustnes.s and because it uses 
knowledge of the statistics ol parameter variations, SKA is 
inherently intuitive and accurate. The physical meaning 
behind the probability of in.stabiliiy is apparent, und 
ov-rconservalivc or insufficiently robust designs can be 
avoided. 

Concepts behind stochastic .uahilify robustness arc readily 
extended to provide measures of performance robustness. 
Design specifications such as rise time, peak overshoot, 
settling time, and steady-stale error are normally used as 
indicators of adequate performance and lend themselves to 
the same kind ol analysis as described above. SRA can be 
applied to classical criteria giving probabilistic bounds on 
scalar performance measures. Metrics resulting from stability 
and performance robustness can be related to controller 
parameters, thus providing a foundation for design tradeoffs 
and optimization. This paper summarizes stochastic stability 
and performance robustness analysi.s Ihe analysis is 
illustrated by studying the effectiveness of robustness 
recovery on a stochastic optimal control system with 
parameter uncertainties. 

Stochastic stability robu,stne.ss 

Stochastic stability robustness is described in Stengel and 
Ray (1991) and is summarized here, (onsider a hneai, 
time-invariant system where the dynamic, control, and 
output matrices, F(p), G(p), and H(p) may be arbitrary 
lunctions of an r-dimensional parameter vector, p. The 
control gam matrix C is designed using some nominal or 
“mean” value of the dynamic model, denoted F, G, and H, 
that represents F(p), G(p), H(p) evaluated at the nominal 
parameter vector. The actual system has an unknown 
description that depends on the actual (unknown) value of 
the parameter vector p. Eiiviromncnl, variations in the 
nominal state, system lailures, parameter estimation errors, 
wear, and manufacturing differences all can contribute to 
mismatch between the actual system and that used to design 
the controller. For SRA, p is a.ssumcd to have a known or 
c.stirnalcd probability den.sity function, denoted pr(p), that 
expresses the parameter uncertainty statistics due to the 
above factors. 

The eigenvalues of the matrix |F(p) G(p)CH(p)] 
determine closed-loop stability. To estimate the prohahility 
of insiabiiiiy (IP) using Monte Carlo evaluation, the 
closed-loop eigenvalues arc evaluated J limes with each 
element of p specified by a random-number generator whose 
individual outputs arc shaped by pr (p). For less than an 
infinite number of evaliialions, the resulting probability is an 
estimate, denoted IP, and given by the number ol evaluations 
where one or more eigenvalues has a positive real part 
divided by J. Because IP is a binomial variable (i.e. the 
outcome of each Monte Carlo evaluation takes one of two 
values; stable or unstable) exact confidence intervals for P 
arc calculated using the binomial lest (Conover, 
Confidence intLrvals also can be calculated for the diflerenec 
AP between P, and P^ of two different control .systems (Ray 
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and Stengel, 1990): 

Pr[(i,, - t4)<APs(6’| - Lj)|a: 1 - a-, - aj+ aia,. (1) 

(L^, (7,) and (Lj, V^) are the binomial confidence intervals 
for P| and respectively, and cr^ are the individual 
confidence coeflficienis. Confidence intervals for P are 
detailed in Stengel and Ray (1991). 

71ie presentation of a probability wih exact confidence 
intervals is precise (nonconservativc), defensible, and 
repeatable in the context of probability and statistics, even 
though it differs from accepted deterministic measures. 
Although the accuracy of P in describing the true robustness 
depends on the accuracy with which the parameters arc 
described, known or estimated parameter uncertainties arc 
required to determine whether a system is robust for any 
robustness measure (deterministic or stochastic). While 
deterministic measures such as a stability margin (Vicino et 
al., 1990) can be computed without characterizing the 
uncertainty, the control system cannot be pronounced robust 
until the stability margin is evaluated with respect to the 
uncertainties expected in the system. Otherwise, the control 
system may be loo robust, at the expense of performance, or 
nonrobusl in the face of real-world uncertainties. By 
considering only allowed uncertainties, SR A eliminates this 
extra step. For deterministic measures, the problem is 
compounded by the fact that the metric it.self can be 
conservative and/or difficult to compute. 

Deterministic evaluation of the probability of 
instability. Continuously distributed parameters have a true 
underlying probability of instability that remains unknown 
for J < although given a sufficient number of evaluations, 
P can be bounded within a desired confidence interval 
(Stengel and Ray, 1991). Quantized distributions approxim¬ 
ate continuous distributions, approaching them in the limit as 
the number of discrete parameter values goes to infinity. 
When a continuoUvS distribution is approximated by a 
quantized distribution, it may be possible to perform fewer 
deterministic evaluations and obtain an equally good CvStimate 
P. For few parameters (r) and few quantization levels (iv), 
deterministic evaluation may be a valid option. If the 
parameter distributions arc in fact discrete, then evaluation 
of the w' deterministic combinations would give P, while 
Monte Carlo evaluation provides an estimate P along with 
confidence intervals for P. If r and/or w are large, then many 
deterministic evaluations are required, and Monlc Carlo 
analysis may provide adequate bounds for fewer evaluations. 
The tradeoff between Monte Carlo analysis and deterministic 
evaluation depends on the number of parameters, their 
distributions, the number of quantization levels, and 
required confidence intervals For example, a system with 
two binary parameters and P = 0.25 requires 2^ or four 
deterministic evaluations to obtain P, but 62.1H Monte Carlo 
evaluations arc needed to compute 95% confidence intervals 
with a width of 10% of P. A system with 20 binary 
parameters and P = 0.25 requires 2^’ or over 10^ 
deterministic evaluations, yet its probability of instability can 
be bounded within 10% of P at a 95% confidence level with 
the same 6238 Monte Carlo evaluations. Evaluation of binary 
combinations is comparable to determining the stability of 
“corners'’ in parameter space. If the parameters distributions 
arc not binary but the maximum real eigenvalue component 
is monotonic in the individual elements of p, then the 
deterministic binary evaluation.s circumscribe results ob¬ 
tained for the actual continuous or quantized distributions 
with the same limits, and a conservative estimate P is 
provided. 

Extensions of stochastic stability robustness analysis 

Stability robustness of systems with estimators. Stochastic 
stability robustness analysis is easily extended to systems 
incorporating dynamic state estimators. Using G^, and 
as the actual system matrices and F, G, and H as the 
design system, the closed-loop system matrix for state (n) 


and error dynamics (x - i) is (Stengel, 1986) 
i-A - GaC 

(F-FJ - (G-GJ--K(H-HJ 

1 ( 2 ) 

F-(G-GJC-KHJ’ ' ' 

where ii is the stale estimate and K is the estimator gain 
matrix. 'I'hc effect of parameter uncertainly on stability 
robustness is computed by Monte Carlo evaluation of the 
eigenvalues of equation (2), with F(p), G(p), and H(p) 
substituted for F^, G^, and H^. Closed-loop eigenvalue 
densities portrayed on the stochastic root locus show the 
possible interaction of dynamic and estimator state elements, 
and the possible robustness degradation due to the estimator. 
Well-known loss of LO stability margins when a state 
estimator i.s added (Doyle, 1978) can be quantified by the 
probability of instability. 

Performance robustness analysis. While stability is an 
important element of robustness, performance robustness 
analysis is vital to determining whether important design 
specifications arc met. Stochastic stability robustness is 
described by a single parameter, the probability of 
instability. Adequate performance—initial condition resp¬ 
onse, response to commanded inputs, control authority, and 
rejection of disturbances is difficult to describe by a single 
scalar. However, elements of SRA (e.g. Monte Carlo 
evaluation and use of the binomial confidence intervals) 
apply, independent of the performance criteria chosen. 

Numerous criteria stemming Irom classical control 
concepts exist as measures of adequate performance. 
Appealing to these, one can begin a smooth transition from 
stability robustness analysis to performance robustness 
analysis simply by analysing the decree of stability or 
instability rather than strict stability (Stengel and Ray, 1991). 
One method of doing this is to shift the vertical discriminant 
line from zero to Y less than (or greater than) zero. 
Histograms and cumulative distributions for degree of 
stability are readily given by the Monte Carlo estimate of 
Pr(2), the probability that the maximum real eigenvalue 
component is less than X, where Binomial 

confidence intervals are applicable to each point of the 
cumulative distribution, as there are just two values of 
intere.st, e g. “satisfactory ’ or “unsatisfactory”. The 
robustness measure resulting from the cumulative probability 
di.stribution is directly related to classical concepts of rales of 
decay (growth) of the closed-loop response, time-to-half and 
time-lo-doubic. Rather than a vertical discriminant line, one 
can confine the closed-loop roots to sectors in the complex 
plane hounded by lines ol constant damping and arcs of 
constant natural frequency. Systems with roots confined to 
.sectors would be expected to display a certain transient 
response speed. Again, the probability of roots lying within a 
sector is a binomial variable, and binomial confidence 
interval calculations apply. 

While the speed of the transient re.sponsc depends on the 
closed-loop poles, its magnitude and overall shape depend on 
the coefficients of the characteristic exponential and 
sinusoidal terms, and time responses provide the most 
clear-cut means of evaluating performance. When time 
re.sprmses are computed, stochastic performance robustness 
can be portrayed as a distribution of po.ssible trajectories 
around a nominal or desired trajectory. Envelopes can be 
defined around a nominal trajectory based on stated 
performance criteria, and the probability of exceeding the 
envelope becomes the scalar, binomial performance robust¬ 
ness metric (Ray and Stengel, 1990). While it is simple to 
conclude that a response violates such an envelope, 
individual responses within the envelope may not be 
acceptable. In such cases, the derivative of a response and 
envelopes around the derivative also can be used to evaluate 
performance. There is no unique set of criteria defining 
envelopes that bound an acceptable response; the envelopes 
may be defined by segments connecting points based on 
minimum/maximum dead time, rise time, time to peak 




ration o design insight revealed by SR A Solid points indieate closed-loop eigenvalues enclosed 
by hypothetical uncertainty circles (a) Root locus where stability robustness decreases monotonically with 
increased gam (b) Root locus where stability robustness decreases, increases, then decreases with increased 

gam 


overshoot, peak overshoot, settling time and steady stale 
error Segmented envelopes can be smoothed or other 
scalars can be used to define points on the envelope 
However, once an envelope is defined iinu response 
distributions can be computed by Monte C arlo methods I he 
closed-loop time response to a command input disturbance 
initial condition, or some combination is evaluated J times 
and for each evaluation the trajectory is a binomial variable, 
It cither stays within the envelope or violates the envelope 
Although computing time responses are more compulation 
intensive than probability of instability or degree of stability 
estimation such analysis is well within the capability of 
existing worksialions 

Stochastic rnhustnew (l\ a control design aid While general 
“rules of thumb’ regarding the* design of robust control 
systems arc useful, SRA can identify non obvious robustness 
behavior in particular problems Figure 1 provides one 
example Consider Fig la which shows the loot locus of a 
system that has a complex pair of poles and a right hall plane 
zero Hypothetical bounded uneeriainty circles are drawn 
around possible closed loop root locations the uncertainly 
circle at the pole represents possible open loop eigenvalue 
locations due to uncertainty in the dynamic matrix I As gain 
increases along the root locus the uncertainty is magnified 
and uncertainty in the control effect matrix contributes to the 
widening cireles Stability robustness decreases and closed 
loop roots may be in the right-half plane at high enough gam 
This ease illustrates one where the decrease m robustness is 
monotoniL, as indicated by plotting the probabilitv of 
instability vs the root-locus gam m Fig 2d Figure lb 
postulates a system with a real poh md a complex pair of 



Case (a) 

Control Sysicm 


poles and zeros located in a over zero configuration 

The complex portion of the root locus starts near the ju) axis 
bifore ending at the zero in the left half plane Again, 
uncertainty circles enlarge as gam increases In this case, it is 
possible that eigenvalue distributions cross into the right-half 
plane arc entirely in the left half plane as gam increases, 
and finally, cross back into the right-half plane al very high 
gam Here, stability robustness ^as measured by the 
probability of instability) may have local or global minima as 
functions of gain (Fig 2b) For multivariable systems with 
many parameters, the intrinsic structure of the problem and 
thi tradeoff between the spread in closed loop root 
unc rtainty vs the magnitude ol the control gains may nol be 
immediately evident Plots of stochastic robustness metrics vs 
scalar controller design parameters provide the necessary 
insight 


( ase study LQitjLiR system robustness analvsis 
It IS well known that the stability margins guaranteed for 
an LQ system are not retained in LQCj systems (Doyle, 
1978) Nevertheless 1 oop Transfer Recovery (LTR) (Doyle 
et al 1979) has become an established method ol recovering 
transfer properties ol the LQ sysicm (or the linear-optimal 
estimator, Kwakernaak (1976)) in inmimum-phasc LOO 
systems fhe condition lor recovery of LQ transfer 
properties is derived from the fact that if u(/) had the same 
eftecl on both x(t) and fi(/), then the LOO system would 
have the same transtci function properties as the LO system 
If the actual system matrices match those used to design the 
estimator, the transform relationships arc (Doyle and Stein, 



Fig 2 Probability of instability as a 


function ol control system design parameter, for the two cases in Fig 1 
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' i(i) = (^l„-F)-'G.(.), (3) 

*(.v) = |(.a„- F) + KH| ‘fKH(jI„ -F)-‘G«(jr) - Gu,(i)|, (4) 

When the estimator gain is chosen according to the recovery 
condition K/v = G, equation (4) becomes, 

i(j) = A-'G(HA 'G) 'HA-'Gii(j) = A‘ 'Gu(j) = »(.?). (5) 

where A-Cii-F). Asymptotic recovery occurs as the 
positive, scalar design parameter v approaches and q 
estimator eigenvalues approach the q transmission zeros of 
H(.sl„-F) ‘G. The procedure recovers the original loop 
only if the LO system has minimum-phase transmission 
zeros. To meet the recovery condition, the nominal 
disturbance spectral density matrix, Wy, is appended as 

W = W„ + v^GG^ (6) 

The term v^GG’^ represents additional process noise that 
ruins the optimality of the estimator with respect to actual 
measurement noise and disturbances; hence, performance 
suflfers as more process noise is added, but the good transfer 
function properties of LO systems are recovered asymptoti¬ 
cally. llie tradeoff between estimator performance and 
system robustness is made by adjusting v. 

Unstructured-singular-value analysis (USVA) typically is 
used to determine when LQ properties are recovered. 
Nevertheless, USVA docs not indicate the effectiveness of 
LTR in systems with parameter uncertainties, as pointed out 
in Tahk and Speyer (1987) and Shaked and Soroka (1985). 
When the system description is uncertain, the actual system 
matrices do not match those used to design the estimator, 

*(.>) = A 'G[HA ’G] '[HAa'GJuCO, (7) 


where A^^Crl-F^) and F^, G^ represent the actual 
system matrices. Equation (7) shows that when parameter 
uncertainties are present, the original loop is not recovered, 
although partial recovery may improve robustness over that 
of the nominal LOG system, q estimator poles approach the 
transmission zeros as v—and finite minimum phase 
transmission zeros may move to the right-half plane due to 
parameter uncertainty in HA^'G^. Estimator poles also are 
influenced by parameter uncertainty and can vary around the 
transmission zero: hence, increasing v indefinitely can 
decrease robustness in systems with one or more uncertain 
parameters. In such systems, SRA determines the value of v 
required to recover sufficient robustness while maintaining 
adequate performance, as demonstrated by the example that 
follows This characteristic of SRA is not limited to analysis 
of LOG/LTR system but is useful with any control design 
approach in which one or more design parameters is 
arbitrary. 

Single-link robot arm. A flexible one-link robot typically is 
used to study problems associated with controlling a 
compliant system when the sensor and actuator are not 
collocated (e g. Cannon and Schmitz, 1984) In such systems, 
robustness concerns can be severe. The linear model of the 
single-link robot arm retains the first three flexible modes, 
and the tip of the link is controled by applying a control 
torque to the hub, or base of the link. Because the model is 
representative of a general flexible structure, physical 
parameters are easily identifiable, and robustness is a 
concern, it i.s a good candidate for SRA. 

The dynamic, control effect, and output matrices are given 
by 


"() 1 0 i) 

0 0 0 0 

0 0 0 1 

0 0 -0)] 

0 0 0 0 

0 0 0 0 

0 0 0 0 

0 0 0 0 


0 0 0 o' 

0 0 0 0 

0 0 0 0 

0 0 0 0 

0 1 0 t) 

- ujI 0 0 

0 0 0 1 

0 0 -U)] 


(8) 



G = f[0 1 

It 

0 ^.|(0) 0 <Pi(0) 

0 

(9) 

L 0 0,(L) 

0 0,(L) 0 

0,(7-) 

« 1 

0 1 0 

0;ro) 0 0i(O) 

0 0UO)J’ 
(10) 


where x is the length along the arm, 0,(jt), are the normal 
modes, length of the arm, and is the 


total inertia of the arm. The measurements taken through H 
are the tip displacement and hub-rate, respectively. The 
flexibility of the open-loop system is apparent in the 
opien-loop eigenvalues, which are 0, 0, -0.177 ± 11.81;, 
—0.432 ± 21.61;, and -0.968 ± 48.37;. The transfer function 
between tip displacement and hub torque is non-minimum 
phase, with zeros 12.4, -12.0, 21.6 ±24.2;, -22.5 ±24.2;; 
hence, a non-minimum phase response can be expected for 
tip displacement, llie system has a readily identifiable 
14-clemcnt parameter vector: 


p = IC] 0)^ 0;(O) 02(0) 0;(O) 

L iP,(L) (P^(L) 0,(L) y^l- (11) 


Details concerning the modeling and parameter identification 
arc given in Cannon and Schmitz (1984). The linear- 
quadratic regulator de.signed in C?annon and Schmitz (1984) 
is used for demonstration of SRA. The performance index 
weights tip position and tip rate, and the LQR 
state-weighting, control-weighting, and control gain matrices 
are 


Q-O.OIF' 


” [o 0 


HF±H' 



R-0.001, 


(12)-(13) 


C = (35 42 13.38 41.24 2.65 59.32 -0.67 135.46 1.58]. 

(14) 


The nominal clo.sed-loop eigenvalues are -5.41 ±48.8;, 
-6.47±23.S;, -6,1±2.W);, -7.7± 11.42;. 

A uniform probability density function models the 
parameter uncertainties, with variations between ±2% of the 
nominal values for L and and ±25% for the remaining 
parameters. The 5n,(KK)-cvaluation stochastic root-locus tor 
the full-state feedback system is given in Fig. 3. 'Fhe nominal 
eigenvalues are marked, and the distribution is indicated by 
the height above the complex plane in units of roots/length 
along the real axis and rools/area in the complex plane. The 
‘bin” size in Fig. 3 is 0.9 along the axis and 0.9 x 0.9 off the 
axis. For 50,(XK) evaluations, IP, is zero, with 95confidence 
intervals {L, ( 7 ) ~ (0, 7.4 x 10 Each ot the four complex 
eigenvalue pairs appears in Fig. 3 as a “peak”, with a 
surrounding distribution due to parametric uncertainty. The 
peaks can be well-defined (as in the lowest frequency 
complex pair) or broad (as in the highest frequency pair) and 
the nominal eigenvalues are not necessarily at the 
distributions’ peak. Parameter uncertainty causes complex 
pairs to coalesce into real roots resulting in a distribution 
along the real axis. The closed-loop eigenvalues lend to 
.spread into the left-half plane, while definite boundaries are 
delineated on the right. For binary parameter variations of 
the same magnitude as the maximum uniform variations, 2 ^^ 
or 16,384 deterministic evaluations also give P - 0. These 
results indicate good stability robustness in the face of 
reasonably large uncertainties. 

Moving to performance robustness analysis, Fig. 3 shows 
sector bounds defined by 4 s; ^ 65 and C>i)A. For 50,000 
evaluations, the probability that closed-loop eigenvalues lie 
outside of these bound.s is 0.0147, with 95% confidence 
intervals (L, f/) - (0.0136, 0.0158). While the shape of the 
time response depends on closed-loop zeros, a minimum 
response speed can be guaranteed by requiring that all 
closed-loop eigenvalues lie within the specified sector. 

Figure 4(a) presents .segmented step-response envelopes 
and 5(K) Monte Carlo evaluations of the response of the lip to 
a 4.8 cm position command input. The control history 
corresponding to the mean response is given in Fig. 4b. The 
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□ 

- 65.0 


Fig. 3. Stochastic root-locus for the single-link robot with uniform pariinn icrs, 3(),tKX) evaluations. Nominal 

eigenvalues are marked by 'x\ 


initial response is in the wrong direction, since the translei 
function is non-minimum phase; the envelopes m Fig. 4 
indicate the maximum acceptable non-minimum phase 
response. For 5(K) responses, the probability of violating the 
time response envelope is 0.184 with conlidcnce 

intervals {L, t/) = (0.151, 0.221). Individual responses char¬ 
acteristic of those evaluated by Monte Carlo analysis arc 
given in Fig. 5. While responses (ill out the envelope, some 
of the individual responses within the envelope may not be 
acceptable in the lace ot real-world criteria governing rate of 
change of the re.sponse (Fig .5c) Hus is a case where 
checking envelopes around the derivative of the response 
may be necessary. Similar analyses can be perlormed on 



time (see) 


Fig. 4. Time histories associated with tip position command 
of 4.8 cm. (a) 5(X) Monte Carlo evaluations of the tip 
response. Envelopes are defined by scalar performance 
criteria. Nominal response is indicated by the solid line, (b) 
Nominal control input. 


control Irajeciones to make sure bandwidth and control 
effort limitations are not violated. 

It IS instructive from a design standpoint to plot robust¬ 
ness measures vs design parameters used to calculate 
feedback gams. Since there is a single control in this 
example, the scalar control weighting matrix R can be used 
as the design parameter. I'wo stochastic pcrlorniance 
robustness measures arc plotted vs R in Fig. 6^- -the 
probability of violating the time-response envelope and the 
probability of degree ol instability. As control gains increase, 
the closed-loop roots are pushed farther into the left-half 
plane, but they also lend to migiate farther from their 
nominal values. At some value of control gam, there is a 
tradeoff between how lar roots migrate and their location in 



Fig. 5. Examples of individual lip responses, (a) Acceptable 
response within envelope, (b) Response violates envelope, 
(c) Response is within envelope, but criteria governing its 
derivative may be required. 
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I (K)0c-05 1 OOOe-()4 * O.Ool 0 01 


R 

Fig. 6 . Stochastic performance robustness metrics vs control 
weighting matKx R. (a) Probability of violating time 
response envelopes, (b) Probability of degree of instability, 
for values £ along the real axis. 


the left-half plane; thus a local minimum is apparent around 
^=0.001 in the probability-of-degrcc-of-instability curves. 
While degree of instability improves for very small R, the 
control gains become unrealistically large. For larger R 
(smaller control gams), the nominal closed-loop roots have 
real parts in the range of the values of 2 used; hence, the 
probability of degree of instability increases rapidly beyond 
V?=0.01. The probability of violating time-respon.se 


envelopes and the probability of degree of instability show 
similar trends—a broad minimum in the region R = 0.001— 
as functions of R. 

Figure 7 shows the stochastic root-locus for the LOG 
system with estimator gains based on disturbance effect 
matrix L = G, and disturbance and noise covariance matrices 
W = \ (N•m)^ N = diag [0.005 cm'' 10 (rad sec The 
stochastic root-locu.s of the LQG system changes in overall 
character from that of the LQ system. Peaks are sharper, and 
the real-axis distribution is less pronounced. In particular, 
note the eigenvalues associated with the largest i^caks. In 
Fig. 3 a broad distribution is associated with these 
eigenvalues, yet in Fig. 7. this pair of eigenvalues shows little 
variation from its nominal location! While the extent of the 
distribution into the left-half plane is about the same as in 
Fig. 3, LOG system eigenvalues do migrate into the 
right-half plane The probability-of-instability estimate and 
95% confidence intervals for 50,00(1 evaluations are 
IP = 0.0771, and (L, t/) = (0.0748, 0.0795), representing a 
significant loss in the stability robu5tne.ss characteristic of the 
LO system. Figure 8 illustrates the effect of Loop Transfer 
Recovery for this example. Figure 8a shows that there is a 
value of V (v = 2) that minimizes the probability of 
instability. The fact that .such a minimum exists and the value 
of the design parameter that minimizes the probability of 
instability are not apparent by simply examining the nominal 
eigenvalues of the LQG/LTR system. The finite transmission 
zeros of the system are -1.75 ±46.8/, -2.9 ±18.2/, 

—5.8 ±6.6/, and -6.5. As v increases, seven eigenvalues 
approach the system transmission zeros; the minimum can be 
attributed to the tradeoft between uncertainty in eigenvalue 
location as gains increase and the nearness of transmission 
zeros to the imaginary axis. While a minimum probability of 
instability is not guaranteed, the results presented in Fig. 8a 
offer design insight and show robustness characteristics that 
may not be revealed by deterministic robuiitness measures, 
particularly if the deterministic analysi.s is conservative. 'The 
tradeoff between estimator performance and system robust¬ 
ness is shown by comparing Figs 8a and b. Figure 8b 


pr(?l) 



-B5. 


0 


Fig. 7. Stochastic root locus for the single-link robot with state estimation (LQG) 50,000 evaluations 
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FiCi. 8. Evaluation of Loop Transfer Recovery (a) Piobabilily ol Instability vs LOG/LTR design parameter 
V. (b) Sampled estimate of lip position and hub rate covariances vs LOG/LTR design parameter v. 


indicates estimator performance by sampled estimates of the 
covariance of the output, P — H£.'[(x(t) — where 

£[■] is the expectation operator. The output covariance 
(based on simulation of the IQG system) shows that 
performance degradation over that ol the nominal LOG 
system is small at the minimizing value of v 

Conc/usipn 

Stochastic Robustness Analysis offers a rigorous yet 
straightforward alternative to current metrics for control 
system robustness that is simple to compute and is unfettered 
by normally difficult problem statements, such as non- 
Gaussian statistics, arbitrary funclicri^ of uncertain para¬ 
meters appearing as matrix elements, and structured 
uncertainty. Principles behind stochastic robustness can be 
applied to scalar performance metrics and/or time responses, 
making it a good candidate for overall robustness analysis. 
Stability and performance measures resulting from the 
analysis can provide details relating intrinsic robustness 
characteristics and control system design parameters. The 
analysis makes good use of the computing power of modern 
workstations. The example demonstrates stochastic robusi* 
ness analysis applied to LOG/LTR. The analysis determines 
the effectiveness of Loop Transfer Recovery on uncertain 
systems, and the Loop Transfer Recovery design parameter 
that gives adequate stability robustness with minimal 
performance degradation is readily identified. 
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Robust Root Clustering for Linear Uncertain 
Systems Using Generalized Lyapunov Theory* 

R. K. YEDAVALLIt 

Key Words- Robust pole-placcment, root clustering; robust control, liiicai state space systems; 
unstructured uncertainly; structured uncertainty; Lyapunov theory. 


Abstract— In this paper, the problem of matrix root 
clustering in sub-regions of complex plane for linear slate 
space models with real parameter uncertainty is considered. 
The nominal matrix root clustering theory of Gutman and 
Jury (1981, IEEE Trans. AuL Control, AC-26, 403) using 
Generalized Lyapunov Equation is extended to the 
perturbed matrix case and bounds are derived on the 
perturbation to maintain root clustering inside a given 
region. The theory allows us to get an explicit relationship 
between the parameters of the root clustering region and the 
uncertainty range of the parameter space. The current 
literature available on perturbation bounds ioi robust 
stability becomes a special case of this unified theory. 

1. Introduction 

The problem of anai yzing and designing controllers for 
linear systems subject to real parameter uncertainly has been 
an extremely active topic of research in recent years. For 
example, see Doralo and Yedavalh (1990) and Siljak (1989) 
for a summary of recent developments in this area. In 
particular, there is considerable literature available on the 
analysis of robustness of linear state space systems with real 
parameter perturbations where the uncertainty can be either 
norm bounded (unstructured) or in terms of bounds on the 
intervals of the parameters (structured). However mo.st ol 
the analysis is e.ssentially devoted to the robust stability 
problem wherein the stability region is the entire open left 
half of the complex plane for continuous time systems and 
the unit circle with center at the origin for discrete time 
systems. The more general problem of robust D-stabiiily 
where a D-region’ is any given subregion in the complex 
plane has received much less attention Since pole- 
placement' technique is an eflcctive way of shaping the 
dynamical response, both for continuous as well as discrete 
time systems, robust fJ-stability problem is essentially a 
performance robustness problem m which the stability 
robustness problem becomes a special ca.se. Henceforth the 
phrases robust U-stability, robust r('ot clustering, robust 
eigenvalue placement will be used iiU^i changcably. 

Most of the literature on robust D-siabilily is confined to 
family of polynomials (Barmish, 1989; Fu and Barmish, 19K9; 
Soh, 1989; Kokame and Mori, 1991; Ackermanri et ai. 1991; 
Vicino, 1989; Zchcb, 1989). The very few methods reported 
for matrix root clustering confine themselves to some very 
specific O-regions (Juang et al., 1989; Juang, 1991; Sobcl and 
Yu, 1989; Keel et ai, 1991; Tesi and Vicino, 1990), In 
majority of these papers, the relationship between 
perturbation range and the eigenvalue migration range is not 
explicit and is not tractable. In this paper an elegant, unified 
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theory for robust eigenvalue placement is presented for a 
class of D-regions defined by algebraic inequalities by 
extending the nominal Matrix Root Clustering theory ol 
Gutman and .lury (1981) to linear uncertain systems (valid 
for both continuous and discrete time systems; both for 
structured and unstructured unccrtiiinlie.s; as well as analysis 
and design). Incidentally, this type of extension was 
considered in a scries of papers by Abdul-Wahab (1990, 
1991) with continuous time systems in mind. But as pointed 
out by Yedavalli (1992) the results were erroneous. We 
present here explicit conditions for matrix root clustering for 
different D-regions (which in turn have direct effect f>n the 
time response of the system) in terms of twunds on the 
parameter perturbations and establish the relationship 
between eigenvalue migration range and parameter range 
which are valid foi both euniinuous time as well as discrete 
lime systems, llie bounds obtained do not need any 
frequency sweeping or parameter griddmg, 

The paper is organized as follows. In Section 2, we briefly 
review the nominal matrix root clustering theory of Gutman 
and J .ry (1981) using Generalized Lyapunov theory. Section 
3 develops the relationship between parameter perturbation 
range and parameters of the root clustering region for 
different regions. In Section 4 we illustrate the theory with an 
example and finally Section 5 oilers some concluding 
remarks. 


2 Root clustering theory for a nominal matrix 

In what follows, we essentially use the same notation 
followed by Gutman and Jury in their paper. In Ihcir paper 
they consider a two variable transformation region D for 
matrix root clustering. For simplicity in expo.sition wc restrict 
our attention to only real matrices and review the material in 
Gutman and Jury (1981) related to only real matrices. 

Let a be the eigenvalue of A, A, the complex 

conjugate of A, Ji - Rc (A] and y ^ Im [A]. For A e wc 

consider a region symmetrical about the real axis, described 
by the algebraic inequalities 

Hv = {(jf, ,v) ; Y 

' t,K ’ 

siv = {u. y):S -o|. 

' f.K ' 

where v =/ + g, g ^2h, f, g and h are nonnegative integers 
and V is the region’s degree and is a real coclficicnt. Note 
that simply includes the boundary of The following 
facts arc reproduced from Gutman and Jury (19K1). 

X - J(A + A), 

V (A - A), 

f.n 

X (A + A/(A - If = 2 
f-n 
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In addition, let a, p^C and define 

f.K 

For simplicity in exposition, we limit our attention to 
regions Q, and Qj specialize the above notation to these 
two regions. Incidentally, these two regions cover quite a 
large class of regions in the complex plane. The following are 
examples of a class of regions; 

Regions of Degree 1. 


3. Bounds for robust root clustering 
In this section, we extend the concepts of root clustering 
given in Gutman and Jury (1981) to perturbed matrices and 
derive bounds on the perturbation to maintain root clustering 
in a given region (robust root clustering). Towards this 
direction, we consider systems with both unstructured 
perturbation as well as structured perturbation. 

Bounds for unstructured perturbation. Consider the 
following linear state space model 

X = Ax = E)x, jc(0)-j:„, 


^1 ■ J') • VfM) 0}. 

These regions include open left half plane and regions with 
prescribed degree of stability. 

Regions of Degree 2. 

Qj: {(■*. y): r«u + VioJt + rnzy^ + < 0}- 

This represents a conic section (either ellipse, parabola or 
hyperbola, depending on the nature of the coefficients y,^s). 

We now recall a fundamental theorem on root cluslenng 
of a nominal matrix in terms of Generalized Lyapunov 
Equation (G.L.E.) from Gutman and Jury (1981). Consider 
the Genralized Lyapunov Equation 

2 =-0, 

p.fi 

where is the transpose of A and r is the coefficient of 
a^P*^ in the polynomial p(cx, p) given by (l).t 
Note that for the regions under consideration coefficients 
are real. Before proceeding to state an important 
theorem found in Gutman and Jury (1981), in what follows, 
we summarize the expressions for and the expressions for 
the Generalized Lyapunov Equation for four regions, 
namely, LHP, cr-shift, ellipse and circle. 

* Open left half plane. 

; (j < 0}(yon = d, y,() = 1), —0, Cn, = C(n —(2) 

G.L.E.: (PA^ ^AP)=-2Q. (3) 

■ a degree of stability. 

Qi! (arTJC <0, £ir>0), nr. y,o== 1), 

G.L.E.; laP-^PA^ ^ AP^-2Q. (5) 


• Ellipse. 

O 2 : (Voo + Vtcy* + YioX + r20J^^ < 0}(r2o > O. Ym > 0), 

fno^yno' ^^10 ~ ^(11 ~ iVid' (^) 

Cii ~ 2(720 72 n). C()2 = fjd = 4(7211 “■ 702)' 

G.L.E.; c,,,/’ + c,„(Pyl^ + AP) + c^^APA^ 

^c„JiPA^ + A^P) = ~Q. (7) 


• Circle. 

n2:{7rxi+7id^+Jr^ + y"<*’). 

YaU’ 27iui ^11 “ L 

G.L.E.: Cf„P + Cn,(PA'^ +AP) + APA' = ~Q. (9) 

We now state the theorem on root clustering using 
Generalized Lyapunov Equation given in Gutman and Jury 
(1981). 


where is an n x n matrix with a given root clu.stering 
region and E is an unstructured perturbation on /I,,. 

The aim is to derive bounds on the norm of the 
perturbation of matrix, i.e. on ||E|| such that /4,)+E has 
roots maintained inside the root clustering region of A^y 
Note that in a design situation, the matrix 44,, may represent 
a nominal closed loop system matrix with gam matrix 
elements as design parameters (lor either continuous time or 
discrete time systems). 

First consider the generalized Lyapunov equation of (5) 
corresponding to region of degree 1. Assuming that the 
eigenvalues of the nominal system matrix A,, are located 
inside the given region (LHP or or-shifted LHP), we now 
want to derive bounds on the perturbation matrix E such that 
the roots of the perturbed system matrix A„ t- E also lie 
inside the region Q,. 

Theorem 2. I'he perturbed system matrix Af, + E has 
eigenvalues inside the given region of (4) if 




o„,.JP) 


Pur 


where P satisfies 


2(xP^PA,, + A,^P=-2Q. 


Proof. Similar to the proof given in Patel and Toda (1980) 

Remark 1. Note that this bound specializes to the 
standard left half plane (asymptotic stability for continuous 
time systems) bound derived in Patel and Toda (1980) where 
nr = 0. Here denotes the perturbation bound tor root 
clustering for region of degree 1. 

Now consider the generalized Lyapunov equation of (7) 
corresponding to region of degree 2. Assuming that the 
eigenvalues of the nominal system matrix A„ are located 
inside the given region 122 , want to derive hounds on 

the perturbation matrix E such that the roots of the 
perturbed matrix A,j+ E also lie inside the region 


Theorem 3. The perturbed system matrix A,,+ E has 
eigenvalues inside the given region ^2, of (6) if 







where 


^ “ 2r,i2;,, + C| 

^ o)' 

and P satisfies the generalized Lyapunov equation 
+ A„P) -}■ c,,A„PA,y 

“I" A(,P) =~ ~Q, 


Theorem 1 (see Gutman and Jury, 1981). Let A e and 

consider Q 2 given by (6) with Vzo ^ 0- ibe 

eigenvalues of A to lie in it is necessary and sufficient that 
given any positive definite matrix Q, there exists a unique 
positive definite symmetric matrix P sati.sfying (7) (with (6)). 


t Note that there is no loss of generality if one considers 
the equation £ c^^A^^PA^ = -Q. 


and denotes the perturbation bound for root clustering 
for the region of degree 2 and (•)„, denotes the absolute 
value of (■). 

Proof. Sec Appendix A. 

For the special case of a circle in the left half plane with 
center at p and radius r^, the generalized Lyapunov equation 
is given by the following parameters 
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Thus we have the G.L.E as 

Let 


+ PAl) + A„Pa;, + - r^)p = _ Q 

The above equation can be written as 

{ An - PIJ ^ Mu - Pl„) ^ ^ Q 


which is in the form of a Discrete Lyapunov Equation with 
the nominal matnx {A„ - )/r,, For this case the bound 

g‘''cn by 

= f (o,,,...(/»„) - fi)- + 

’ (10) 

Remark!, h may be noted that the bound/i. specializes to 
the discrete system bounds of Kolia et al. (l')89) with ^ 0 
r^=l. 

Bounds for .structured perturbation. For this case, we 
consider the linear slate space system with structured 
p^erturbation as follows: 

x=^A{q)x, j:((l) = jt„, 

where 

Mq)--=Au + E(q) = A„+'^q,E,. ( 11 ) 


with v4uebeing the “nominal” matrix obiaiiietJ at the 
nominal value of the uncertain parameter vector q 
{q,, r - I, 2, . . . , r), i.e. ry" ^ 0 and are given constant 
matrices. This type of representation produces a “polytope” 
in the matrix space. A special case of interest is the so-called 
“interval matrix” family in which E, arc such that they 
contain a single nonzero clement, at a different location in 
the matrix for each different i. 

We now define a set of matrices with the following 
notation. Let [ denote the matrix with all its elements 
taking on absolute values of the elements of the matrix [.]. 
Also let [.], denote the symmetric part of the matrix, i.c 

(i.i+i r)/2. 

Again consider the generalized Lyapunov equation of (3) 
corresponding to region of degree 1. Assuming that the 
eigenvalues of the nominal system matrix A^^ arc located 
inside the given region (LHP or nr-shiftcd LHP), we now 
want to derive bounds on the perturbation matrix E{q) such 
that the roots of the perturbed system matnx A,, 4 Eiq) also 
lie inside the region Q,. 

Theorem 4. The perturbed system matrix An^ E(q) has 
eigenvalues inside the given region S2, of (4) if 




„(£?) 


(T,L) 


where P, = (PE,), and P satisfies 


2aP + PA„ + a!,P = -2Q. 


Proof. Similar to the proof of Theorem 2. 



(E,E,Pl ■ 

• (EiE,P), 


(EjE.P), 

■ (E^E^P). 


l(E,E,Pf {E,E,P), 

(£,£,/>),. 

'{E,PE{), (E,PEl\ • 

(e,peX 

{E,PE;1 (E.PEh 

(E^PE^h 

iiE,PE;\ {E.PEJ-), • 

(ErPE^)j 

= {E,PA^.. 
A,„p (A„E,P)„ 


E.tip ~ (E,AtfP),, 

Pr. = (E,P), 



Now we are ready to state the theorem which gives bounds 
on lOot clu.stcring of (11), assuming Aj, has rool.s inside the 
given root clustering region Q 2 


Theorem 5. The perturbed system matrix AnT E(q) has 
eigenvalues inside the given region £^2 of (6) if 


l^/l< 







where 


fc. ^ C n2,„(S + X j 


where P satisfies (7) and denotes the perturbation bound 
for root clustering for region of degree 2 for structured 
uncertainty 

Proof. Similar to the proof of Theorem 3. 


4 Illustrative example 

To illustrate the theory, consider a simple example with 
the plant matrix (see AbduLWahab, 1991) 




-4.3 

0-2 


-0.4' 

-3.4^ 


with eigenvalues A, = -4.2 and A^ = -3.5. Let us consider a 
circular root clustering region in the left half of the complex 
plane with the center at ~ —4.0 and radius r- 1.0. Then 
the bound on the unstructured uncertainty i7^„,(Zl) is given 
by equation (10). Carrying nut the computations with Q = /, 
we get 


0.0341. 


Remark 3. Note that this bound /ii specializes to the 
standard left half plane (asymptotic stability for continuous 
time systems) bound derived in Keel et al. (1988) and Zhou 
and Khargonekar (1987) where nt = (l Here denotes the 
perturbation bound for rcxit clustering for region of degree 1 
for structured uncertainty. 


That is, as long as the unstructured uncertainty is such that 
a^.J£)< 0.0341, 

the eigenvalues of A(, + E stay inside the circular region of 
the complex plane with the center at -4.0 and the radius 
r-1.0. 


Now consider the generalized Lyapunov equation of (7) 
corresponding to region of degree 2, Assuming that the 
eigenvalues of the nominal system matrix u are oca e 
inside the given region Dj, we now want to drive oun s on 
the perturbation parameters q, such that the roots of the 
perturbed matrix A„ + E{q) also lie inside the region 2 - 


5. Conclusions 

ITiis paper presented a unified theory for matrix root 
clustering for linear state space models (cither in continuous 
time or in discrete time domain) subject to real parameter 
uncertainty. The method explicitly relates the root clustering 
region perameters to the parameter perturbation ranges for a 
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cla&s of root clustering regions described by algebraic 
expressions. Since the method uses the Generalized 
Lyapunov Theory for getting the bounds the problem of 
conservatism is still present. Again the idea of improving the 
bounds by state transformation can be employed in this 
context also as was done for the case of robust stabilization 
in Yedavalli and Liang (1986). Efforts are underway to 
obtain non<conservative bounds for robust root clustering 
using Kronecker matrices and will be reported later. 
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Appendix A: Proof of Theorem 3 
Let 




~2u 


( 12 ) 


where 


f ” ^nvn(Q)l ^ ^ ^ 

and P satisfies 

ewP + ‘•ni(P^u + + c„A„PAl -t r„,(/'/l,V + Af^P) 

--Q 


Let 




denote the symmetric pan of matrix (.). Then (12) can be 
written as 


-d + {d- + 4ac)''- 

X <~ -- — ~ , 

2a 




ax^ dx - c < 0, 


+ 2a„,JA,)o^JF) + )1 < 


a^j2c„,(EPl+c„(HEPA'\ + EPE^) 

+ c,„(2P{A,El + 2{EA„),P + 2P(E.\}\^ o^JQ) 

- Q 4 [2r,„(£/*), + c,,(2(EPA(,), + EPE') 

+ c,„(2P(/1„£), + 2(EA„\P + 2P(E\)] < 0 

(' 4(1 £)^1 

^y4(,+ £ has eigenvalues inside the region 
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Necessary and Sufficient Conditions for Robust 
Stability of Discrete Systems with Coefficients 
Depending Continuously on Two Interval 

Parameters* 
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Abstract —Let a real polynomial in a complex variable, 
whose coefficients are any given continuous functions of two 
real interval parameters, be given. 

Necessary and sufficient conditions are derived for the 
polynomial to have all its zero.s outside (or inside) the umi 
circle of the complex variable plane. 

In a multi-polynomial dependence of the cocflicients on 
parameters, the conditions involve, in addition to a finite 
number of algebraic steps, only resolving single variable fixed 
coefficients real polynomials. 


1. Introduction 

In ANAI-YSING and DESitiNiNC. realistic practical engineering 
systems, it is no longer sufficient to assume a "fixed" 
mathematical model of the system. Uncertainties about the 
model have to be taken into account. Indeed, ever since the 
publication of Kharitonov (1979) there has been an 
increasing interest in the subject. One approach to consider 
the uncertainties is through the coefficients of the 
characteristic polynomial ot the system. 

With regard to continuous-time systems, stability requires 
that the zeros of the characteristic polynomial be confined to 
the open left half complex plane. It was shown in Kharitonov 
(1979) that for the special case where the coefficients of the 
characteristic polynomial are independent "interval" paid 
meters (i.c. they may take on any value within a given 
continuous interval) stability of a certain set o( four 
polynomials is both necessary and .sufficient to ensure 
stability of the entire family of polynomials. However, even 
for the simplest functional dependencies o( coefficients on 
interval parameters, the situation becomes much more 
complicated. A major step towards the solution of the linear 
dependency case has been made in Bartlett et ai (19H7). 
Three different solutions to this case ^ere given in Fu and 
Barmish (1987), Barmish (1988) and Zehcb (19K9). The 
multilinear dependency case has been assessed m Kraus ct at. 
(1991a) and Barmish and Shi (1988). Nece.ssary and sufficient 
conditions for the general case, where the coefficients ot the 
characteristic polynomial arc any continuous functions of the 
interval parameters (e.g. multif)olynomial dependency) were 
given in Zeheb (1990). 

With regard to discrete-time systems, stability requires that 
the zeros of the characteristic polynomial be confined to the 
exterior (or interior, depending on the definition) of the unit 
disk. Kharlinov’s result docs not apply for this case (see c.g 
Bose and Zehcb (1986)) even for the weaker theorem which 
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requires stability of 2" polynomials, where n is the number of 
coefficients (parameters) instead of tour polynomials. 
However, some of the results for the linear dependency case 
in the methods of Barmish (1988) and Zeheb (1989) pertain 
to the unit disk as well. A review on robustness of discrete 
system.s, along with a large list of relevant references, is 
given in .lury (1989) 

Some other approache.s to the problem and contributions 
towards its .solution are given in 7'sypkirt et al. (1991), 
Anagnosl et al. (1989), Kraus et ai (1991b), Qiu and 
Davison (1989), Chapellat and Bhatlacharyya (1989), 
Kokame and Mori (1991), Anderson et ai (1987), Bartlcll 
and llollot (1988), Bose (1989), Ackermann and Barmish 
(1988), Bose et ai (1986) and others 

In fact, the .solution to the general case, where the 
coefficients of the characteristic polynomial are any 
contiii.ious functions of the parameters, can be treated by the 
^-decomposition graphical method (Neimark, 1947) pre¬ 
sented in English in Lanzkron and Higgins (1959). Ihis 
method employs mapping of the imaginary axis of the 
complex variable plane onto the plane of two variable 
parameters. Extensions make it useful for analy.sis of systems 
with more than two, but small number, of parameters. 'Phis 
method permits the designer to determine the number of the 
left half plane roots of the characteristic equation in various 
areas of the parameter plane, but the designer is unable to 
obtain information about, or have control over, the root 
locations. Another method Siljak (1966) which i.s graphical 
too, IS also applicable to the above general case, by 
correlating the .system parameters and the roots of the 
characteristic equation. 

An analytic .solution to the general case, where the 
coefficients of the characteristic polynomial are any 
continuous functions of the parameters, which is applicable 
to a general domain of root clustering, is derivable from 
Hertz et al. (1987), which is ba.sed on the zero sets concepts 
(Zaheb and Walach (1981)). The objective of the present 
paper is to explicitly formulate independent necessary and 
sufficient conditions for such a polynomial, whose coefficients 
are any given continuous functions of real interval 
parameters, to have all its zeros outside (or inside) the unit 
circle, and to provide direct and independent prtwfs to these 
conditions. For the sake of simplicity and explicitness, only 
two independent interval parameters are allowed. A 
preliminary version of the condition.s (without proofs) were 
presented in Zeheb (1991). The computational complexity m 
testing these conditions for the multi-polynomial dependency 
case involves, in addition to a finite number of algebraic 
steps, only the resolution of single variable fixed coefficients 
real polynomials. However, it is not claimed that for a large 
number of parameters the computational complexity is 
manageable. 

Since interval, or even polytopic, stale space matrice.s of 
linear .systems lead to characteristic polynomials with 
coefficients depending pnilynomially on the parameters, the 
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results in the present paper also provide a solution to (wo of 
the research problems suggested in Jury (1991) namely: 

(a) Obtaining necessary and sufficient conditions for 
robust stability of the classes of interval and polytopc 
types of matrices for the discrete case. 

(b) Study of robust Schur stability of a class of 
polynomials with coefficients depending multilincarly 
on perturbations. 

2. Necessary and sufficient conditions for robust Schur 
property in the continuous dependency case 

Let 


Q(.^-Pi-Pi) = ^o,2‘. ( 1 ) 

where 

' = 0,1,- n. fl, eR, (2) 

and / (i = 0, . . . , n) are continuous functions. 

Theorem. 

C(2.Pi.P2)’‘0in |z|s 1 forp.s^jSp,, i = l,2, (3) 

where p,- <p, arc given real numbers, if, and only if. 

(1) 0(^.Pi"*.P 2 )"^0 in | 2 |sl where p*/’’ is any arbitrary 
value in the interval p*"* e (p,, p,]. 

(2) Q(2.Pi,p2)*0 and Cl^.'Pi. P 2 )’'^0 in |z| 1, p,£ 

Pi^Pv 

(3) 0(z. Pi.P2)’‘’0 and Q(z,p,,P2)*0 in |z| = l, p.- 

PjSpj. 

(4) The following set of two real equations in two real 
unknowns 

0(1.P|.P2) = O, 

3g(liPl.P 2) ^ 

3Pl 

has no solution for which 

p^^p.^p^, i-1,2. 

(5) The following set of three real equations in three 
unknowns 


QU.Px. Pii^'Z = 0 , 

I «0 

'.Pi,p2)=i«,i"-' = 0, 





has no solution for which 

l2| = l, p,^p,^p„ » = L2. 

Remarks. 

(1) Consider Condition 2. Note that for |z|= 1, z can be 

expressed as a complex function of a real parameter, e g. 
z = 0^0^ 2n. Hence, Condition 2 is equivalent 

to two sets of equations, each has two real equations 
(c.g, ReQ“0, Im0 = () or (?(z, Pi, P 2 ) = ^ 

Q(z pi, P 2 ) ” unknowns p, and 0, 

which must not have a solution for which p,^ 
p, sp,, Os 0s2;r. A similar remark pertains to 
Conditions 3 and 5, where in the latter, the three 
unknowns become real for |z | = 1. 

(2) The computational complexity involved in the above 
conditions is the following; Conditions 2-5 require the 
determination of whether a set of 2 (or 3) real equations 
in 2 (or 3) real unknowns has, or has not, a solution in 
given intervals. In a polynomial dependency case, one 
way to do it is to use the resultant method which, by a 
finite number of algebraic steps, reduces the problem to 
that of finding the 7.eros of a single variable fixed 


coefficients real polynomial in a given real interval. In 
many cases (where such zeros do not exist) it suffices to 
apply a Sturm test without having to solve the 
polynomial equation. In other cases, it is not 
recommended, from the computational viewpoint, to use 
the resultant method, especially in the three unknowns 
condition, because of introducing pseudo-solutions. 
Other algebraic and graphical methods to solve the 
problem may be more efficient, e.g. the Groebner Bases 
method or the Neimark D-decomposition method. 
Another possibility is to use the Jury-Marden table test 
(see e.g. Barnett (1983)) and require that the array is 
regular. The entries in the array arc functions of p, (in 
Condition 2) or pj (in Condition 3) and their 
non-vanishing in the appropriate intervals can be 
checked by a Sturm test. Condition 1 can be carried out 
applying a Schur test (by one of the well-known criteria), 
which again amounts to a finite number of algebraic 
steps. 

(3) If ‘stability” is defined by 

0 (z,p,,p 2 )#()in |z|a 1 , (4) 

rather than by (3), the theorem is obviously applicable by 
changing variables z—> 1/z. 

Proof of the Theorem. Necessity is obvious. We shall prove 

sufficiency. Consider the zero set V defined by 

(z.3p',’6[p,,Pil, p" 6 [p 2 ,p,l 

such thal (J(z, p", p") = 0}. (5) 

and denote the boundary of by dV. 

Obviously, (3) is equivalent to the requirement 

(V)n(|2|<i)-o. (6) 

Now, if 

(5V)n(|z|<l) = 0, (7) 


then cither (6) is satisfied or 

(|z|<l)crV. (8) 

Therefore, to satisfy (6), it suffices to satisfy (7) in addition 
to requiring (z"- 1)^1^, i.e. 

G(l.Pi.P2)='^0. p|Sp,sp,. Pz^spjSpj, (9) 

which contradicts (8). 

Applying the “positivity theorem” in Walach and Zeheb 
(1980) to 0(1, Pi, P 2 ) and to minus Q(l, p,, pj) one obtains 
that (9) is satisfied iff Condition 4 is .satisfied in addition to 


C?(l. 0(1, Pi. forp, :sp|Sp,, 

(9.1) 

and 


0(1,Pi,P2)’‘O and Q{i, Pi, Pz)*0(ox P2^P2^P2 

(9.2) 

Turn now to (7). If 

OV)n(|z| = l) = o, (10) 

then either (7) is satisfied or 

(3V')cr(|z|s1). (11) 

By Theorem 1 in 2^heb and Walach (1981), a point z" e 3V 
is either a point where one of the following four polynomials 
vanishes: 

QU.Px.Pz). 0(z.Pi.p2). 

Q(^.P\.Pl). 0(Z,P|,P2), 
or else, z" is a solution of the set of equations 
G(‘.Pi.P2) = 0. 

n (13) 

dpt dp2 

for which p^ pf, i — 2. 
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,1.., (L) n (1,1.,). 0 .„d «„, "r.t ;;,i5 ;,Tr4 

not belong to |z| ^ 1 , (7) is established. For (L) n (M n = 
0 Conditions 2-3 ensure that the four polynomials (12) do 

rtat for |z| = l, the complex conjugate of z equals 1 /z 
Condition 5 ensure.s that (13) has no solution for Izl = 1 
Finally, Condition 1 ensures that at least one point ol L 
dojcs nol belong to | 2 | 1 . 

Now since (9. 1 ) and (9,2) are included in Conditions 2 and 
3, Conditions I- S arc .sufflcieni to establish (3). 


Remark. It beotnes clear from the proof that the above 
necessary and sufficient conditions can be stated in different 
modified formulations. For example, in Condition 4 we 
may use the derivative aO(l, p,. p,)/dp^^o instead of 
the derivative with regard to p, or, in Condition I wo may 
use Q(z.pY\p^)*Q in |z| = l. Another po,ssibility is to 
use (z = 0) # K in (9) This would readily lead to the follow- 
ing conditions replacing Condition 4 : 


P 2 ) ^Oand ^ 0 for/^, 'jsp,. ( 14 ) 

P 2 ) and for p^^^p^ (15) 

and the set of two real equations m two real unknowns 
®(i(P I i P 2 ) ~ (li 

^Q nCPi.p?) ^ (lb) 

dp, 

has no solution for which 


P.^P,-Pn 

However, wc will not elaborate on all, quite obvious, 
possibilities. 


3. A numerical example—polytopir state matrix 
Let a discrete lime system be given by 

x(n + 1 ) -/4ji(n). (17) 

where Jt(-) is the state vector and A. the system’s matnx, has 
polyiopic uncertainties, i.e. 




A -- 

^A(p)- 

/If. 

4 A 

if^i 




'l 

0 r 


'1 

0 

0 " 

1 

0 r 

.4,,= 

2 

1 0 

, .4,= 

1 

2 

0 

1 

1 0 


_0 

3 1_ 


J) 

0 

1 _ 

_0 

1 1 


0 :S-ts j, / “ 1,2 (IH) 

Ilic characteristic polynomial becomes 

= z’ — z^(3 + 4p| -H 3pn) 

+ z(3 + Hp, + 6^2 + 5pT t- 3pl + 8 p, P 2 ) 

- (7-l-7pi -I- 14p2 + 5pj + 9p;+ IZpiPj 
+ 2p^^ + 2pi + 5p^pl + 5p]pi). (19) 

Note that Qiz.Pi.Pz) a t*’*'^'* degree polynomial with 
coefficients depiending polynomially (third degree) on two 
interval parameters. 

According to our theorem, 

Q(z, Pi, P 2 ) #= 0 in | 2 | 1 for 0 ^ 1. r = 1, 2, 

if, and only if. 

(?( 2 , pf^ = 0 , 1 ) = 2 ’ - 6 z^ + 12 z - 32^0 in jzl ^ 1 , 

( 20 ) 

( 20 ) is obviously satished since 

32> 1 4 6 -h 12 ^ 19. 

In a more general case, where the above sufheient 
condition is not relevant, a Schur te^st involving a finite 


number of algebraic steps may be earned out to 
determine the validity of this condition. 

( 2 ) 

G»(2,Pi, I)= z“'-z^(6 + 4pi) + z(12+ 16pi + 5pt) 
-(32 + 24p,4 \i}p]-ylp])i^0, 
forlzj^l. ( 21 ) 

It can be checked almost by inspection that (21) is 
satisfied. For |z| = 1 it is sufficient that the coeflicieni of 
z^\ in absolute value, be greater than the sum of absolute 
values of the rest of the coefficients, i.e. 

32 4 24p, 4 lUp; + 2pJ > 1 4 6 4 4p, 4 12 4 I 6 p, 4 5pp 

or 

2pi + 5pf 4 4p, 4 13 > 0, 

which is always satisfied for 1. As indicated in 

Remark 2 above, a systematic algorithm can always be 
earned oui m cases where the above sufficient condition 
IS not satisfied 

Turn now to the second part of Condition (2). 

P(z,pi, 0) = z'- z’(3 + 4pi)-t-2(3-t-Bp, +5pl) 

- (7 + ^p^ -t- 5p-; -r 2p]) 0 , 

for|z| = l, 0!^p,s:l, (22) 

UsSing the Jury-Marden tabic form (sec e.g. Barnett 
(1983^), it Clin be verified that (22) is also satisfied. 

(') 

Q{z, 0, P 2 ) = - z’(3 4 3 P 2 ) 4 z(3 4 6 p 2 ^ ^P?) 

- (7 4 I4p2 4 9p; 4 2p2) ^ 0. (23) 


lor |z| - 1, 0'Sp, 1. 

As in the first part of Condition 2. the sufficient 
condition 

7 4 14p2 4 9p2 4 2p] > 1 4 3 4 3p2 4 3 4 hp2 V?- 

IS obviously satisfied for S: 1. For p. “ 0, we obtain 

from (24) 

z^ - 3z'’ 4 32 - 7 0 tor |z| ^ 1, 

which is readily checked to be satisfied 
The second part of Condition 3 is 

Q(Zr 1, P 2 ) = ~ 2'^(7 4 3 P 2 ) 4 zdfi 4 14p2 4 3p2) 

- (21 4 31p2 4 \4pl 4 2p\) ^ 0, (24) 

tor |zl — 1, 0 ^ p . 1 

Using again the Jury-Marden table form reveals that 
the coefficients are either all positive or all negative and 
thus (24) IS also satisfied 

(4) 

QO.PuPi) = -2p; - 2pl - SpiFz- -Vipi - 

-4p,p3-3p, - llp,-6- 0, (2.‘).1) 

^C?(l > FI ► F 2 ) A ^ 7 c 2 in A in 

= -6p, - 5p; - 10p,p2 - 4p2 - 3 - 0. 

dp, 

(25.2) 

Obviously, none of these equations have a solution for 
which 0 :^p, ^ 1, I = 1, 2 (and so docs the set of 
equations) since all coefficients are negative. 

(5) After some cumbersome but straightforward algebraic 
steps, the third equation in Condition 5 reaches the form 

z\Az^ 4 Bz^ - flz - A] = 0, (26) 


where 

A - 2p] 4 9p2 4 10p,p2 4 18p, 4 36p2 + 35, 
fl = -[ 2 p; -t- 18pJ + 40p,p54 24p?p2 
-i-44p^+ 90p^-l- 140p,p2 
-I- 122p, + 142pj + 701. 
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The solutions of (26) are z = 0, 2 = ±1 and 

+ (27) 

the solutions z-0 and z - ±1 are readily seen to be 
inconsistent with the other two equations in Condition 5. 
Now, combining (27) with the other two equations in 
Condition 5, and since the coefficients of all three equations 
arc real, the only possibilities of a common solution arc: 

(1) The left hand side of (27) is a factor of each of the two 
other equations. 

(2) The common solution is real; i.e. for |zl - 1, it must be 
z = 1 or z = -1. 

Both possibilities are readily checked to be inconsistent 
with 0^p,^ I, i = 2. 

Hence, Condition 5 is satisfied, as well as the other four 
conditions of the Theorem. Wc conclude that the system (17) 
with polytopic state matrix uncertainties (IK) is robustly 
stable. 

4. Conclusions 

Ncces.sary and sufficient condition.s are given, which solve 
the problem of testing for robust stability of a discrete system 
for the general case of any continuous dependency of the 
coefficients of the characteristic polynomial (or those of the 
state matrix) on two interval parameters. 

The price for this generality is in computation complexity. 
However, at least for polynomial dependency (which 
includes, as special cases, the polytopic case and the 
multi-linear case) the number of required computational 
steps is finite, in reducing the problem into that of finding the 
roots of polynomials with fixed coefficients. 

In many cases the computations may be simplified. It will 
be useful to investigate special cases where the computa¬ 
tional complexity may be reduced. 
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Editorial Changes 


A YEAR AGO, in the March 1992 issue, several editorial staff 
changes were described. Now there is another major change: 
Dr H. Austin Spang Ill, who has been evaluating papers in 
all fields of Control Applications (including computer 
control, software, computer aided control system design, as 
well as implementation and operational evaluation) for over 
24 years, ever since Automa/ica became the IFAC Journal, 
wishes to retire as an editor of Automatica at the end of 
1993. Through these many years he has provided a valuable 
contribution to Automatica and IFAC by selecting 
well-written papers describing the application of control 
theory to real world problems, the ultimate goal of automatic 
control. His service and experience will be missed. Unwcver, 
we are fortunate in that he has found a very well qualified 
editor, who has been working with him as an Associate 
Editor for a number of years, to succeed him. This new 
editor is: 

Professor Yaman Arkun 
(jcorgia Institute of Technology 
School of Chemical Engineering 
778 Atlantic Drive 
Atlanta, GA 30332-0UK) 

U.S.A. 

Therefore, effecti\w immediately, papers in the areas of 
interest indicated above must be sent directly to Profes.sor 
Arkun for evaluation and possible publication in Automatica. 
No papers should be sent to Di Spang—except revised 
papcTs which were previously evaluated by him. Note that he 
will continue to evaluate only revised papers through^1993. 

Meanwhile, the other Automatica Editors, K. J. Astroni, 
P M. Larsen, W. S. Levine, A P. Sage, T. Ba^tr, C. C. 
Hang, T. Soderstrom, and Deputy Editor-m-('hicf H 
Kwakernaak will continue their efforts to obtain and to 
evaluate significant contributions in their respective areas of 
interest. Again, all of them, including retiring Editors P. C. 
Parks and H. Austin Spang HI, are to be congratulated on 
the excellent work they have done and the results they have 
achieved. 

To make the editorial efforts of the editors mor*' efficient 
and rapid, authors submitting papers for possible publication 
in Automatica should do the following: (a) prepare their 
papers in the format described on the inside back cover of 
Automatica, (b) designate only one author to correspond 


with the Editor to whom five (or preferably six, for more 
rapid evaluation) copies of their paper are sent for 
evaluation, (c) send the Edilor-in-Chief a copy of the paper 
and a copy of the Idler sent to the Editor, and (d) send the 
Fax and email numliers (and the telephone number) of the 
corresponding author to the Editor and the Editoi-vn-Chicf 
for rapid communication in case questions about the paper 
arise. Items (b), (c), and (d) arc particularly important to 
avoid confusion and delay m processing submitted papers 
and tracking the progress of the reviews. 

As a reminder, the designated, corresponding author ot a 
paper having multiple authors should send five copies 
(preferably six) to the Editor whose area of mlcrcsl is most 
closely related to the subject ot the papci. it should be 
emphasized that to be considered for publication in 
Automatica, the paper must be sent to the Editor iv/ief/irr or 
not It is alsff submitted to an IFAC meeting. Also one copy 
.should be sent to the Edilor-in-('hief to be recorded in a 
central, global, file which is used for mnmioring the progress 
of the evaluation procedures. 

As the editors change .so does the list of Associate Editors 
The Automatica Editorial Board, including the Editor-in- 
Chief in particular, are deeply grateful for the outstanding 
and essential service which they and their referees provide. 

Another significant change must be noted: Automatica 
Deputy Editor-in-Chief, Huiberi Kwakernaak, has been 
designated as the Liai.son Officer with IFAL Meeting 
Organizers to assist them in selecting appropriate meeting 
papers for further review and possible publication in 
Automatica. In connection with this he is also serving as an 
Advisory Editor to help Guest Fdiiors ol Special Issues 
establish publication schedules and guidelines and to assist 
them in preparing the issues foi publication accouling to 
Automatica standards 

With these continuing changes and additions, we expect 
that Automatica will be belter prepared to meet iLitiiic 
developments m the field of automatic control and thereby 
provide increased service to IFAC and to the international 
control community. 

Cieorge S Axelby 
E.ditor-in-Chief 
A utomatica 
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Tutorial Paper 


Robust Control and ^co-Optimization—Tutorial 

Paper* 

HUIBERT KWAKERNAAKt 

Robust control systems may successfully he designed by '^^-optimization, 
in particular, by reformulating the design problem as a mixed sensitivity 

problem. 

Key Words--rf¥;-opiifnal control; robust control 


Abstract--! he paper presents a tutorial exposition of 
^1-optimal regulation theory, emphasizing the relevance of 
the mixed sensitivity problem for robust control system 
design 

1 INTRODUCTION 

Tml iNVLSTKJATiuN t)i -Optimization of control 
systems began in 1979 with a conference paper by 
Zames (1979), who considered the minimization of 
the <^»-norm of the sensitivity function of a 
single-input-single-output linear feedback system. 
The work dealt with some of the basic questions of 
“classicai” control theory, and immediately caught a 
great deal of attention. It was soon extended to more 
general problems, in particular when it was recognized 
that the approach allows dealing with robustness far 
more diiectly than other optimization methods. 

The name ‘^.-optimization” is somewhat un¬ 
fortunate. is one member of the family of spaces 
introduced by the mathematician Hardy. It is the 
space of functions on the complex plane that arc 
analytic and bounded in the right-half plane. The 
space plays an important role in the deeper 
mathematics needed to solve 7i‘,-optimal control 
problems. 

This paper presents a tutorial exposition of the 
subject. The emphasis is oii explaining the relevance 
of ^^-optimization for contio', engineering. The paper 
presents few new results, and does not at all do justice 
to the extensive theoretical and mathematical 
literature on the subject. The presentation is limited 
to single-input-single-output (SISO) control systems. 
Many of the arguments carry over to the multi-input- 

*■ Received 6 February 1992; revised 6 July 1992; received 
in final form 23 August 1992. The original version of this 
paper was presented at the IFAC Symposium on Design 
Methods of Control Systems which was held in Zurich, 
Switzerland during September 1991. The Published Proceed¬ 
ings of this IFAC Meeting may be ordered from. Pergamon 
Press Ltd, Headington Hill Hall, Oxford, 0X3 OBW, U.K. 
This paper was recommended for publication in revised form 
by Editor K, J. Astrom. 

t Systems and Control Group, Department of Applied 
Mathematics, University of Twente, P.O. Box 217, 75tM) AE 
Enschede, The Netherlands. 


multi-output case but their implementation is 
necessarily luorc complex. 

We preview some of the contents. In Section 2 we 
use Zames’ original minimum-sensitivity problem to 
introduce -optimization. Section 3 is devoted to a 
discussion of stability robustness. A well-known 
stability robustness criterion first propo.scd by Doyle 
(1979) demonstrates the relevance of the t^-norm for 
robustness. Doyle’s criterion in its original form has 
severe shortcomings, owing to the oversimplified 
representation of plant perturbations. In Section 4 it is 
explained how the criterion quite easily can he 
extended to a much more powerful result that applies 
to a very general perturbation model. In Section 5 this 
result IS used for a perturbation model that lor lack of 
a better name we refer to as numerator-denominator 
perturbations. It leads to the ‘mixed sensitivity” 
stability robustness test. It is only slightly more 
complicated than Doyle’s original test, and far less 
conservative for low-frequency perturbations. 

Minimization of the mixed sensitivity criterion 
results in “optimal” robustness. In Section 6 the 
re.sulting mixed sensitivity problem is discussed in 
some detail. It is shown that it can be used not only 
for robustness optimization or robustness improve¬ 
ment, but al.so for design for performance. The design 
method based on the mixed sensitivity criterion 
features frequency response shaping, type k control 
and specified high-frequency roll-off, and direct 
control over the closed-loop bandwidth and time 
response by means of dominant pole placement. 

To illustrate these features two design examples are 
included. In Section 7 a textbook example is discussed 
that IS simple enough to be completely transparent. In 
Section 8 the application of the mixed sensitivity 
method to a benchmark example involving ship course 
control is described. 

Sections 9-11 briefly review the theory needed to 
solve ^^-optimal regulation problems. In Section 9 it 
is shown that the mixed sensitivity problem is a special 
case of the so-called “standard” ^’.-optimal regulation 
problem. In Section 10 the frequency domain solution 
of the standard problem is outlined, while Section 11 
describes the main features of the state space solution. 
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We do not go into much detail, because the theory is 
not easy, and the development of algorithms that can 
be used unthinkingly for applications is best left to 
specialists. 

2 SENSmviTY 

^^-optimization of control systems deals with the 
minimization of the peak value of certain closed-loop 
frequency response functions. To clarify this, consider 
by way of example the basic SISO feedback system of 
Fig. 1. The plant has transfer function F and the 
compensator has transfer function C. The signal v 
represents a disturbance acting on the system and z is 
the control system output. Then from the signal 
balance equation z ~ v — PCz, with the circumflex 
denoting the Laplace transform, it follows that 
z = Sv, where 


is the sensitivity function of the feedback system. As 
the name implies, the sensitivity function characterizes 
the sensitivity of the control system output to 
disturbances. Ideally, S = 0. 

The problem originally considered by Zames (1979, 
1981) is that of finding a compensator C that makes 
the closed-loop system stable and minimizes the peak 
value of the sSensitivity function. This peak value (sec 
Fig. 2) is defined as 

||5|U = max|5(;a))|. (2) 

where IR denotes the set of real numbers. Because for 
some functions the peak value may not be assumed 
for any finite frequency, we replace the maximum 


V 



Fui. I SISO feedback loop 


sensitivity \S\ 



Fig. 2. as peak value. 



Fig. 3. Equivalent representation of the system of Fig. 1. 

here and in the following by the supremum or least 
upper bound, so that 

||5|U=sup|5(7a>)|. (3) 

III r Pi 

The justification of this problem is that if the peak 
value 1(51|«, of the sensitivity function S is small, then 
the magnitude of S necessarily is small for all 
frequencies, so that disturbances are uniformly 
attenuated over all frequencies. Minimization of ||5|L 
is worst-case optimization, because it amounts to 
minimizing the effect on the output of the worst 
disturbance (namely, a harmonic disturbance at the 
frequency where 15| has its peak value). 

The worst-case model has an important mathemati¬ 
cal interpretation. Suppose that the disturbance v has 
unknown frequency content, but finite energy ||i^||i 
The number __ 

\\vh= (4) 

is known as the 2-norm of the disturbance v. The 
energy of n is the square of the 2-norm. Then the 
norm ||5|| of the system S as in Fig. 3 with input v and 
output z induced by the 2-norm is defined as 

\\S\\= sup (5) 

II,.II, ■ Ill'll, 

Hence, in engineering terms the norm is directly 

related to the energy gain for the input with the worst 
possible frequency distribution. Using Parsevafs 
theorem, it is not difficult to recognize that 

P'I| = I|5|U. (6) 

Hence, the peak value is precisely the norm of the 
system induced by the 2-norms on the input and 
output signals. This norm is known as the ^-norm of 
the system. 

It follows that 'Jf^-opiimizdUoT] is concerned with 
the minimization of system norms. It is useful to be 
aware of this when studying theoretical papers on 
^..-optimization. 

Worst-case optimization suggests a game theory 
paradigm: The designer wishes to determine the 

compensator C that offers the best protection against 
the worst disturbance that nature has in store. This 
explains why in many theoretical papers 
oplimization is treated from the point of view of 
differential game theory. 

A little contemplation reveals that minimization of 
||5||.» as it stands is not a useful design tool. The 
frequency response function of every physical plant 
and compensator decreases at high frequencies. This 
means that often the sensitivity S can be made small at 
low frequencies but eventually reaches the asymptotic 
value one for high frequencies. Just how small S is at 
low frequencies is not reflected in the peak value but 
is of paramount importance for the control system 
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pertormance For this reason, it is customary to 
introduce a frequency dependent weighting function 
W and consider the mmimizdlion of 

\\WS\\^ = sup \W(jwS(jw)\ (7) 

C'hnractenstically, VV^ is large at low frequencies but 
decreases at high fiequencies 

I he weighted sensitivity minimization problem thus 
dehned has interesting aspects Unfortunately, it does 
not account adequately for basic bandwidth limita¬ 
tions owing to restricted plant capacity, caused by the 
inability of the plant to absorb inputs that are too 
large Before going into this deeper we consider the 
question of robustness 


with S^i the nominal sensitivity function Then it 
follows from (9) that if 


\L(jw) ~ U(jio)\ 

\UM\ 


\UjiO)\< 1 , 


for all a> e IR p 


( 11 ) 


the perturbed closed-loop system is stable 

The factor \L(ja)) - L^^(jio)\/\Ln(jo>)\ in this ex¬ 
pression is the relative size of the perturbation of the 
loop gam L from its nominal value L,, Suppose that 
this relative perturbation as a function of frequency is 
known to be bounded by 


\ L(jio)- LyijW)\ 
\U(}io)\ 


|Wf/o))|. for dll loeU, (12) 


3 ROBUSINFSS 

We illustrate the connection between peak value 
minimi/dtion and design for robustness by considering 
in Fig 4 the Nyquist plot ol the loop gain [ = PC of 
the SISC) feedback system ot F-ig I In partnular, we 
study whether the leedback system remains stable 
under a perturbation of the loop gain from its nominal 
value /.„ to the actual value L 

For simplicity wc take the system to he open loop 
stable (that is L represents a stable system) 
Naturally, we also assume that the nominal closed- 
loop system IS well designed so that it is stable Then 
by the Nyquist stability criterion the Nyquist plot of 
the nominal loop gam /„ docs not Lncirclc the point 
1 1 he actual closed-loop syslem is stable if also ihe 

loop gain I docs not enciiclc the fioint I 

ft is easy to see by inspection of Fig 4 that the 
Nyquist plot I clelinitcly dots not Lnciiclc the point 
I if for cvciy frequency m the distance \L{ju)) 

I i,(j(if)\ between any point / (yto) on the plot ot / and 
the coiicsponding point on the plot ol /.„ is 

less than the distance 1| between th^ point 

LJj(u) and tliL point 1 <hat is il 

\lAj(f>) - I yy{jio)\' |/,)(/f/;) 4- Ij for all (/; c H (H) 


I his IS equivalent to 

_l^ 1 , tor all to e ir.< 

|L„(/m)l \Lu{jv>) 4 1| 




Define the complcmenl.iry isitivity function 7„ of 
the nominal closed-loop system as 


i Lyy 


(H)) 


Im 



with W a given frequency dependent function Then 
\L(j(o)- Luijio)\ 


„ \Ujoj)\ 
\L(jio)\ 


|VV(y(0)| 


Hence, if 


\Wij(o)Uj(o)\ 
\Wij(omjio)\ (H) 


IW(yfu)/i’tfyo))! ‘ I for all (rj e [R 


(14) 


bv (11) the closed-loop system is stable for all 
perturbations bounded by (J2) Indeed, it may be 
shown that the condition (14) is not only sufficient but 
also necessary for the closed-loop system to be stable 
for all perturbations bounded by (12) 

We obtained the condition (14) under the 
u sumption that the open-loop systcMii is stable It may 
be proved that it also holds for open-loop unstable 
systems, as long as the nominal and the perturbed 
open-loop system have the same number of right-half 
plane poles \hc result may also be extended to 
multivariable systems (Doyle 1979) 

Using the norm notation introduced in ('\) the 
condition (14) for robust stability may be rcwnllen as 


l|W7;,iu-' i (IS) 

lliis explicitly demonstrates the relevance of the 
jt-norm, that is, the peak value for robustness 

characterization The peak value criterion arises from 
the Nyquist stability criterion, which forbids the 
Nyquist plot ol Ihe loop gain to cross the point 1 
For stability robustness the feedback system need 
be designed such that is less than one It is 

tempting lo consider the problem of minimizing the 
norm 1|W/„|| with respect to all compensators that 
stabilize the closed-loop system as a way of optimizing 
robustness Stabilit> seldom is the sole design target, 
though and robustness optimization may easily lead 
to useless results If the system is open-loop stable, for 
instance and all perturbations that may arise leave it 
stable ||W7„||. niay be made equal to zero, and, 
hence, minimal, hy simply letting C = 0, so that also 
- U and fli — d This optimizes s^lblllty robustness, 
but does nothing to improve control system 
performance, such as its sensitivity and response 
properties In Section S we introduce an alternative 
stability robustness criterion that allows consideration 
ot the response properties as well 
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It is important to note that although it looks very 
plausible to characterize the plant perturbations by 
the bound (12), the bound may in fact allow far more 
perturbations than may actually occur. By way of 
example, suppose that the perturbations are caused by 
variation of a single parameter. Tlien if these 
variations have an important effect on the loop gain it 
may be necessary to choose W quite large to satisfy 
the bound. This large bound also allows many other 
perturbations, however, including such that change 
the order of the plant. Because of this the robustness 
stability test (J4) may easily fail even if the actual 
parameter perturbations do not destabilize the 
feedback system. 

This phenomenon, usually referred to as conserva¬ 
tiveness, seriously handicaps the applicability of the 
robustness stability analysis described in this section. 
The model is mainly suited to deal with high- 
frequency uncertainty caused by parasitic effects and 
unmodelled dynamics that cause the envelope defined 
by (12) to be densely filled. 

4 A GENERAL PERTURBATION MODEL 
In the preceding section we considered perturba¬ 
tions of the form L(,—►L, where \L{j(iy) - Ln{j(o)\l 
\LQ{jo})\S:\W{j(D)\ for all ajelR. Equivalently, we 
may write 

Ln^Lo(l W), (16) 

where is any frequency dependent function such 
that \bi[ja))\ < 1 for all cu e IR, that is, such that 

(J7) 



Fig. 6. General perturbation model. 

the inequality ||//d|| ^ ||/f || • \\d\\ this is guaranteed if 
||/f|| • ||6|| < 1. Taking in particular the oo-norm it 
follows that the perturbed system is stable for all 
perturbations 6 whose oo-norm is, at most 1 if 

II^IU-1. (IK) 

The perturbation model of Fig, 6 with the 
corresponding condition (18) for stability with respect 
to all perturbations such that is simple yet 

very general. The condition (18) is not only sufficient 
but also necessary. 

To illustrate its application we specialize the result 
to the configuration of Fig. 5. H is the transfer 
function from p to ^ after opening the loop by 
removing the block ‘ 6”. Inspection of Fig. 5 shows 
that 2 -p - so that 


By further inspection we have 

q=- WLJ = - W -4^ p = - Wr,p. (20) 

1 E() 


This jjerturbation may be represented as in the block 
diagram of Fig. 5. Because at this point we are only 
interested in stability the disturbance has been 
omitted. The functions W and are represented as 
frequency response functions of stable systems. 

Figure 5 is a special case of the configuration of Fig. 
6, where H represents the dashed block in Fig. 5. In 
the block diagram of Fig. 6 the perturbation d is 
isolated from the rest of the system H. 

By the small gain theorem (see for instance Desoer 
and Vidyasagar, 197!i) a sufficient condition for the 
closed-loop sy.stem of Fig. t to be stable is that the 
norm \\Hd\\ of the loop map Hd be le.ss than 1. By 



H 

Fig. 5. Feeback loop with perturbation. 


.so that the transfer function H is given by 

H = ( 21 ) 

Hence, by (18) stability under perturbation is 
guaranteed if 

||W7;,|U<1. (22) 

This is precisely the stability robustness criterion of 
Section 3. 

The general model of Fig. 6 with the necessary and 
sufficient condition ||//||.<1 for robust stability with 
respect to all perturbations such that ||6|1, ^ 1 applies 
to SISO as well as MIMO systems. It was conceived 
by Doyle (1984). To use the condition for MIMO 
systems we need discuss how the ^-norm is defined for 
such systems. Consider a stable MIMO system with 
input u, output y and transfer matrix F as in Fig. 7. 
The ao-norm of the system is the norm induced by the 

iiOriTis 



Fig. 7. MIMO system. 
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of the input and output, respectively. The superscript 
” denotes the complex conjugate transpose. Again 
using ParsevaFs theorem, it may be shown that the 
norm of the system induced by these signal norms is 

|lF|U = sup \\F()(i))\\,^ (24) 

where foi a constant complex-valued matrix A the 
notation ||A ||2 indicates the spectral norm 

||A||. = max a,(A), (25) 

with (7, the rth singular valuet. It follows from 
(24)-(25) that the oo-norm l|f'||,. of the stable system 
with transfer matrix F is found by first computing for 
each frequency the largest singulai value of the 
frequency response matrix F(jo)), and then taking the 
maximum of all these largest singular values over 
frequency. 

In the following section we discuss a further 
application of the basic stability robustness result, 

5. NUMERA rOR- Or.NOMIN \ tOR 
PERTURBATIONS 

In this section wc discuss the application of the 
general stability robustness result to what we call 
numerator-denominator perturbations, or copnmc 
factor uncertainty, as they are also known. The model 
may be traced to Vidyasagai (Vidyasagar ct ai, 1982; 
Vidyasagar, 1985) and Kwakernaak (1983, 1986). It 
relies on the representation ot the plant transfer 
function P in the block diagram of Pig. I m fractional 
form as 


P 


N 

T) 


(26) 


In particular, if Pis rational, N obviously can be taken 
as the numerator polynomial and D as the 
denominator polynomial. It is not necessary to do 
this, however, and indeed it sometimes is useful to 
arrange N and D differently. The numerator 
denominator perturbation model represents perturba¬ 
tions in the form 


On + ’ 


(27) 


where the subscripts on N and D denote the nominal 
system. Ihe terms Md,)Wi and model the 

uncertainty in the denominator and the numerator 
respectively. The frequency dependent functions A/W, 
and AfWi represent the largest possible perturbations 
of the denominator and numerator, respectively, and 
dit and 6s are frequency dependent functions of 
magnitude not greater than one. The factor M is 
included for added flexibility. Its use (foi partial pole 
placement) becomes clear in Section 6. 


t If i4 is an n X m matrix, the singular values of A are the 
min {n, m) large.st of the m nonnegative numbers V^A A), 
i = 1, 2, . . . , m. Here A, denole.s the ith eigenvalue. The 
singular v alues arc also the min(n,rrj) largest of ihe n 
numbers VA,(i4A"), / - 1, 2, . . . , 


u 


( 

r 




^2 






^0 



M 


M; 0 — 


r 


If 

1 

L 

[- 


Fig. H Numeratoi-denominator perturbation model. 


It is not difikult to check that the perturbation (27) 
may be represented as in the block diagram of Fig. 8. 
Note that the perturbation appears in a feedback 
loop. Including the plant m the control system 
configuration of Fig. 1 the block diagram may be 
arranged as in Fig. 9, where the block ""dp" is 
described b . 





(h 


q 


(28) 


The dashed lines in Fig. 9 indicate how the situation 
may be reduced to the perturbation model of Fig. 6. 
For the application of condition (18) for robust 
stability wc need to assume that the closed-loop 
sy.stcm is stable, and that moreover M, W,, and W, 
have all their poles in the open lefl-half plane. 

The block marked "H" has input p and output 
q = col (^,, ^ 2 ) To find the transfer matrix H, we first 
inspect Fig. 9 to establish the signal balance equation 
f = Du '{Mp - It follows that 


, Du 'M 


(29) 


where we define V = /!„ 'M. By further inspection we 
see that 


W,V 

w.cv 

q,= -W.(z = -^- p = -W. UuVp, 


(30) 



Fig. Numeralor-denomin'dtor perturbation model in 
feedback loop. 







260 


H. Kwakernaak 


where 




1 

l + /^oC' 




c 

1-fPoC' 


(31) 


are the nominal sensitivity function and the nominal 
input sensitivity function, respectively, of the feed¬ 
back system. The input sensitivity function V is the 
transfer function from the disturbance to the plant 
input. It is related to the complementary sensitivity 
function 7 as 7 = PU. 

From (30) we see that the transfer matrix H in Fig. 
9 is given by 

»-[ (32) 


It is easy to check that the 1 x 2 matrix A = [^4, ^42] 
has the single singular value Vl>4,|^ -I- Hence, 

the square of the oo-norm of the perturbation 
6p = [-dn is 


||<5||^. - sup (\6oU^)\^ + (33) 


The 2x1 mat rix A = lA^ A 2 V also has the single 
singular value . Hence, the square of the 

®-norm of the system with transfer matrix H as given 
by (32) is 

||//||i= sup (|H'|(yw)\(yai)V(y(y)|^ 

(i^cR 

+ |W2(ya))U.(yw)l/(yoj)|^). (34) 

It follows that the closed-loop system is stable for all 
numerator-denominator perturbations 
satisfying the bound 

I<^d(/^w)I^ 4" 1* for all weIR, (35) 

if and only if the sensitivity function and the input 
sensitivity function L/i, satisfy the inequality 

\mjw)S,ij(o)V(j(o)\^ 

-^mjio)U,(jio)V{jw)\\^<\, (36) 

for all oj e IR. 


Alternative interpretation 

It is useful to consider an alternative interpretation 
of this stability robustness result. For the numerator- 


denominator perturbation model (27) the relative 
perturbation of the denominator is given by 


D-Do 


A, 


A, 




(37) 


where w^ = VWl. Similarly, the relative perturbation 
of the numerator is 


N-M, Md^W, 

-=-= —;— O/v = W.Oj 

N,, /Vn Pu 


(38) 


where W 2 =W 2 VtPu- From (35) it follows with 
(37)-(38) that we consider perturbations satisfying 


D-D„ 


N-N„ 

D„ 


N„ 



W 2 


(39) 


on the imaginary axis. Substituting V = k’, and 
W 2 V = WjA into (36) we see that the system is 
robustly stable for such perturbations if and only if 

+ |7iW 2|" < 1 ,on the imaginary axis, (40) 

since 7,, A = 

By (39), the functions ►v, and w. are measures for 
the relative sizes of the perturbations in the 
denominator and the numerator of the plant transfer 
matrix 7, respectively. The stability robustness test 
(40) then indicates that the nominal sensitivity 
function 5,, should be small for those frequencies 
where the relative perturbations in the denominator 
are large, and that the nominal complementary 
sensitivity function 7, should be small for those 
frequencies where the relative perturbations in the 
numerator arc large. 

Conversely, this interpretation provides us with an 
indication how to model perturbations compatibly 
with performance requirements. Customary perfor¬ 
mance speciheations require the sensitivity function to 
be small at low frequencies, and to level off to one at 
high frequencies (see Fig. 10). Designing the system 


complementary 

sensitivity sensitivity |7| 




Fig. 10. Bode magnitude plots of typical sensitivity and complementary .sensitivity functions. 
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this way ensures disturbance attentuation over the 
widest possible frequency range given the plant 
capacity (that is, given the largest inputs the plant can 
absorb). 

This means that low-frequency perturbations are 
best modeled as denominator perturbations. The most 
important low-frequency perturbations arc normally 
caused by parameter uncertainty, often referred to as 
.structured uncertainty. On the other hand, high- 
frequency perturbations are best modeled as numera¬ 
tor perturbations, because (complementarily—see Fig. 
10) the complementary sensitivity function 7;, is small 
at high frequencies. High-frequency perturbations are 
usually caused bv parasitic effects and unmodeled 
dynamics, often known as unstructured uncertainty. 

The examples in Sections 7 and 8 illustrate the 
application of these ideas. 

6. THF MIXED SENSITIVITY PROBLEM 

In Section 5 it was found that stability robustness 
with respect to numerator- denominator perturbations 
such that 


16z,(7(n)|'T - 1. wtU, (4») 

IS guaranteed if ||//||, ^ I, where (omitting the 
subscripts on .V and U) 


l|//||^--sup(|H',(yo.)Y(y^i;)V(yrn)|’ 

-h\WA]io)U{jo>)V{jw)\^^) (42) 


(liven a feedback system with compensator C that 
docs not satisfy this inequality one may look for a 
different compensator (hat does achieve inequality 
An effective way of doing this is to consider the 
problem of minimizing |1H||, with respect to all 
compensators (' that stabilize the system. If the 
minimal value of ll//|l. is greater than 1, no 
compensator exists that stabilizes the systems for all 
perturbations satisfying (41). In this case, stability 
robustness is only obtained for perturbations satisfying 
(41) with the right-hand side replaced with 1/A‘. 

The problem of minimizing 


W,SV 
W, IJV 


(43) 


(Kwakernaak, 1983, 1985) is a version of what is 
known as the mixed sensitivity problem (Verma and 
Jonckheere, 1984). The name derives from the fact 
that the optimization involves both the sensitivity and 
the input sensitivity function (or, m other versions, 
the complementary sensitivity function). 

In what follows we explain that the mixed sensitivity 
problem cannot only be used to verify stability 
robustness for a class of perturbations, but also to 
achieve a number of important design targets for the 
one-degree-of-freedom feedback configuration of 

Fig. 1. 


Frequency response shaping 
The mixed sensitivity problem may be used for 
performance design by shaping the sensitivity and 
input sensitivity functions. The reason is that the 


solution of the mixed sensitivity problem has the 
property that the frequency dependent function 


\WAjw)S(jio)V(ja))f f \WAj(o)U(jio)Vijo})\\ (44) 

whose peak value is minimized, actually is a constant 
(Kwakernaak, 1985). This is known as the equalizing 
property. If we denote the constant as A*, with A 
nonnegative, it immediately follows from 


mj(o)S(jw)V{jw)\^ I- \WAjw)Uijw)V{j(o)\^ 


that for the optimal solution 


A^ 

(45) 


\WAj(o)S(j(i})V{jio)f r A\ fu e IR, 
\WAi(o)U(jio)V(jw)f - Al wlU. 

Hence, 


|.S(yw)| 


_ 

|W,(/w)K(7f.7)i' 


fo c I 


\IA •u')l' 


X 

TOyriO’v'r/w)! ’ 


fu ^ [R. 


(46) 


(47) 

(4S) 


By choosing the functions, VV',, W., and V suitably S 
and U may be made small in appropriate frequency 
regions. 

If the weighting functions are appropriately chosen 
(in particular, with W^V large at low frequencies and 
VTpT large at high frequencies) often the solution of 
the mixed sensitivity problem has the property that 
the first term of the criterion dominates at low 
frequencies and the second at high frequencies: 

|Wi(/>i;).V(yj.;)V(;r,0i; 
dominates al low frequencies 


-f 1 WAja))U{ j(x) 

s_ 

11 

(49) 

--- 

dominates at high frequencies 


As a result. 




for ii) small. 

(5(1) 

\WAji»)v\jo>)\ ■ 

tor (I) large. 

(51) 


This allows quite effective control ovei the shape of 
the sensitivity and input sensitivity functions, and, 
hence, over the performance ot the feedback system. 


lype k control and high-Jrequency roll-off 

In (50)-(51), equality may often be achieved 
asymptotically. Suppose that IW|(/(U)\A(yfu)l behaves 
as 1/cr)^ as cu—►(). This is the case if W,(.s)T(.s) 
includes a factor in the denominator. Then |.SXyo))| 
behaves as a/ as w-->(), which implies a type k 
control system, with excellent low-frequency distur¬ 
bance attenuation if A: 1. If k = 1, the system has an 
integrating action. 

Likewise, suppose that \WAi(o)V(jw)\ behaves as 
(o"‘ as fu—This is the case if W.V is nonproper, 
that IS, if the degree of the numerator exceed' that of 
the denominator (by m). Then |(/(yfu)| behaves as 
0 } as at > From (/ = -C/( 1 + PC) it follows that 
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C = - (//(I H- UP). Hence, if P is strictly proper and 
also C behaves as <d and T = PC/{I PC) 

behaves as with e the |x»le excess of P. This 

means that by choosing m we pre-assign the 
high-frequency roll-off of the compensator transfer 
function, and that of the complementary and input 
sensitivity functions. This is imp>ortant for robustness 
against high-frequency unstructured plant perturba¬ 
tions. 


Partial pole placement 

There is a further important property of the 
solution of the mixed sensitivity problem that needs to 
be discussed before considering an example. This 
concerns a pole cancellation phenomenon that is 
sometimes misunderstood. First note that the 
equalizing proF)erty implies that 

WXs)W,(-s)Sis)S{-s)V(s)V{-s) 

^ W2C0W2(-^)f;(5)C(-j)V(5)V(-.y) = k\ (52) 

for all s in the complex plane. Next we write the 
transfer function P and the weighting functions IV,. 
W 2 . and V in rational form as 


P 


N 





(53) 


with all numerators and denominators polynomials. 
Note that at this point we do not ncce.ssarily take the 
denominator of V equal to D as before. Then if the 
compensator transfer function is represented in 
rational form as C = YlX it easily follows that 


5 = 


DX 
DX^ NY' 


(7 = 


DY 

DX ^NY 


(54) 


The denominator 


A, = DX^ NY, (55) 


is the closed-loop characteristic polynomial of the 
feedback system. Substituting S and U we easily 
obtain from (52) that 

D D M M (A, A^B2B2X X ^A^A2B,B,Y Y) 

E E B;B^ B 2 B 2 

= A^ (56) 


where if A is any rational or polynomial function, A 
is defined by A (j) = /4(-j). 

Since the right-hand side of (56) is a constant, all 
factors in the numerator of the rational function on 
the left cancel against corresponding factors in the 
denominator. In particular, the factor D D cancels. If 
there are no cancellations between D D and 
E EBI BiBjBj, the closed-loop iharacteri.stic poly¬ 
nomial (which by stability has left-half plane roots 
only) necessarily has among its roots the roots of D, 
where any roots of D in the right-half plane are 
mirrored into the left-half plane. 

This means that the open-loop poles (the roots of 
D), possibly after having been mirrored into the 
left-half plane, reappear as closed-loop poles. This 
phenomenon, which is not propitious for a good 
design, may be avoided, and indeed, turned into an 


advantage, by choosing the denominator polynomial 
E of V equal to the plant denominator polynomial D, 
so that 



With this special choice of the denominator of V, the 
polynomial E cancels against D in (56), so that the 
open-loop poles do not reappear as closed-loop poles. 

Further inspection of (56) shows that if there are no 
cancellations between M M and E EB^ B 2 , and 
we assume without loss of generality that M has 
left-half plane roots only, the polynomial M cancels 
against a corresponding factor in If we take V 
proper (which ensures V{jw) to be finite at high 
frequencies) the polynomial M has the same degree as 
D, and, hence, the same number of roots as D. This 
means that choosing M is equivalent to reassigning the 
open-loop poles (the roots of D) to the locations of 
the roots of M. By suitably choosing the remaining 
weighting functions W, and W 2 these roots may often 
be arranged to be the dominant poles. 

This technique, known as partial pole placement 
(Kwakernaak, 1986; Posticthwaitc et al., 1990), allows 
further control over the design. It is very useful in 
designing for a given bandwidth and good time 
response properties. 

In the design examples in Sections 7 and K it is 
illustrated how the ideas of partial pole placement and 
frequency shaping are combined. 

A fuller account of pole -zero cancellation phenom¬ 
ena in ^.-optimization problems is given by Sefton 
and Glover (1990). 

Design for robustness 

As we have seen, the mixed sensitivity problem is a 
promising tool for frequency response shaping. By 
appropriate choices of the functions V, W,, and VV?, 
the sensitivity function may be made small at low 
frequencies and the input sensitivity function (or 
equivalently, the complementary sensitivity function) 
small at high frequencies. These are necessary 
requirements for the system to perform adequately, 
that is, to attenuate disturbances sufficiently given the 
plant capacity and presence of measurement noise. 

On the other hand, as seen at the end of Section 5, 
a small .sensitivity function S at low frequencies 
provides robustness against low-frequency perturba¬ 
tions in the plant denominator while a small 
complementary sensitivity function T at high fre¬ 
quencies protects against high-frequency perturbations 
in the plant numerator. Investigation of the 
low-frequency behavior of S and the high-frequency 
behavior of T permits to estimate the maximal size of 
the allowable perturbations. Conversely, any informa¬ 
tion that is available about the size of the 
perturbations may be used to select the weighting 
functions V, W, and W,. The choice of these functions 
generally involves considerations about both perfor¬ 
mance and robustness. These design targets are not 
necessarily incompatible or competitive. 

It is clear that the crossover region is critical for 
robustness. The crossover region is the frequency 
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region where the magnitude of the loop gain 
L ^ PC which generally is large for low frequencies 
and small for high frequencies—crosses over from 
values greater than 1 to values less than 1. In this 
region, neither the sensitivity function nor the 
complementary sensitivity function is small. 

Only for minimum-phase plants—that is, plants 
whose poles and zeros are all in the left-half 
plane—the sensitivity and complementary sensitivity 
functions can be molded more or less at will. The 
presence of right-half plane zeros or poles imposes 
im]X)rtant constraints* Right-half plane zeros constrain 
the bandwidth below which effective disturbance 
attenuation is possible, that is, for which S can be 
made small. In tact, the largest achievable bandwidth 
is determined by the right-half plane zero closest to 
the origin. Right-half plane poles limit the bandwidth 
of the complementary sensitivity function T, that is, 
the frequency above which T starts to roll off. Here, 
the smallest possible bandwidth is determined by the 
right-half plane pole furthest from the origin. If the 
plant has both right-half plane poles and right-half 
plane zeros the difficulties are aggravated. Especially 
if the right-half plane poles and zeros arc close 
considerable peaking of the sensitivity and com¬ 
plementary sensitivity functions occurs 

These results arc discussed at length by Hngell 
(1%8) for the SISO case and Freudenberg and Looze 
(1988) for the scalar and multivariable cases. The 
results show that plants with right-half plane poles and 
zeros have serious robustness handicaps. 


7 EXAMPI.E 1: DOUHl E IN I EGRATOR 
In this section wc illustrate the application of the 
mixed sensitivity problem to a textbook style design 
example that is simple enough to be completely 
transparent. Consider a SISC) plant with nominal 
transfer function 


n(.») = A. (58) 

The actual, perturbed plant has the transfer function 


no¬ 




(59) 


where g is nominally one and the parasitic time 
constant 0 is nominally 0. 

We stau with a preliminary robustness analysis. The 
variations in the parasitic time constant 6 mainly 
cause high-frequency perturbations, while the low- 
frequency perturbations are primarily the effect of the 
variations in the gain Accordingly, we model the 
effect of the time constant as a numerator 
perturbation, and the gain variations as denominator 
perturbations, and write 


1 


Pis) = 


1 -f 


( 60 ) 


Correspondingly, the relative perturbations of the 


denominator and the numerator are 


A)(0 g 

Nis)-N,(.s) -se 
NJs) 1-1-50 


(61) 

(62) 


The relative perturbation (61) of the denominator is 
constant over all frequencies, hence also in the 
crossover region. Because the plant is minimum- 
phase, trouble-free crossover may be achieved (that 
is, without undue peaking of the sensitivity and 
complementary sensitivity functions) and, hence, we 
expect that—in the absence of other perturbations— 
variations in |l/g - 1| up to almt>st 1 will be tolerated. 

The size of the relative perturbation (62) of the 
numerator is less than 1 for frequencies below 1/0, 
and equal to 1 for high frequencies. To prevent 
destabilization it is advisable to make the complemen¬ 
tary sensitivity small for frequencies greater than 1/0 
As the complementary sensitivity starts to decrease at 
the closed-l(M)p bandwidth, the largest possible value 
of 0 dictates the bandwidth. Assuming that perfor¬ 
mance requirements specify the system to have a 
closed-loop bandwidth ot 1, wc expect that - -in the 
absence of other perturbations- values of the parasitic 
time constant 0 up to 1 will not destabilize the system. 

Thus, both for robustness and performance, we aim 
at a closed-loop bandwidth of 1 with small sensitivity 
at low frequencies and a sufficiently fast decicase of 
the complementary sensitivity at high frequencies with 
a smooth transition in the crossover region. To 
accomplish this with a mixed sensitivity design, wc 
successively consider the choice of the functions 
V = M/D (that IS, of the polynomial M), W^ and W,. 

To obtain a good time response corresponding to 
the bandwidth 1, which does not suffci from 
sluggishness or excessive overshoot, we assign two 
dominant poles to the locations '\/2(-l ±j) This is 
achieved by choosing the polynomial M as 

M(.v)-|,i - (\/2(-l +»||.9- ,'.V2(-1 -»| 

= r'+.T\/24l, (63) 


so t)ial 


.r + ^\/2+l 

V(.v) = 


(64) 


Wc choose the weighting function W, equal to I. Then 
if the (irst of the two terms of the mixed sensitivity 
criterion dominates at low frequencies we have 
|.5|-|A|/|m,|. or 


|5(/m)| - |A| 


{j(oy -H j(o\/2 -K 1 


(65) 


at low frequencies. Figure 11 shows the Bode 
magnitude plot of the factor 1/K. which implies a very 
good low-frequency behavior of the sensitivity 
function. Owing io the presence of the double 
open-loop pole at the origin the feedback system is of 
type 2. 

Next contemplate the high-frequency behavior. For 
high frequencies V is constant and equal to 1. 



264 


H. Kwakernaak 



Consider choosing as 

= r(i-h r^), (66) 

with c and r nonnegative constants such that c # 0. 
Then for high frequencies the magnitude of MA 
asymptotically behaves as c if r = 0, and as cru) if 
r 0. 

Hence, if r = 0, the high-frequency roll-off of the 
input sensitivity function IJ and the compensator 
transfer function C is 0 and that of the complementary 
sensitivity T is 2 decades/decade (40 dB/decade). 

If r^O, 1) and C roll off at 1 decade/dccade 
(20dB/decade), and T rolls off at 3 decades/decade 
(60 dB/decade). 

We first study the case r = 0, which results in a 
proper but not strictly proper compensator transfer 
function C, and a high-frequency roll-off of T of 2 
dccades/decade. Figure 12 sht)ws the sensitivity 
function .S' and the complementary sensitivity function 
T for c = 1/100, r = l/l0, f = 1, and c = Ml 
Inspection shows that as c increases, ITI decreases and 
\S\ increases, which conforms to expectation. The 


smaller c is, the closer the shape of \S\ is to that of 
Fig. 11. 

We choose r = l/10. This makes the sensitivity 
small with little peaking at the cut-off frequency. The 
corresponding optimal compensator has the transfer 
function 


C(.y) = 1.2586 


.y+ 0.61967 
1 +0.15563.V’ 


(67) 


and results in the closed-loop poles i\/2(-l ±y) and 
—5.0114. The two former poles dominate the latter 
pole, as planned. The minimal ^-norm is ||/Y||.^ = 
1.2861. 

Robustness against high-frequency perturbations 
may be improved by making the complementary 
sensitivity function T decrease faster at high 
frequencies. This is accomplished by taking the 
constant r nonzero. Inspection of U/, as given by (66) 
shows that by choosing r = 1 the resulting extra 
roll-off of U, C, and T sets in at the frequency 1. For 
r = 1/10 the break point is shifted to the frequency 10. 
Figure 13 shows the resulting magnitude plots. For 
r = l/10 the sensitivity function has little extra 
peaking while starting at the frequency 10 the 
complementary sen.sitivity function rolls off at a rate 
of 3 decades/dccade. The corresponding optimal 
compensator transfer function is 


CCy)= 1.2107 


+0.5987 

1 +0.20355,y + 0.01267s'’ 


( 68 ) 


which results in the closed-loop poles Ij) 

and —7.3281 ±71.8765. Again the former two poles 
dominate the latter. The minimal ^-norm is 
||//|U = 1.3833. 

Inspection of the two compensators (67) and (68) 
shows that both basically are PD compensators with 
high-frequency roll-off. The optimal compensators 
were computed using a MATLAB package for the 
.solution of ^^-optimization problems (Kwakernaak, 
1990b) based on the polynomial method of Section 10. 


complementary 


aenaitivity \S\ aenaitivity |r| 



Fkj. 12. Bode magnitude plots of S and Tfor r = 0. 
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S 
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FKi 14 Slability region 

We LOJiclude this example with a brief analysis to 
check whether our expectations about robustness have 
come true Given the compensator C=YIX the 
dosed loop characteristic polynomial of the perturbed 
plant IS «(T)A'(s) + /V(s)y'(s) = (l + i«)v'A'(v) + 
gY(s) By straightforward root locus computation,! 
which involves hxing one of the two parameters g and 
0 and varying the olhei, the stability region of Fig 14 
may be established for the compensator (67) Thai for 
the other compensator is similar Ihe diagram shows 
that lor 0^0 the doscd-lo^p system is stable lor all 
that IS, for all — 1 ^ i/g - 1" ^ This stability 
interval is larger than predicted For g = I the system 
IS stable lOr I 179, which also is a somewhat 

larger interval than expected 

8 LXAMPLE 2 SHIP COURSF CONTROL 
In this section we discuss a more concrete design 
problem It deals with the heading control of a ship 
moving at constant velocity, and is included in the 
1990 IFAC Benchmark Problems for Control System 
Design (Astrom, 1990) Our presentation is related to 
the discussion in a recent doctoral dissertation by 


Lundh (1991) The ship transfer function from the 
rudder angle to the yaw angle is 


sis ■Fa^,){s + M,) ’ 


(69) 


where the values of the parameters 6,, a^, and a,, 
depend upon the operating conditions, including 
speed, trim, and loading In Table 1 the parameter 
values are given for five operating conditions The 
table also includes a set of values that were chosen to 
represent the “nominal” plant The table shows that 
the sign of the pole a, depends on the operating 
conditions, so that the number of unstable open-loop 
poles IS not constant Lundh (1991) formulates the 
following design specifications 

(J) constant load disturbances at the plant input are 
rejected at the output, 

(2) the closed-loop system is stable at all operating 
conditions 

We first discuss requirement 1, that constant input 
load disturbances be rejected In the configuration of 
Fig 1 the transfer function from a load disturbance 
that IS additive to the plant input to the control system 

output Z IS 

R =. _ , (70) 

1 + PC DX +NY 


t At the suggestion of one of Ihe reviewers The author is vvherc we wnte P = N/D, C = Y/X Inspection shows 
indehted to this and the other reviewers for many positive constant input load rejection the denominator 

jnd constructive comments 






266 


H. Kwakernaak 


polynomial X of the compensator needs to contain a 
factor s, that is, the compensator should have 
integrating action. 

Another way of looking at this requirement is to 
recognize that the transfer function R is related to the 
sensitivity function 5 as /? = PS. For low frequencies 
the ship transfer function P is proportional to lAv. 
Hence, for R to behave as s, the sensitivity function S 
should behave as at low frequencies. In view of the 
asymptotic analysis of Section 6 this means that the 
product of weighting functions should behave as 
l/j^ In Section 6 we chose V = MID. Wx = Ax/B^. 
For the example at hand, D contains a factor s, so that 
V already has this factor in the denominator. To 
provide the additional factor s needed in the 
denominator of Wi V we modify the weighting function 
W, to 


W,(s) = - 




(71) 


relative 

denominator perturbations 


10“ 



10 ^ 10 ‘ 10 10 ‘ 10 “ 


with (Oi the frequency up to which the effect of the 
integrating action extends. Wl provides further 
freedom in selecting W,. This technique of choosing 
Wi to provide integrating action in the compensator 
also applies to other examples. 

Next we discuss the stability robustness specification 
2. Since we plan to use the mixed sensitivity design 
method, we consider which perturbations should be 
assigned to the plant numerator and which to the 
denominator. Obviously, the perturbation that causes 
poles to cross the imaginary axis, that is, the 
perturbations in the parameter a,, should be relegated 
to the denominator. The perturbations in the gain 
parameter 6, strongly affect the low-frequency plant 
characteristics, and therefore are. also included in the 
denominator perturbations. The variations in 
principally affect the high-frequency characteristics 
and therefore arc included in the numerator 
perturbations. The variations in the far-away pole 
are mainly important for the high-frequency behavior, 
but because of the way the pole enters P it also affects 
the gain. We therefore write the transfer function P in 
the form 


P(s)- 


where 


bffS + 1 


0i \flfi / 


bifS -f- 1 

^,i(croJ + 1)(J + 0|) ’ 


1 « “n 

«» = -, />I=7-- 

Oo ft I 


(72) 

(73) 


Including the variation of or,, in the numerator 
perturbations we thus rewrite the plant transfer 
function as 


P(s) = 


N(s) 

Dis) 


boS + 1 
ar„j + 1 


(d„j' + 1) 


j8i(.r + fl,)(d,>T + 1) ’ 


(74) 


where the overbar denotes the nominal value. 

It is easy to find that the relative perturbations of 


FiCr. 15. Magnitudes of relative denominator perturbations 
for the ship transfer function. 

the denominator may be expressed as 

D(j) - D„(s) _ (Pt - p,)s + - a,^,) 

D„(r) P,U + a,) 

with the overbars again denoting nominal values. 
Figure 15 displays magnitude plots of the relative 
perturbations for the various parameter combinations 
of Tabic 1. Wc establish a bound for the perturbations 
in a suitable form. As a preamble to this, wc note 
from Fig. 15 that above the frequency 1 the relative 
denominator perturbations are less than 1. For this 
reason, we aim at a closed-loop bandwidth of about 1. 

From (37) we see that at low frequencies the 
relative denominator perturbations need to be 
bounded by 


^\VW,\ (76) 


Choosing W, as in (71), with W^(s)— 1, wc consider 


+ )(« + !) 


(77) 


with M a polynomial of degree 3 to be chosen, and ro, 
a constant to be selected. M and w, should be 
determined such that, first, the bound (76) holds. 
Second, while doing this we need to remember that 
the roots of M reappear as closed-loop poles. Third, 
the constant ft?, delimits the frequency interval over 
which the integrating action extends. We choose 
a?, = l, in line with our decision to choose the 
closed-loop bandwidth equal to 1. Furthermore, we 
choose one of the roots of M as -2.13. This 

means that the open-loop faraway nominal pole at 
—2.13 is left in place. Next, we place the two 
remaining poles of Af in a second-order Butlerworlh 
configuration with radius 1, that is, we choose this 
pole pair as i\/2(-l ±j). The choice of the radius is 



267 


Robust control and 5if’w-optimization 



FKj 16 MagnitudLN ol iht sensitivilv and input sensitivity for the ship Lontrolkr disign 


again governed by the selection of a closed-loop 
bandwidth 1 Finally, for normalization wc provide M 
with a leading coeflftcitnl such that V(^ - 1 

Thus, we let 

M(r) = ^,(j +V2^ + lK^-fl,.) (78) 

Figure includes a plot of the magnitude of VWy if 
this magnitude is a bound for the relative denominator 
perturbations the closed-loop system is robustly 
stabilized if the minimal value of the mixed sensitivity 
criterion is less than 1 If the minimal value A is 
greater than 1 the bound needs to be rescaled to 
VWJk For the plant at hand, which has no right-half 
plane zeros, and gi\en the normalization 
V/(oo)W,(oc) = 1 ^ from experience it may be expected 
that the nimimal value A is somewhat larger than 1, 
say, between ] and 2 Figure 15 shows that a margin 
of this order of magnitude is available 
The next logical step is to analyse the numerator 
perturbations A cursory exploration that is not 
reproduced here indicates that the numerator 
perturbation present no great danger to stability 
robustness 

The final step in the preparation of the mixed 
sensitivity procedure is to choose the weighting 
function W Choosing W. constant is expected to 
make the compensator transfer function proper but 
not strictly proper In view of the normalization 
K(c^5)W,(ao)= 1, we tentdt'vely let W.(n) = 0 01 
Solution of the mixed sensitivity problem results m 
a minimal «j-norm of A = 1 0607 The optimal 
compensator transfer function is given by 

0 5468(^ + 2 1277)((j ^ 0 4351)> 0 4098 ^ 

“ s(s + 47 9401)(i + 1 0204) 

(79) 


C has a pole at 0, as expected and the translcr 
function IS propci but not slnctly propci, as predated 
The nominal closed loop poles are -45 2010, 
-2 1277 -0 7071 ±;0 7071 -1017? and - 1 (KHO 
In^y include the pre-assigned poles -2 1277 and 
—0 7071 ±y0 7071 It may be checked by direct 
computation of the closed-loop poles that the 
leedback system remains stable with good stability 
margins, at all operating points Figure 16 shows plots 
of the magnitudes of the sensitivity function S and the 
input sensitivity function U tor the various operating 
points 

For improved robustness against high frequency 
unstrucluied uncertainty it is necessary to make the 
compensator strully proper to provide roll off of the 
complemenlarv sensitivity / and the input sensitivity 
U To accomplish this we modify the weighting 
function W to 

W.(0 = ('t)l(l+~) (HO) 

The minimum J^-norm now is 1 1201 while the 
compensator transfer function takes the form 

_ 0 5177(s -i- 2 1277)((s + 0 4262) 4 0 4065^^) 

^ " 7^7 + 28 4n4)’ + (24 'i716’)(5 + 1 0204) 

(81) 

C IS strictly proper The closed-loop pole at -45 2010 
is replaced with a pole pair at -27 0438 ± y22 9860, 
while the remaining closed-loop poles alter not at all 
or very little Figure 17 shows plots of the magnitudes 
of the system functions Compared with Fig 16 there 
IS little change in S but U has the desired roll-off 
The results show that the control system specifica¬ 
tions may be met with comfortable margins They may 
be tightened by including extra specifications, such as 
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Fig. 17. Magnitudes of the sensitivity and input sensitivity for the second ship controller design. 


minimal bandwidth. Lundh (1991) considers addi¬ 
tional specifications on the peak values of S and U, on 
the responjie to input load disturbances, and on 
measurement noise sensitivity. 

The optimal compensators in this section were again 
computed using the package based on the polynomial 
method (Kwakernaak, 1990b), with a small modifica¬ 
tion needed for the pole at (1 of the weighting function 
IV,. 

9 THE STANDARD ^^-QPIIMAL REGULATOR 
PROBLEM 

The mixed sensitivity problem is a special case of 
the so-called standard i^^-optimal regulator problem. 
We introduce the standard problem by considering the 
mixed sensitivity function in the configuration of Fig. 
18. The diagram shows V as a shaping filter for the 
disturbance, and W, and as frequency dependent 
weighting functions for the control .system output and 
the plant input, respectively. The .signal w is an 
external input that drives the di.sturbance shaping 
filter V. 



It is easy to check that the Laplace transforms of 
the weighted control system output z, and the 
weighted plant input Z: are given by 


2 , = W^SVw, 
-W.UVw, 


m 


so that 2 = col (f,, z,) = with 


// = 


W,SV 

-W.UV 


(H3) 


Hence, the mixed sensitivity problem amounts to the 
minimization of the ^-norm of the transfer matrix 
from the external input w to the composite output 2 . 
The freedom available in the minimization problem 
consists of the choice of the compensator C\ 

By i.solating the compensator the block diagram of 
Fig. 18 may be represented as in Fig. 19, which is the 
configuration of the ‘"standard” -optimal regulator 
problem. The signal w is an external input, 
representing driving signals for shaping filters for 
disturbances, measurement noise, and reference 
inputs. The signal z represents a control error. 
Ideally, z is identical to zero. The signal y represents 
the measured outputs that are available for feedback. 
The signal u, finally, represent the inputs that may be 
controlled. 

The dynamics of the block G depends on the 
particular problem at hand. Inspection of Fig. 18 
shows that for the mixed sensitivity problem 


= (84) 

y = ~Vw - Pu. 


Fig. 18. The mixed sensitivity problem. 
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We rewrite this as 


i = 


W,V 


w,p 

w. 


w 

.u 


(H5) 


which defines the transfer matrix G as 


C7, 

Gt 1 f i -. 


w,v 

W,F 

0 

w. 

-V 

-F 


(Hfi) 


I’hcre are many other control problems that can be 
cast as special cases of the standard problem, 
including problems involving measurement noise and 
two-degree-of-freedom configurations. Of all these 
potential applications so far only the mixed 
sensitivity problem has been investigated in any 
depth. 

The standard problem was first discussed by Francis 
and Doyle (1987) and is treated at length by Francis 
(1987). 


10. FRFOUENC Y DOMAIN SOLUTION OF THE 
STANDARD PROBLEM 

The bulk of the research on ^^-optimization in the 
198()s was devoted to the theoretical and mathematical 
aspects of the solution of the standard and other 
problems. Although the theory by no means has 
reached full maturity, algorithms and software arc 
now becoming available for the solution of the 
standard problem. The software has not achieved a 
degree of user-friendliness, however, that affords the 
user to be totally unfamiliar with the details of the 
algorithms. For this reason we include in this paper a 
brief survey of two types of algorithms that are 
available. The present section is devoted to a 
frequency domain algorithm. Hie next section deals 
with state space algorithms. 

Because -optimization problems are basically 
frequency domain oriented, it makes sense to consider 
frequency domain solutions. The simplest of these 
solutions relies on what is called y-spectral factoriza¬ 
tion, a notion that actually is ba.sic for all solution 
methods. We outline this solution. It is ba.scd on 
Kwakernaak (1990a), which w '.urn is closely related 
to work by Green (1989). 

It is not difficult to find that the closed-loop transfer 
matrix H of the configuration of Fig. 19, that is, the 



transfer matrix from w to z, may be expressed as 

H = Gu0,2(1 ~ KGj,) 'KG.,. (87) 

It is characteristic foi all -pproaches to the solution of 
yiC-optimization problem.s that the problem of 
minimizing ||//||, is not tackled directly, but that first 
the question is studied how to determine suboptimal 
compensators. Suboptimal compensators are compen¬ 
sators that stabilize the closed-loop system and 
achieve 

\\H\U-X, ( 88 ) 

with A a given nonnegative number. Optimal 
compensators follow by finding the smallest value of A 
for which such compensators exist. 

The inequality |lf/||.^A is readily seen to be 
equivalent to 

{~)w)H{j(o) ‘ A‘7, for all w eU, (89) 

where the inequality is taken in the sen.se of 
definiteness of matrices. We write (89) more 
compactly as 

ff H A^/, on the imaginary axis, (90) 

where if // is a matrix of rational functions H is 
defined by// (s) = H'(-s). 

For reasons of exposition, consider the special ca.se 
where H - F - K, with F the tran.sfer matrix of a 
given unstable plant, and K a stable transfer matrix to 
be determined. It is easily recognized that this is a 
standaid problem with G,, ~ F, = /. and 

(i, -0. This problem, which is of more mathematical 
than practical interest, is known as the Nehari 
problem (Francis, 1987). wSubstitiiting H -- F - K into 
(90), we obtain F F-F K~K F^K Kk'l on 
the imaginary axis. This in turn we may rewrite as 

P ,1/ 


F 

-I 


K 




(91) 


n, 


on the imaginary axis, which defines the rational 
matrix 11;^. We ntiw represent the compensator 
transfer matrix K in the form 

K - YX (92) 

where Y and X are matrices of stable rational 
functions. By multiplying (91) on the right by X and 
on the left by X it follows that the inequality 
||/9||. ^A is equivalent to the inequality 


\X Y ID; 


7), on the imaginary axis. (93) 


This .simple derivation applies to the Nehan problem. 
It may be proved that also for the general case the 
inequality is equivalent to (93), with the 

matrix FIa defined as 


U,= 


0 

/ XI GiiOii 

-G’liG., j 

--G„ 

l./ 2 ? ~ O2 1 G 1 j 



-G,2 

-0.2 


■i 


(94) 


The matrix IT, is para-Hermitian, that is, -11^. If 
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detn^ has no poles and zeros on the imaginary axis, 
ri;^ may be /-spectrally factored as 

n, = Z,/Z,. (95) 

Za is a square rational matrix such that both Z^ and 
Z \' have all their poles in the open left-half plane. / is 
a constant matrix of the form 



with the two / blocks unit matrices of suitable 
dimensions. J is called the signature matrix of 
Given the factorization (95), the condition (93) may 
be rewritten as 

[X y ]ZA/Z;^|^^j ^ 0, on the imaginary axis. 

(97) 

Defining the square stable rational matrix A and the 
stable rational matrix B by 



it follows that (93) is equivalent to 

A A^B B owiht imaginary axis. (99) 
By inverting Za we find from (98) that 



This expression, together with (99), provides an 
explicit formula for all compensators K = YX ' that 
make \\H\U^X 

There are many matrices of stable rational functions 
A and B that satisfy (99). An obvious choice is A = /, 
5 = 0. This is known as the central solutionf. 

The question that remains is whether the compen¬ 
sators (100) actually stabilize the closed-loop system. 
It may be found that a necessary condition for a 
compensator given by (99)-(100) to stabilize the 
closed-loop system is that the numerator of del A have 
all its roots in the open left-half plane. It may 
furthermore be proved that if any stabilizing 
compensator exists that achieves ||//||«<A, all 
compensators such that ||//||^, follow from 
(99)-(100) with A such that detA has all its roots in 
the open left-half plane. 

These results suggest the following search 
procedure: 

(1) Choose a value of A. 

(2) Determine the/-spectral factor Za and compute 
a corresponding compensator from (99)-(l(X)) 
such that 6ctA has all its zero*^ in the left-half 
plane. An obvious possibility is to compute the 
central solution. 

(3) Check if the compensator stabilizes the 
closed-loop system. If it does, decrease A. If it 
does not, increase A. 

t Note that the central solution as defined here is not 
unique, because the spectral factorization (95) is not unique. 


(4) If the optimal solution has been approached 
sufficiently closely, stop. Else, return to (2). 

The rational /-spectral factorization (95) may be 
reduced to two /-spectral factorizations of polynomial 
matrices; one for the denominator, one for the 
numerator. Algorithms for this factorization are now 
becoming available (Kwakernaak, 199(Jb; §ebek, 
1990; Sebek and Kwakernaak, 1991, 1992). 

The search process may terminate in two ways 
(Kwakernaak, 1990a; Glover et ai, 1991). The less 
common situation is that A may be decreased steadily 
until it reaches a lower bound below which the desired 
/-spectral factorization is no longer possible. All 
suboptimal compensators for this least possible value 
of A then are optimal. 

The more usual situation is that A may be decreased 
until it reaches a value where the factorization exists 
but no suboptimal solution stabilizes the closed-loop 
system. The search procedure may then be used to 
delimit the optimum. It turns out that as the optimum 
is approached, the /-spectral factorization becomes 
singular in the .sense that the coefficients of the 
rational functions occurring in the spectral factor Za 
grow without bound. At the same time, one of the 
closed-loop poles of the central solution approaches 
the boundary of the left-half plane, and actually 
crosses over from the left-half to the right-half plane, 
or vice-versa, at the optimal value A„,„. Since the 
closed-loop transfer matrix H cannot have this 
closed-loop pole as a pole (because otherwise it would 
make ||//|U infinite), this closed-loop pole cancels in 
H, It turns out that it actually cancels within the 
compensator transfer matrix C, and hence may be 
removed. 

The singularity phenoment)n in the /-spectral 
factorization may be avoided by only performing a 
partial factorization, which then may be exploited to 
compute exactly optimal solutions. The details are 
described elsewhere (Kwakernaak, 199()a), where also 
a characterization is given of all optimal solutions, 
.similar to the characterization (KKl) of all suboptimal 
solutions. An experimental MATLAB macro package 
is available for the numerical computation of the 
optimal solutions (Kwakernaak, 1990b). 

There are a large number of details that are not 
di.scussed here for lack of space. They concern 
assumptions on the dimensions of the signals w, u, z, 
and y, and on the transfer matrix G. Many of these 
assumptions may be removed or circumvented. 

The singularity and cancelation phenomenon does 
not always occur. If it does not, optimal solutions are 
obtained corresponding to the largest value of A such 
that det Ha has a pole or zero on the imaginary axis. 

The suboptimal and optimal solutions normally arc 
by no means unique. An exception is the SISO mixed 
sensitivity problem (Kwakernaak, 199()a). 

11. STATE SPACE SOLUTION OF THE STANDARD 
PROBLEM 

The mainstream work on algorithms for the solution 
of the standard problem focuses on state space 
algorithms (Doyle et al., 1989; Glover and Doyle, 
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1989). We limit our exposition (which follows that of 
Welland, 1990) to a special situation, whose solution 
admits a neat and compact presentation. The starting 
point is the description of the system 


a-i:]. 


( 101 ) 


in state form as 


X — Ax + Bu + Ewi, 



>» = Cx + 

with A, B, C, D, and E constant matrices. This state 
space representation is not completely general. Note 
the special structure of the control error r and the fact 
that the external signal w splits into two separate 
components w = col (w,, w^). For a more general 
formulation see Glover and Doyle (1989). Note that 
even in these more general representations the 
transfer matrix G is limited to be proper, a restriction 
that is not necessary in the frequency domain solution. 

First consider suboptimal regulation \ising state 
feedback, that is, when y =x. It turns out that in this 
case \\H\\^<k, if at all possible, may be achieved by 
static linear state feedback of the form 


u = - Fx, 


(103) 


Thus, in the case of full state information one 
algebraic Riccati equation needs to be solved, and 
static state feedback solves the problem. The output 
feedback problem, >^ith measurement 

y-Cx-^w^i, (108) 

is more difficult to solve, although its solution is quite 
elegant and has a separation structure reminiscent of 
the LOG problem. It turns out that for output 
feedback the suboptimal solution needs to be modified 
to the feedback law 


u = ~Fx, (109) 

with F = B^X as before. The quantity i may be 
viewed as the estimated state, and is the output of an 
observer-type system given by 

i = ^ ^jc + Bu + ZYC^iy - Cf). (110) 

The symmetric matrix V, if any exists, is a 
nonnegalive-delmite solution of the algebraic Riccati 
equation 

AY-i-YA^ + E^E - ^D’d) =0, (111) 

such that the matrix A - Y{C^C - {\fk^)D^D) has all 
its eigenvalues in the open ieft-half plane. The 
constant matrix Z in (110) is given by 


with F a constant matrix. The gain matrix F is given 
by 




( 112 ) 


F = B^X, ( 104 ) 


where the symmetric matrix X is a nonnegative- 
definite solution of the algebraic matrix Riccati 
equation 

A^X+XA+D^D-xi^BB^ -^= 0, (105) 


such that the matrix A -(BB^ - (\ /k^)EE’ )X has all 
its eigenvalues in the open left-half plane. If no such 
solution X exists, there is no stabilizing state feedback 
such that ||//|U ^ 

One way of proving this result is to note that with u 
given by (103) the closed-loop transfer matrix from w 
to z is 


_ r D(sl-A + BE) 'E 
^^^'>~[-F{sI-A+BE) ‘E 


( 106 ) 


Manipulation of the Riccati equation (105) in a way 
similar to the proof of the well-known Kalman- 
Yacubovitch equality (Kalman, 1964) results in the 
expression 

[a/ - J E^-Sl - A ') 'Xe][xI -1 E^XisI - A) 'e] 
= AV-//"(-.0WW. (107) 


where A = A-BF. If A - {BB^ - 
all its eigenvalues in the open left-half plane, he 
left-hand side of (107) is positive-definite on the 
imaginary axis, which proves that ||//|lr. 


The compensator defined by (109)-(110) is .subop¬ 
timal and stabilizes the feedback system if and only if 
||AT|U,<A. 

The order of the (suboptimal) compensator equals 
that of the ‘plant’’ G. Representations of ‘all” 
suboptimal solutions are also available (Glover and 
Doyle, 1989). The Riccati equations (105) and (111) 
are the equivalents of the two polynomial /-spectral 
factorizations in the frequency domain solution. The 
Riccati equations are normally solved by spectral 
decomposition of the corresponding liamiltonian 
matrix. Numerically reliable routines are available in 
MATLAB. 

An implementation of a search procedure to delimit 
the optimal solution analogous to that for the 
frequency domain approach is available commercially 
as part of the MATLAB Robust Control Toolbox 
(Chiang and Safonov, 1988), and the more recent 
MATLAB fi-Analysis and Synthesis Toolbox. As 
for the polynomial package, considerable expertise is 
needed for the use of these toolboxes. As the 
optimum is approached singularities occur that are 
similar to those for the frequency domain solution. 
Glover et al. (1991) have analysed these phenomena. 

The stale space solution of the problem requires 
more assumptions (for instance that the transfer 
matrix G be proper) than the frequency domain 
solution. On the other hand, the numerical algorithms 
for solving Riccati equations are better developed 
than the /-spectral factorization algorithms needed in 
the frequency domain approach. 
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12- CONCLUSIONS 

^»‘Optimal regulation is a rewarding research 
subject both for theoreticians and engineers. Theoret¬ 
icians and also mathematicians find an unequaled 
opportunity to penetrate deeper into the rich and 
intricately structured world of linear systems. 
Engineers recognize many of the issues and design 
targets of “classical” control theory, which now can be 
handled algorithmically. 

Although the subject has received much attention it 
has not reached maturity. There are several important 
topics that we have not been able to touch upon in 
this tutorial exposition, and are subjects of intensive 
research. 

On the theoretical side, the discrete-time 
optimal regulation problem is more or less under¬ 
stood. Francis (1990) is doing interesting work on the 
application of i^^-theory to sampled-data systems. 
Theoretical work on distributed-parameter system is 
in progress (Curtain, 1991), and attempts are being 
made to deal with nonlinear problems (Van der 
Schaft, 1990). 

Further theoretical work on finite-dimensional 
linear systems is directed towards exploring the 
connections between various solution methods such as 
those based on the state space approach, /-spectral 
factorization, operator theoretic methods, interpola¬ 
tion theory, and differential game theory. It cannot be 
claimed that all aspects of optimal solutions (as 
opposed to suboptimal solutions) are fully under¬ 
stood, and no doubt considerable attention remains to 
be spent on this topic. 

A problem of considerable interest, where relatively 
little progress has been made, is how to use the 
freedom still present in X-optimal solutions resulting 
from the lack of uniqueness. The reason for this lack 
of uniqueness is that the oo-norm involves the peak 
value of the largest singular value of the closed-loop 
frequency response matrix only. This leaves con¬ 
siderable freedom in the behavior of the smaller 
singular values. The control theoretical interpretation 
of this freedom is not clear. One way of eliminating 
the nonuniqueness is to look for solutions among all 
^rK,-optimal solutions that succes.sively minimize the 
peak values of all lesser singular values. This leads to 
the notion of supcroptimality (see e g. Kwakernaak, 
1986; Jaimoukha and Limebeer, 1991). 

Another way of eliminating nonuniqueness is to 
choose so-called “minimum-entropy” solutions (Mus¬ 
tafa and Glover, 1990). Other researchers use the 
remaining freedom for further optimization purposes. 

Another line of research is directed towards making 
the theory applicable. As we have shown, the SISO 
mixed sensitivity problem has considerable design 
potential. The multivariable mixed sensitivity problem 
shares this, but not all the conclusions for the SISO 
case generalize straightforwardly. Other special cases 
of the standard problem, such as criteria involving all 
three of the sensitivity function, the complementary 
sensitivity function, and the input sensitivity function, 
are being looked into. A monograph has been 
devoted to a special version of the mixed sensitivity 
problem deriving from what is known as normalized 


coprime factor plant descriptions (McFarlane and 
Glover, 1990). 

A further question that by no means has been 
settled is how to translate practical information about 
plant uncertainty and modeling inaccuracy into 
quantitative terms that allow the application of 
techniques. Doyle’s “structured singular value” 
(Doyle, 1982) no doubt is an important step in the 
right direction. 

The fact that algorithms and software become 
slowly available strongly stimulates work on “real 
world” applications. More and more interesting 
design studies are reported, with encouraging results. 
Several papers presented at a recent meeting in 
Cambridge attest to this (see for instance Kelletl, 
1991; Marshfield, 1991; Walker and Postlethwaite, 
1991). Other applications are reviewed by Postle¬ 
thwaite (1991). 

The wealth of results on ^..-optimization is finding 
its way into books. Besides the monographs by 
Mustafa and Glover (1990) and McFarlane and 
Glover (1990) a book by Morari and Zafiriou (1989) is 
attracting considerable attention. A recent text by 
Doyle et ai (1991) introduces some of the material 
at the level of a second course on control. 
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Application of EKF Technique to Ship 
Resistance Measurement* 

GENGSHEN LllJt 

A new method of ''measuring" the resistance of full size ships by applying 
system identification technique requires only a simple sea trial during a 
ship's regular voyage but gives the estimated resistance coefficient with good 
accuracy. 
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Abstract— The measurement of ship resistance is imfxirtani 
in naval architecture and marine hydrodynamics research 
The method of predicting ship resistance from scaled model 
tests has been used for over 100 years but llic results suffer 
from 'scale effect” 

This papei presents a new method of “measuring” the 
resistance of full size ships by using a system identification 
technique Fxtended Kalman Filtering is used tor estimating 
intermediate coeflicients m the ship's surge motion equation, 
then the ship resistance coefficient is derived from the 
identified results 1 his method only requires a simple sea 
trial dunng a ship’s regular voyage 'Ihe ship resistance 
coefficient as well as Che wake fraction and the thrust 
deduction factor were obtained with excellent accuracy The 
success of this method should impact the design of ship hulls 
and propellers and also research in marine hydrodynamics 


I INTRODUCTION 

Reducing the resistance of ships and making 
their propulsion systems work efficiently has 
been one of the most important goals for naval 
architects. However, the complexity of ship 
bodies makes it difficult to obtain the resistance 
of ships either by numerical method or by direct 
measurement. The method of using smaller 
scaled ship models to do the experiment, and 
then extending the model results to real ships 
has been in use since William Froude first 
proposed this in 1868. But the difference between 
the model and the full ship still causes serious 
errors. Efforts to minimize this scale effect 
have resulted in the current method for 
estimating ship resistance which consists of a 
combination of tests on both the model and the 
ship. The principle rests on the model test results 

•Received 11 December 1990, revised 18 June 1991, 
revised 7 April 1992, received in final form 30 July 1992 The 
onginal version of this paper wa.s not presented at any IFAC 
meeting. This paper was recommended tor publication in 
revised form by Associate Editor R Boel under the direction 
of Editor H. Kwakernaak. 

t McDermott International Inc., P.O. Box 218218, 
Houston, TX 77218, U.S A. 


to get ihe ship resistance curve through 
extrapolation and to acquire the information of 
the interaction between the ship hull and the 
propeller. Based on these results sea trials of the 
ship are conducted to measure the resistance of 
the ship by measuring the ship speed and thrust 
(Comstock, 1967). However, sea trials on real 
ships in these procedures are very costly, since 
the ship has to be taken out of service and 
special instrumentation and qualified personnel 
have to be placed aboard. (For example, to get 
the ‘EXXON Philadelphia' on one trip between 
San Francisco and Velde/, Alaska, would cost 
more than $500,(XX), which does not count 
extra-costs for the delay caused by bad weather 
and other unpredicted reasons.) Furthermore, 
this method can only be used for newly- 
constructed ships. Once a ship is put into 
service, changes in the condition ol the ship hull 
and in the propeller system due to fouling, 
corrosion addition of appendages, or hull 
alterations cannot be monitored. The informa¬ 
tion about ship resistance during service is the 
key to the determination of the best time to put 
a ship into dry dock, which is closely related to 
the operating efficiency of the shipping company. 

Our method of applying a system identifica¬ 
tion technique provides a new way of estimating 
the ship resistance from simple sea trials. By 
applying the Extended Kalman Filtering (EKF) 
technique, not only the ship resistance coefficient 
Cf( but also the full scaled wake fraction and 
the full scaled thrust deduction factor t, can be 
accurately estimated. 

In Section 2, the state equation is derived by 
applying the theory of marine hydrodynamics. In 
Section 3, the discussion is focused on the 
application of EKF and the derivation of the 
ship resistance coefficient. Brief discussion of sea 
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trial design is presented in Section 4. The results 
of applying this technique to the tanker 
‘EXXON Philadelphia’ is presented in Section 5, 
followed by a conclusion in Section 6. 

This article is partially based on the research 
presented in the author’s Sc.D. Thesis (Liu, 
1988), interested readers can consult this 
reference for detail. 

2. STATE EQUATION 

The study of ship resistance requires only the 
ship surge motion equation as a state equation. 

2.1. Resistance and resistance coejficient of ships 
The resistance of a ship has several com¬ 
ponents; the frictional resistance caused by the 
viscosity of water; the eddy resistance caused by 
the formation of eddies in the boundary layer; 
and the wave-making resistance caused by the 
energy lost in the wave made by surface ships. 
The magnitude of ship resistance R is propor¬ 
tional to the square of the ship surge speed u : 

R = ipSC^u^, 

where p is the water density, and 5 is the wetted 
surface area of the ship body (Comstock, 1967). 

2.2. Thrust and thrust coefficients t}s 

Previous results (Liu, 1988) show that the 
thrust T provided by a screw propeller can be 
expressed as 

T = rip^pDV + r]pfiD^un + 

where D is the propeller diameter, n is the 
rotation rate of the propeller shaft, rj^,^ is the 
propeller drag-thrust coefficient, rj^, is the 
propeller induced drag-thrust coefficient, and 
7]p^ is the propeller lift-thrust coefficient. 

Introducing the thrust coefficient = 
TlpD^n^ then, 

where J = uInD is called the advance ratio. Tht 
K-J curve is widely used for describing the 
characteristics of ship propellers. 

2.3. State equation 

When a ship keeps its straight course, the 
external forces are the sum of the thrust force 
and the resistance force. The application of 
Newton’s law, without considering the interac¬ 
tion between the propeller and the ship hull 
yields, 

(m - Xu)u = T — R 

= pD^Vp^y + pD^np.,un 
+ pD*r]p,y - jpSC^u^, 


where m is the mass of the ship, -X^ is the 
added-mass of the ship. From now on, rj^^s are 
used to represent the ship propeller thrust 
coefficients and the model propeller thrust 
coefficients are represented by 

However, the interaction between the pro¬ 
peller and the ship hull is not negligible. 
Therefore the above equation has to be 
modified. 

First, it is noticed that the advancing speed of 
the propeller u^, which is the speed of the flow 
in front of the propeller, is usually lower than u; 

u^ = (l- yv)u, 

where w (wake fraction) is less than one. 

Second, because of the interaction between 
propeller and ship hull, the hydrodynamic 
pressure at the stern of the ship changes. The 
suction effect changes the flow past the ship hull 
and usually increases the resistance of the ship, 
or equivalently, the thrust force is reduced: 

r = 7 ;,(i-o. 

where T is the effective thrust force acting at the 
ship, TJ) is the thrust force provided by the ship 
propeller without the interaction between the 
ship hull and the ship propeller, and t is the 
thrust deduct factor. 

After modification, the ship surge motion 
equation becomes 

pD^r}*u^ y pD^r)*un -f- pD^7]*rr 

«= V 

m A„ 

where, 

vt = Vps,0 - 0(1 - w')' - 

n* = V:(i ” 0(1 - 

ri* = T]p,,( \ -1). 

This is the state equation to be used in the 
system identification procedure which leads to 
the estimate of the ship resistance coefficient. 

It should be noted that strictly speaking t is 
not a constant. But as the ship speeds up, t 
approaches a steady value. Therefore, f/*s can 
be taken as constants. When a ship is speeding 
up from zero speed, at the beginning stage, its 
speed is still very low. The value of t is near to 
zero. Correspondingly, the values of the in 
this case are also different, we take them as fjs. 
Then equation (1) becomes 

y = pD'fi^un + pD*f\y ^2) 

m-Xi. 

The difference of equation (1) and equation 
(2) is beyond the difference of their appearances. 
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In the latter, since the value of w is very low 
while the rps value is high, only t), plays an 
important role. Based on this idea, 
and the model test value of C« can be used in 
identifying fi^, since their inaccuracy due to the 
scale effect has little impact on the accuracy of 
the estimation of The value of f), should be 
very close to the value of since t is very 
small 

Furthermore, there is a special case in ship 
surge motion-deceleration with propellers wind¬ 
milling. In that case, the advanced ratio is a 
constant, although both u and n changed. The 
state equation is then simplified to 




(3) 


where 

— 


(1 - - W)" + 


, . - W') 






-CV. 


which is called the integrated resistance 
coefficient. If a ship can operate with its 
propeller wind-milling, this equation can also be 
used as the state equation. 

In the above equations -X^ is solved through 
a numerical method, so that m — X„ is also 
known (Liu, 19K8). 


3 CKh AND COLFFIC ILN'I DFRIVATION 

3.1 Application of EKf^ 

For a nonlinear system with the following state 
model and measurement model: 

I X =f(x, t) + r(0 fU) ^ ^(0. QiO) 
t V. = h,(x(t,)) 4- y, A: - 1, 2, .... n - N((). R,) 

where j: is the state vector, / is a nonlinear 
function of the state. the process noise 

which is zero mean Gaussian noise with spectral 
density matrix 0(0- (Denoted by £(t) — 
/V(o, 0(0).) is the sampled measurement 
vector at time is a nonlinear function of the 
state vector at time y^ is the measurement 
noise which is a white random sequence of zero 
mean Gaussian random variables with associated 
covariance matrix R}, (denoted by A^(0, Rk))- 
The continuous-discrete extended Kalman 
filter is as follows: (Gelb, 1982) 

i =/(#- 0. 

P{t) = F{i, t)P + PF^ {s. 0 + Q{t). 

fk( + ) ~ ik(-) 


^*( + ) ~ “ l^kRkiSk(- ))]^*(- )> 

~ kt,-)Fl[(,ik(-)) 

where x is the expectation of the state. is 
the expectation of the state just before update at 
time tk. is Ifi^ expectation of the state just 
after update at time tk^ P(t) is the estimation 
error covariance matrix defined by 

p(t)^E\{x(t)- m)(m~my] 

Pk{-) iind are respectively the value of 

P(t) just before and after the update at lime 
Kk is the gain matrix at time (k. 


Hi, t) = 


Hk(ik{-)) = 


df(i. 0 


There are three ca.ses for the application of 
FKF: 

(1) To identify r/*s: 


/ 


^ = 


pD^T]*u^ T f)D^7]*un T pD^t]*n^ 


\ 


\ 


m - X„ 
0 
0 
0 

V= [10 0 0 ]x + t,. 

where 


X = 





is the augmented state vector, is the process 
noise, and is the measurement noise. 

(2) To identity fje, 

+ pD*fi^n" — 






m - AT,, 

0 

y = \\ 0U + i;„. 

where is the model test value of C’,, and the 
augmented state vector is 

(3) To identify Cp\ 



>’ = [1 0]f+^„. 
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The augmented state vector is 

‘O 

3.2. Parameter identifiability 

3.2.1. Identifiability of intermediate coeffici¬ 
ents. The parameters in a state equation are 
identifiable if they can be uniquely determined 
by using given information. Clearly, if we put 
explicitly in the state equation: 

u = 

+ pD^r]p,^un + pD*r\p,^n^ - ipSCnU' 

m-X^ 

then both and Cr are unidentifiable, 

because, these two parameters are coupled in 
the terms of u^. However, the intermediate 
parameters in the state equation are identifiable. 

For cases (2) and (3), because there is only 
one parameter to be identified, it follows that 
both ff-i and C/j are identifiable. For case (1), 
remember that while the ship is accelerated, the 
rps « is a constant. Therefore, u is the only 
variable in the state equation. Furthermore, 
when u is known at any instant, then u is also 
known. And since t]*, r ]2 and r]* are the 
coefficients of terms with different orders of u, 
they are also uniquely determined, 

3.2.2. Feasibility of deriving from the 
identified results. Since the scale effect between 
the real ship propeller and the model propeller is 
considered, r]p^^ and are also unknown. 
Furthermore, since the value of t and derived 
from the traditional method are not reliable, 
they are also considered unknown. Therefore, to 
get a unique solution of additional equations 
are introduced. 

(1) Although the curves of the model 
propeller and the ship propeller are different, 
the zero K, point of the two curves correspond to 
the same value of advance ratio. In other words, 
if the two curves are drawn in the same 
coordinate system, they will intersect at /C, = 0 . 
(see Fig. 1). The reason is that for both the 
model propeller and the ship propeller, the 
attack angles of the flow necessary to create zero 
propulsion force can be taken as the same, 
though the Reynold's numbers are different for 
two propellers. And by the same token, to 
extend curves to negative J values, they 
should reach their peak values at the same J 
value. This means that 


^p.S\ ^P^i 



Fig. 1. K, curves for ship propeller and its model. 


(2) According to Abkowitz (1987): 


kSCrJI 


jtJI 


( 4 ) 


where k depends on the geometry of the ship 
(hull and propeller). 

From this equation, it is clear that t can be 
calculated at any given value of 7^,, if k is known. 
We can use either the identified f]^, or the 
identified to calculate /c, since the values of 
and are both known. 

We have just shown that by introducing new 
equations, the six unknowns w, 

Cfi and K can be solved for. However, it may be 
difficult to obtain these unknowns by solving the 
equations simultaneously. Therefore, two meth¬ 
ods of solving for the unknowns using iteration 
procedures are presented in the following 
section. 


3.3. Derivation of the resistance coefficient 

As mentioned before, w 

and t are combined into the intermediate 
coefficients 77 *, 77 *, r/t and which are 

identified by the EKF. The following arc two 
methods for the derivation of 77 ,,,,, 

and /. 

3.3.1. Direct comparison method. This 
method is designed to get the ship resistance 
coefficient C,^ through direct comparison be¬ 
tween identified values of 7 /*s, 77 ,,^s. During the 
derivation, t and t/^^s can also be determined by 
using the derived Cft, and w. 

(1) First w is determined: 


14 / = ] — ^ = 1 — 

vUp., r]*Vpm/ 
(2) Then 77^1 is obtained: 


( 5 ) 



^pmf]2 
^3 ^pm2 


which brings in two extra relations. 


(6) 
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(3) The ship resistance coefficient is then 
determined: 



2D^ 


( 7 ) 


The determination of C/j and is completed. 
With ihe derived Q and w, t and r/^,s can be 
derived through iteration. 

(4) Derivation ol t and 
Take rjp^^ as ihe first approximation of 
—the value of / at equilibrium is then 
determined: 


tr 



( 8 ) 


Since the calculation of t, uses instead of 
it is not accurate and cannot be used to 
calculate the ship propeller coefficients. In the 
following, we shall discuss how to derive the 
values of while correcting value of through 
iteration. 

Recall from equation (4), that k can be 
calculated through 

t^D^K, (1 - w) 

^ =-^-, (9) 

where and AT,, are respectively the values 

of f, Jp and Kt at equilibrium. 

During the first approach, is obtained from 
the model propeller curve instead of the ship 
propeller curve. This causes the inaccuracy in 
the calculation of k, which leads to the 
inconsistency of the t value corresponding to fj^ 
when it is obtained in the following two ways: 

• by comparing with the model test 




Vprn s 

by using the formu;a just established: 


( 10 ) 


kSC^JI / 8/C, \ 

'= . ^ / - A v 1 ■ (11) 

D~K,(l-w)\^ nJl } ' ' 


In order to eliminate this inconsistency, the 
following steps are needed: 

(a) Obtain t by substituting Jp and Jt, into 
equation (11). 

(b) Substitute this t value into equation (10) to 
obtain an updated rfp^^. 

(c) Substitute rjp^^ into equations (5), to get an 
updated Then r]p,^ and T]p,^ are updated by 
using this updated 

(d) If the updated is unchanged, then stop. 
Otherwise, repeat the whole procedure until the 
inconsistency disappears. 


This iteration process will lead to a proper 
value, and at the same time gives a K, curve for 
the ship propeller. 

3.3.2. Recurrente method. This method com- 
bines the wind-milling deceleration procedure 
and the acceleration procedure. The basic idea is 
comprised of the following steps. 

(1) Obtain the advanced ratio of the ship 
propeller by using the measured data and 

- 

J = '^>v'r.(l - »*') 

where w is obtained from equation (6) by using 
the acceleration data. 

(2) Use and tfp^^ to calculate t,.. 

(3) Recall equation (9), * can be determined 
when r,., 4 and Kt are known. Once k is 
determined, is obtained. 

(4) The ship resistance coefficient is then 
obtained: 


. 2D^K, (1 

--• 02) 


The (\ value is adjusted through the updating of 
t^. This is similar to the procedure mentioned 
earlier. 

(a) First calculate / based on the value. 

(b) Then with the updated J value, update 

and k. 

(c) Update Cp. 

This procedure is repeated until C\ converges. 


4 SLA TRIAL DESIGN AND MEASURED DATA 
4.1. Sea trial design 

wSea Inals used for system identification are 
important, since the accuracy of the identified 
parameters depends both on the identification 
technique and on the quality of the data 
obtained in the experiment. 

The sea trial experiment is conducted to 
provide data with the maximum information 
content for the system identification under given 
constraints. The experiment design is an 
integrated procedure which includes choosing 
input variables, designing test signals, determin¬ 
ing sampling rate and experiment time length, 
and designing pre-sampling filters. 

For the identification of the ship resistance 
coefficient, the system equation has already been 
established with n as the input variable. The 
experiment design is reduced to the choice of the 
signal pattern for n, the determination of the 
sampling rate and of the sea trial time length. 

4.1 1. Sea trial pattern. To get accurate values 
of the parameters through identification, the 
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input signal n should have two characteristics. 
The first characteristic is that for the coefficients 
rj*s or ff^ or in the model, the input signal n 
should be able to discriminate between different 
parameter value groups. In other words, if two 
different groups of values are assigned to the 
coefficients, the same input signal n will produce 
a different output u. The second characteristic is 
that the variance of the estimates of the 
parameters should be minimized. The input 
signals possessing these characteristics are called 
the persistent exciting signals, and an experiment 
conducted with that class of input signals is 
called an informative experiment (Ljung, 1987). 

In practice, a perfectly designed experiment 
plan based on the theoretical analyses may not 
be feasible due to constraints. Therefore, a 
priori knowledge of the system plays an 
important role. For the identification of the ship 
resistance coefficient, the constraint for n is that 
its value should not be changed too rapidly for 
the sake of the safety of the propulsion system. 

Based on these ideas, two simple maneuvers 
are considered. The first one is the acceleration 
maneuver which is to start the propeller from 
zero rps to full rps. Through this procedure, the 
measurements of u and n can lead to the 
estimates of tj*. rj* and tj* simultaneously, and 
the data collected at the beginning of the 
acceleration can be used to identify fj. The 
second maneuver is the deceleration or ''wind¬ 
milling'’, in which a ship is slowing down with its 
propeller in the wind-milling mode. However, 
some ships cannot make the propeller wind-mill, 
hence for them this maneuver is not applicable. 

Checking the two experimental procedures: 
“speeding-up” and “wind-milling” with simu¬ 
lated data shows that signals of n in both 
procedures arc persistently exciting for the 
identification of r;s or Cf^. 

4-1.2. Sampling rate and experiment length. 
Sampling rate is also related to the accuracy of 
the identification. For system identification one 
has to avoid the aliasing phenomenon which 
occurs when the sampling interval is larger than 
the time constant of the system (Gustavsson, 
1975). The time constant concept is used in 
linear system analysis. Although, the ship surge 
motion system is a nonlinear system, for a step 
input n, the response of u is an exponential-like 
function. Following the definition of the time 
constant of a linear system, the pseudo-time 
constant t for ship surge motion system can be 
defined as 

u, 

r = — , 

where is the value of the u at the equilibrium 


state, and Uq is the value of u at time t = 0. For 
the tanker ‘EXXON Philadelphia’, a pseudo- 
time constant r of —410 sec was obtained both 
by solving the equation analytically, and by 
simulation of the ship motion (Liu, 1988). The 
sample interval of the measurement in the sea 
trial is 1 sec. 

The duration of the experiment should be long 
enough to insure the reliability of the data and 
the accuracy of the identification. Insufficient 
time for an experiment may lead to a pitfall: in 
some instances, an “accurate” model that fits the 
data very well may be obtained through 
identification, however, the model may be 
wrong. Data from another similar experiment 
may also lead to an “accurate” model but with 
quite different parameters obtained through the 
same identification procedure by the same 
identification method. Furthermore, the variance 
of the estimates is usually proportional to the 
inverse of the experiment length (Gustavsson, 
1975). Nevertheless, longer duration may not be 
better, because extra disturbances are often 
introduced in a long time experiment, especially 
during a sea trial. Zarrop (1974), in his thesis, 
cited an example where lime duration is only 
twice the time constant in practice. 

The sea trial time length for each procedure 
was over KXXlscc. Identification on simulated 
data showed that this duration is good enough. 

4.2. Measurements 

I’he sea trial for measuring w and n was 
conducted on 12 May 1987. The 233 m long, 
76,(XX) dead-weight ton tanker 'EXXON Phila¬ 
delphia’ was on its way back from Valde/, 
Alaska to San Francisco along the West Coast. 
The initial position of the sea trial was chosen in 
calm .sea at 58MH.67'N, and 143^29.25'W. 
There was a slight swell of 0.3-0.6 m and no 
wind. 

First, the deceleration procedure was con¬ 
ducted by cutting the steam to the engine and 
letting the propeller wind-mill. The ship kept 
moving straight forward. When the ship speed 
was down to half of the cruising speed, the 
engine was set in reverse which brought the 
tanker speed to zero. The acceleration was then 
initiated by bringing the propeller rotating rate n 
to 70rpm as quickly as possible. The whole 
experiment took about 35 mins and went very 
smoothly. 

5 RESULTS AND DISCUSSION 

5.1. Results of system identification 

During the identification, the basic parameters 
of the system were set as follows: 
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u( m/ #ec) 



Fif. 2 EsfirTiLiUJ u by EKF during the decclcrdiioii 
prnecdure 

• displacement ot the tanker(m): ()()() ton 

' added-mass( —A',J; 4390 ton 

• overall length(/v): 233 m 

• wetted surface area(A). E25 x 10^ m' 

■ diameter of the ship propcller(/)): S.2m 

■ model drag-thrust coefficienl( 

-0.1725 

• model associated drag-thrust coefficient 
(V.J: ~n.2415 

• model lifl~tbrust coefficienl(r],0.3796 

• model wake Iraclion(w): 0 383 

• model thrust deduction factor(0' 0.216 

• water density(p): 1.028 ton m \ 

5.1.1. Result oj estimated I wo figures are 
presented here to illustrate the identification 
procedure and its result. In Fig. 2, the filtering 
procedure of the EKF is displayed along with 
the measurement of u. Figure 3 plots the 
estimated C,i m the early path and in the final 
path of the identification. In the first path, the 
initial value of is a rough guess; while in later 
paths, the initial value is based on the 
identification results before it. fhe validity of the 
identified results is checked through statistical 
hypothesis testing (Scliweppe, 1976; Hwang, 
1980). 




Ihe result of the identification is 
fV? - 0dX)238. 

The advanced ratio is; 

= E03. 

5.1 2. Identified at low J value. In the 
procedure for identifying the initial part of 
the measured data was omitted because ot the 
low signal-to-noise ratio. At the same time, to 
avoid the influence of the increasing thrust 
deduction factor t during the estimation of f;,, 
only the data ranging over 2(K) sec are used at 
slow speed. Since the input data set is small, the 
e.stimation .should be conducted carefully. The 
identified value of f/^ is 

r), = 0.359 

5.1.3. Results of identified r}*, r;* and 

7]*. Figures 4-6 are plots o1 the curves of the 
e.stimated rj*, rj* and r/T obtained by the EKF 
technique. The results of the estimation are: 

^ 77 -0.285 

^ 7/;--0.135 
“0.279. 

5.2. Derived coefficient values 

5.2.1. C\ by direct comparcson method. 

(H The derived values of the hydrodynamic 
coefficients are; 


w = 0.239, 
0.00227. 



Fig ^ Cff e.stimated in twn paths. 


Fig. 5 Estimated coefficient r]* 
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Fig. 6. Estimated coefficient 7 ;*. 


(2) The first iteration of adjustment of 1 ^ and K, 
is as follows; 


t, = 0.265, 

/f,^ = 0.1H6, 

*- = 0.4179, 

^, = 0.332, 

0.085, 

»?,,„ = 0.3923, 

4 = 0.2828, 

-0.178, 

7 ,^,, = -0.249. 

Since the value of 4 changed, another iteration 
is needed. 

(3) After the third iteration, 4 converged to a 
constant. The final results are 

4 = 0.296. ,,^,, = -0.180, 

= -0.252, r;,,,, = 0.396. 

5.2.2. Ck by the recurrence method. 

(1) From C„ = 0.00238 and /„„ = 1.03, the ship 
propeller thrust coefficient of wind-milling state 

is obtained: 

f^,_= -0.058. 

(2) Then ir is determined. 

*; = 0.4179. 

(3) The ship resistance coefficient Cr then is 

Cr = 0.002'4. 

Through adjusting the ship propeller K, curve, 
the updated Cr value is obtained as follows. 

(1) Using the value of Cr just obtained to 
calculate t : 

7 = 0.087. 

(2) Ba.sed on the new value of 7, a new set of 

values of 4 , K„ K, and k are 


obtained: 


7,,,,. = 0.3932, 
V,= -0.2077, 
j^, = 0.3441, 
/C,, = 0.196, 


4 = 0.2901, 
-0.2497, 
K, = -0.048, 

‘Krfi ^ 

jc = 0.4538. 


(3) Then is updated: 


C^ = 0.00219. 


The updating procedure is repeated until the 
value converges to a constant. 

For the tanker ‘EXXON Philadelphia', the 
final value of O four iterations by the 

recurrence method is 


0 = 0 . 00221 . 

When this value is compared with the result 
obtained from the direct comparison method, 
the difference is less than 3% of 0.(X)227. 
Furthermore, considering the fact that when the 
ship propeller is wind-milling, the wake fraction 
is different from that in the acceleration 
procedure, therefore the O value can be further 
improved through the adjustment of w. For 
example, if the adjustment is made by referring 
to the curves given by Huang and Groves (1980), 
the final result of Cf^ will be 0.(X)226. This value 
will make the difference of between the two 
methods less than 1% of 0.00227. 

To check the results of the estimation, the 
surge motion of the tanker 'EXXON Philadel¬ 
phia’ is simulated with the estimated hydrodyna¬ 
mic coefficients, and the simulated results are 
compared with the measured data (see Figs 7 
and 8). 

I'he main purpose of this paper is to introduce 
a new method of measuring ship’s resistance. A 
comprehensive methodology with details of 
technique for the accurate estimate of resistance 


u(m/ jec) 



Fig. 7 Simulated u during the deceleration procedure. 
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u(T7i/5ec) 



of different kind of ships needs further 
development. However, the comparison be¬ 
tween the simulation using the hydrodynamic 
coefficients identified by the new method and the 
traditional method can still show the advantages 
of the new methods. In Fig 9, the simulation of 
the surge speed of the ship is compared with the 
measurement, using t = 0.221, w-0.38 and 
C,j=0.25. The differences between the simula¬ 
tion and the measurements are obvious 

6 CONCLUSIONS 

The resistance coefficient of the tanker 
‘EXXON Philadelphia’ was successfully estim¬ 
ated by applying EKF technique. The wake 
fraction w, the thrust deduction factor t, and the 
ship propeller coefficients were also derived from 
the identification results and from the model 
propeller coefficients. The results presented in 
this paper show that the acceleration process of a 
ship from zero speed to full speed is sufficient for 
providing the data necessary for the estimation. 

The application of this new method to 
different kinds of sti.ps will require the 
modification of system models and redesigning 
the sea trial. However, the principle of this new 
method is applicable to most ships. This is not 
only better for measuring ship resistance than 
what has been used for over 10(1 years, which 
benefits the design of ship hulls and ship 
propellers, but will contribute to marine 
hydrodynamics research particularly at the high 
Reynolds’ number. 



Hi(. 9 Simulated u with parameter estimate by traditional 
method 


It should be pointed out that this new method 
cannot completely replace model testing, not 
only because model test results corii»titute a 
reference for system identification, but also 
because model tests are used in the design stage 
to get the hydrodynamic coefficients (in spite of 
the scale effects). However, with the information 
derived through the EKF technique, great 
mprovement in the extrapolation methods used 
to predict ship performance from model test data 
can be expected (Liu, 1988). 
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AbNtract— Advanced control design requires a mtidcl thal 
discribes process behaviour iidequately Such model e in be 
conslrueled using physical modelling or si ilisiical identdica 
linn leehmques Both hive thur disadv tillages physical 
models are rigorous ind thus ollen expensi c in eonslruet 
while black box model structures arc not neecss inly 
eompttibh with pliysieai leility Since both approaches have 
undtnnbit merits as well their combination seems to be 
ittnctivc tnd rewirdifig 

Ihis p ipci discusses an appro ich to stilistinl estim Hum 
where best hncarlv paranictn/ed dynamic regression 
model IS identified which is also consistent with specified 
knowledge about piocess responses Such eh ii leleiislKs ne 
forced upon the black box models yielding i (iicy Box 
model set II will bi shown that eiueial physical knowledge 
such as process stability and s'gn of stationary gams cm be 
translated into linear inequality eonstraints on the black box 
model paiamelers In order to select best estimators in the 
Cirey Box class a Bayesun approach is adopted Cnven a 
prior distribution assoeiati d with the physical knowledge 
and given the d ita likelihood a posterior distribution is 
constructed Maximum and average A Pouenon estimators 
ire andysed Explicit solutions are given tor special eases ol 
Ciaiissian likelihood and j prior which is unihunilv or 
piecewise linearly distributed on a line iily eonstr lined 
Cirey Box model class I inally simulation results ind m 
application to a distillation process show the advantage ol iIk 
conlramed estimates under realistic experiment conditions 
Consideiable variance rtdueturns at the cost ol a somewhat 
larger bias lan be achieved m almg the poti nlial tor e g 
adaptive control applications 

1 INIRODUCTION 

luf DFSIGN of an ddriptive or other advanced 
controller otter multivariable dynamic 
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piocesses requires a model that gives an aceuidle 
dcsenplion of the process behaviour Such a 
model can be constructed by means of rigorous 
phwual modelling or using statistical 
identipcation techniques assuming a black 
box’ model structure Both appioaches have 
their disadvantages an accuralc physiial model 
IS haul to get and will be relatively elaborate the 
black-box’ model has nol necessarily a 
structure eonipaliblc with the underlying physi 
eal reality As a rtsull conventional (black box 
model) identification techniques vicld unrealistic 
or even non adequate results in too many cases 
e g due to modelling cirors and non ideal 
experiments of limited duration This bectymes 
apparent when chai aeleristics of the model 
identihed contradict common physital knowl¬ 
edge about the process such as stability and sign 
of static gams Section 2 discusses these 
observations, validated with published Simula 
tion results Also an actual application to a 
distillation column is briefly mentioned to 
illustrate the physical consistency problems in 
practice 

A combination ot the two approaches might 
well be attractive and rewarding as a means ot 
combining the obvious advantages Ihis papei 
discusses an approach to statistical estimation, 
aiming at the identification of a linearly 
parametrized dynamic regression (black-box) 
model that explains the data “best” under the 
restriction that its parameter estimates are 
consistent with prcspccihed knowledge Such 
knowledge is translated into a number of 
inequality restrictions on the black-box model 
parameters Together with the regression struc¬ 
ture they dehne a “Grey-lBjx” class of models 
that match the given characlerislics, which will 
be discussed and illustrated in Section 1 It will 
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be shown that crucial features such as (open- 
loop) stability and bounds on the stationary 
gains of the process give rise to linear 
inequalities Ad^B on the parameter 0. This 
section relates to earlier work such as that of 
Jury (1974), Shieh et al, (1987), Shieh and 
Sacheti (1978) and J0rgenson e( al. (1985). It 
offers a novel outlook on the translation of 
practical (plant) knowledge to an admissible 
parameter set. 

In Section 4, a Bayesian approach to 
Grey-Box parameter estimation is described. 
Using the Grey-Box model class a prior p.d.f. 
(probability distribution function) for the model 
parameters is constructed. This prior is com¬ 
bined with the likelihood using Bayesian 
techniques yielding a posterior p.d.f. which is 
conditioned on data as well as on process 
characteristics. 

In order to select a “best” model based on the 
posterior p.d.f., suitable estimators have to be 
defined. To this end maximum A Posteriori and 
average A Posteriori estimators are discussed in 
Section 5. These notions are generalizations of 
standard estimation problems, replacing the 
likelihood by the Bayesian posterior. Three 
constrained estimators are highlighted. 
Moreover, explicit solutions to the estimation 
problems will be given for the case of Gaussian 
likelihood, and a prior, which is uniformly or 
polyhedrally distributed on a linearly restricted 
Grey-Box model set. Relations exists, with 
respect to general statistical work, with Akaike 
0978, 1986) and Liew (1976), as well as with 
identification-oriented publications such as in 
Bard (1974), Gertler (1979), Peterka (1981) and 
Goodwin and Sin (1984). 

In addition, some simulation results will be 
presented and the techniques will be applied to 
distillation column data. Finally, in Section 6 
conclusions will be drawn. 

2. PHYSICAL INCONSISTENCIES OF 
IDENTIFIC ATION RESULTS 

2.1. Realistic experimental conditions in the 
process industry 

In the petrochemical industry the identifica¬ 
tion of a statistical dynamic process model is 
often troublesome. This structural problem can 
arise from violated assumptions for model and 
estimation structure (e.g. linearity, stationarity) 
but is often primarily due to insufficient 
information in the field data. This is the case for 
instance where, due to safety and/or quality 
considerations the duration of field experiments 
and the intensity of test-signal perturbation must 
be kept to a minimum. Typically, the length of a 


test-signal perturbed field test will be about three 
or four times the 99% process settling time 
whereas the signal-to-noise-ratio will be about 
unity or slightly better. In the following we will 
refer to this as realistic experimental conditions. 
As a result popular and theoretically well- 
understood one-slep-ahead prediction error 
based identification methods such as least 
squares (LS), maximum likelihood (ML), 
generalized/extended least squares (ELS), in¬ 
strumental variables (IV), etc. (see e.g. Eykhoff 
(1974), Goodwin and Payne (1977), Soderstrdm 
and Stoica (1989)) often fail to produce 
physically consistent models from finite, noisy 
experiments. For realistic experimental condi¬ 
tions but otherwise ideal factors (stationary 
stochastics; deterministic and noise model 
structure correctly chosen) Monte-Carlo simula¬ 
tions show that lack of stochastic convergence of 
these estimators results in a significant per¬ 
centage of unrealistic or even non-sense models. 
This does not relate to parameter bias only, but 
also to the de.scription of the input-output 
behaviour of the plant (see Wahlberg and Ljung 
(1986) for frequency-domain and Tulleken 
(1990) for time-domain characteristics). Since 
these problems can occur quite easily under 
rather idealized conditions, one could expect 
even more trouble in practical situations where 
model structure uncertainty and non- 
stationarities come in. This is indeed often the 
case when realistic experimental conditions 
apply as the following illustrates. 

2.2. Example 1: physical inconsistencies in 
identified distillation models 
Consider part of a distillation train as shown in 
Fig. 1. A feed stock consisting of many 
components is fed halfway into the main column. 
As a result of the heat injection of the reboiler, 
components with a high volatility (vapour) 
escape through the top, are cooled down in the 
condenser, accumulated in a reservoir and partly 
fed back into the column for improved 
separation. On the other hand, components with 
a low volatility (liquid) flow off to the bottom. 
From the middle of the column a side flow is 
withdrawn, with on average, middle-cut com¬ 
ponents (i.e. with an intermediate boiling point). 
This material is fed into a smaller “side- 
stripper”, which sends the relatively heavy 
components into its bottom stream and recycles 
the lighter components into the main column. A 
multivariable model was required for controller 
design, specifying the dynamic relationship 
between two inputs (i.e. small set point changes 
on reflux flow Wi := 0^ and side-stripper flow 
W 2 -= 0 .s) and two outputs, i.e. the qualities 
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FjCi 1 Relevant IraLlion til dislillaruni prtieess dunni; data acquisilion (rKC-Flow Rale C ontioiler, 

LRC - l.cvel Rale *,"onlrollcr) 


y\ '—Qi dnd yi'.— Qu ot lighter and heavier 
components, both in the bottom flow of the 
side-stripper. In addition, intermediate outputs 
were defined: the top temperature 7^4 at tray 34 
and the bottom temperature 7^2 at tray 12 of the 
main column. These easily measurable tempera¬ 
tures have the advantage to be usually highly 
correlated with the less easily attainable quality 
measurements. The latter suffer from se>'ere 
delay (in this case 8 mins) and inaccuracy. Under 
difficult, but realistic fieM conditions an experi¬ 
ment was carried out using Pseudo Random 
binary Noise (PRBN) test signal perturbations 
on the two inputs (set points), whereas feed rale 
and composition, set points of reboiler duty and 
level range controllers were not manipulated. 
Together with these test-signal realizations, 
temperature and quality variations were re¬ 
corded using a relatively short sampling period 
TJ) of half a minute (the 95% settling time was 
about 5h). A 10 n period of zero-averaged, 
scaled input-output behaviour (12(XJ samples) is 
shown in Fig. 2. 

As can be seen from the temperature data, the 
signal-to-noise ratio is unfavourable. This is a 
consequence of operational constraints imposed 
on the process excitation. Because of the urge to 


operate in a safe and economical manner, 
relatively small excitation and rapid switching of 
the flow controller set points as well as a short 
experiment length had to be accepted. Since 
these conditions are often encountered in 
process industry, the challenf^e is to produce an 
acceptable model with such limited, noisy 
experiments. This argument applies especially to 
real-time (e g. adaptive) control applications. 

The model class considered for this distillation 
process consisted of multivariable discrete-time 
ARMAX(n, m, I, 6) models (Goodwin and Sin 
(1984), Tullekcn (1987)): 

T 1 “1“ ‘ T (1) 

Here y, ^ and u are discrete-time v-dimensional 
output, white noise and /u-dimensional input 
signals, respectively, where =>'(^7^). etc. and 
T is the sampling period. Furthermore n, m and 
/ are the dynamic orders while g is the 
overall discrete dead-time index, which is 
present in all input components. The v x v Auto 
Regressive matrices A, the v-dimensional Load 
vector (I (modelling a trend component, arising 
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Fio. 2 The experimental flow/tempcraturc/qualily data over a period ol 10 h 


from disturbances, non-zero noise mean and/or 
linearization of non-lincar dynamics) as well as 
the V X V Moving Average matrices C and the 
V X /i external matrices B are deterministic 
(possibly slowly varying), unknown ARM AX 
parameters. 

In this application, v = fi-2 with u := 
[0/?. (ps^ and y ;= [7^4, 7 , 2 ]^, respectively y := 
IQ//, QlY . For a number of reasonable choices 
of the structural parameters n, m, I, d and T, 
successive ‘‘ELS-best” models of the form (1) 
have been obtained using the data presented in 
Fig. 2. 

The results were discouraging: only few 
statistically ' most likely” models did represent 
essential, known characteristics of the process at 
hand, such as stability and correct sign of static 
gains. A characteristic example for the ELS 
estimation of the case n- m-l~A and various 
choices for overall dead-time 6 (resulting in an 
input-output shift correction for process dead¬ 
time) and T (chosen as a multiple k of the basic 
sampling time 7;) = 0.5 min, by simply ignoring 
intermediate data) is shown in Fig. 3. Each 
marked pair {k, (^) represents an 
ARMAX(4,4,4, 6) ELS-optimal model dis¬ 
cretized with sample time irTJ) which, in addition, 


turned out to be stable and representing the 
correct signs of all four static gains (known from 
physical considerations); all other {k, t^) pairs 
corresponded to ELS-optimal models which did 
not represent all those vital characteristics 
correctly and were not marked. Two types of 
models are considered, having either tempera¬ 
tures Tm and T ^2 qualities and Qj as 
outputs. A relatively small number of potentially 
valuable, provisionally accepted models of each 
type fall within a "cigar-shaped” region in which 
non-oplimal choices of structural parameters 
such as T and d are apparently compensated for, 
cf. Fig. 3. 

However, almost all of these models had to be 
rejected, since additional step-response charac¬ 
teristics (i.e. the knowledge that some loops 
showed (non)minimum phase/(non)oscillatory 
dynamics; rough knowledge about settling-times) 
were not represented correctly. Lower order 
choices for n, m and / (including ARX only, 
/ = 0, using LS identification) yielded quite 
similar results. Higher order models could not 
reasonably be considered because of the 
imbalance between parametrization load (too 
many parameters) vs the information available 
(a relatively short and noisy experiment). 
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OVERALL DELAV (mm) 



U 5 10 15 


SAMPLING TIME T (mm) 

Fki 3 Doited cases refer to ELS-ideniiricd, with sample 
lime I discretized ARMAX(4, 4, 4, <^) models being siable 
and having correct sign of all lour static gains, using the data 
shown in Fig 2 Empty positions refer to ELS-optimal 
models failing for at least one ol those fcaturc'i. 

In conclusion, with this relatively noisy and 
limited data set, hardly any acceptable black-box 
model could be identified. 

2.3. Use of physical knowledge 
It is not our intention to advocate brute force 
search or discuss the obvious merits of model 
structure discrimination methods, either statisti¬ 
cally (e.g. Akaike, 1974) or using validation with 
respect to model characteristics. What we would 
like to stress is that the often heard slogan “let 
the data speak for themselves” does not apply to 
these non-ideal (finite, noisy) experiments, even 
when the process would be in the model class. 
Due to the lack of information, structural 
identified model inconsistencies arise, no matter 
what statistical data analysis method is being 


used. Still, one has to come up with an 
acceptable model since performing a prolongued 
experiment is usually loo costly or impossible. In 
those cases, availaole physical knowledge about 
crucial plant characteristics should be supplied 
and somehow be integrated in the identification 
method. Moreover, it has been widely accepted 
that modelling errors are unavoidable but 
influenceable aspects of identification practice. If 
at least the most relevant process characteristics 
are sufficiently accurately reflected, approximate 
modelling is fully acceptable, in our view this 
implies the use of short-cut, low-order black-box 
model structures where crucial information is 
translated into constraints on model parameters, 
restricting the “black-box” models to a physi¬ 
cally more consistent “Grey-Box” model class. 
Systematic errors (Goodwin and Salgado, 1989) 
can be circumvented or obstructed with such an 
appioach. A similar line of reasoning applies to 
the non-linear modelling approach. 

3 GREY BOX MODELLING USING PHYSICAL 
KNOWLEDGb 

3.1. 77ie Grey-Box model class 
Consider the ARMAX(n, m, /, 6) model class 
(1), which belongs to a more general black-box 
class of linearly parametrized multivariable 
discrete-time regression models 

:= { y, = ezf, 6 kel). (2) 

where is the v-dimensional output vector of 
the process, 6 is a constant, unknown parameter 
matrix, z* a known vector of p regressors and 
a v-dimensional (discrete-time) white noise, all 
at time k. This can be seen from the following 

[yJf-‘ ’ \y k-n\ ■ ■ ■ 

* ■ ■ \t^k-m ft I ^k-l\ ’ ■ ■ \^k-l I !]• 

However, m general the regression vector Zf, can 
also be a non-linear function of selected 
regressors which may be motivated with physical 
arguments (conservation laws, use of dimension¬ 
less groups). Our aim will be to force important 
physical structure upon the models in by 
constraining the candidate model parameters 6 
to a Grey-Box (GB) model parameter set Q, 
defined for suitable (in general matrix-valued) 
functions / and g as 

Q := {6/ F I f{6) = (J; g{e) ^ 0}. (3) 

This subsequently leads to a Grey-Box model 
class 

{y, = (9,^ -P eJ e Z; (9 £ Q} c (4) 
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each model in being consistent with the a 
priori physical knowledge. 

Imposing physical constraints on statistical 
models is not a new topic. Much has been 
written for the case of equality constraints 
(/(0) = 0), as it often happens that we can fix (a 
combination of) parameters beforehand (e g. 
Goodwin and Payne, 1977). A typical example is 
in dead-time correction for multivariable 
ARMAX models where some elements of 
leading B matrices in (1) are set to zero. A more 
advanced application uses a Dynamic Quantity 
Interaction Diagram from which it can easily be 
seen which parameters in a multivariable 
ARMAX model must equal zero (see e.g. 
J0rgensen et al, 1985). However, inequality 
constraints on black-box model parameters 
(g(0)>O) received only little attention in the 
statistical literature, probably because of the 
foreseen additional complexity and the ‘‘let the 
data do the job” attitude. For instance, 
Goodwin and Sin (1984) and Liew (1976) do 
make some general remarks about estimated 
parameter consistency. On the other hand, texts 
more related to physical (non-linear) modelling 
and estimation (such as Bard, 1974) emphasize 
the importance of this kind of model structure 
enhancement. In the following sub-section we 
will focus on inequality constraints solely. If 
present the constraint f{6) = 0 may be reformu¬ 
lated in [f{d)\ —f(0)] ^0 which can be included 
in g{9) >0. For reasons of simplicity and in view 
of the estimation problem we restrict ourselves 
to convex and even linear g. Consequently (3) 
reduces for suitable matrixes A (possibly rank 
deficient) and B to the convex, possibly 
unbounded set 

Q:={0eI \Ae^B}. (5) 

Vital, important knowledge such as admissible 
areas for poles and static gains (often denied by 
identified model results) can be specified in 
terms of (5), 

3.2. Inequality constraints from physical 
knowledge 

In this section we consider the discrete-time 
transfer function matrix 3^' of the deterministic 
part of (1) where is the unit shift backward 
operator and sd and ^ are polynomial matrices: 

= ( 6 ) 

with j!/(z-') = /+ E A,z-‘; 3S(z-')= E 

1-1 i^\ 

We will derive inequality constraints on the 
parameters of the model (6), necessary for the 
satisfaction of known properties of the underly¬ 


ing process. 

3.2.1. Admissible pole locations yielding an 
admissible parameter set. First we will consider 
the situation that the plant is known to be 
open-loop (perhaps including low-level control¬ 
lers) stable in the sense of Lyapunov. In the 
prt>cess industry this holds for the vast majority 
of the processes, with e.g. exothermic chemical 
reactors being one of the rare exceptions. In 
many cases the plant is even known to be 
asymptotically stable, with additional knowledge 
about an upper bound for the settling time. We 
shall aim at the formulation of the sharpest 
necessary linear inequality conditions on the 
coefficients of .5^ in the model (6) that emerge 
from a given admissible area for the poles of 

M(z~') = I + A^z''-^A2Z (7) 

i.e. the roots of det ,^{z ') = 0. In other words, 
we are interested in the smallest, linearly 
hounded convex hull of the admissible parameter 
set. In the general multivariable case one is 
rather restricted with respect to deriving 
necessary linear constraints since the associated 
admissible parameter set will often be un¬ 
bounded and non-convex. Since stability is an 
important special case, the following may serve 
as a simple illustration. 

3.2.2. Example 2: stability constraints for 
two-dimensional first-order AR system. Consider 
a first-order discrete-time AR model with two 
outputs described by: 


Its asymptotic stability is equivalent with the 
roots of det j^(z) = det(z/+ A) = -1-ciiZ-f 

02 = 0 being strictly within the unit circle, where 
fli := trace A = fln +^'22 and a 2 :=detA = 
a, 1^22 “ ^ 12^221 B is well-known that necessary 
and sufficient conditions for the stability of 
-I-fl,z + fl 2 = are (Jury (1974); Goodwin and 
Sin (1984)): «;<! plus 1-I-Oi-E a 2 > 0 and 
l-ai-ha2^0. Since 02 is a non-linear, non- 
convex function of the original parameters (the 
Hessian of 02 with respect to the four a,jS is not 
positive-definite), the stability region is non- 
convex. Moreover, it is unbounded. However, 
we can state the following necessary linear 
constraint in this case, resulting from the 
substitution of a 2 <l in the other two 
inequalities; 

--2<aii-f ^ 22^2 |traceA|<2. □ 

Definition 1. Let C(m, R) denote the set of 
complex numbers within or at a circle with 



Grey-box modelling and identification 


291 


real-values centre m and radius R, i.e. 

C{m, R) := {z e C| ||z - m\\ ^R\ 

m £U\R 

Lemma 1. Necessary linear conditions (not the 
sharpest which will be discussed below) such 
that all roots of the polynomial equation 


z' + a, a ‘ + ■ 
are 

+ fl 1 z 4- = 0 

fall in r(0, R) 

1 

11 \ 

. for 1 - 1 . 2 

\i/ 

_ /. 

Proof. Let the 

roots z,, . . . , z, 

eC{0,R), I.e 

\z,\ ^ R. For the 

lih coefficient a, 

one can write 

kl- 




1 - k 

, A.' k 1 


<■ 

V h 1 . . 

- 1^ ! 


ZJ ^A,I 

KA 1 

1- A, 

A 1 A / 






1- A, A k - i 


= [number ot ways to select i 

objects out ol l\x R' =- R'l^^ ■ 

Theojem 1 11 all nv poles ol (7) arc in C’((), R) 
then Itiaccyl.l-/^A/v and \dci A„\ - 

Proof Consider the polynomial following from 
the expansion of det ‘) - 0 

Since the roots ot this polynomial arc in C (0, R), 
It follows from Lemma I that |aj ^ 
for \ ^ t ^ nv On the other hand, the co¬ 
efficient fl| of z in the above equation equals 
trace A , as can be easily checked. Thus 

Itrace y4,| = |n,| - 

Morcov^'r |dety4„| = ■ 

It will be clarified below that the poles ot 
discrete-time models that emerge from correctly 
sampled, continuous-time systems cannot be 
situated in C', the open left half of the complex 
plane. Let the underlying continuous-time 
system have n poles of sufficient physical 
significance (in the sense that they contribute to 
the dynamical properties in a non-neglectable 
way) say cr, ± jfi, with ft, 0. If a discrete-time 
equivalent model of the same degree is being 
used, It is well known that its poles are given by 
exp T{a,~^jp,) = c\p(Ta,)\coh(rf3i ) Ty sin ( 7 %)] 


where T is the sampling period. Let T be 
selected correctly, then the sampling frequency 
must necessarily satisfy In IT ^ 2 max, P, 0, 
i.e. minimally twice as fast as the fastest 
eigenfiequency of practical relevance. This 
choice is based on the sampling theorem (e.g. 
Franklin and Poweli (1980)). However, for the 
choice of the sampling frequency the -3 dB 
frequency of the system is more important. As 
the -3 dB point usually will be tar beyond 
max,/3,, one should select the sampling fre¬ 
quency considerably faster than the conservative 
lower bound mentioned above. Thus, it is 
perfectly reasonable to asume that the sampling 
frequency at least satisfies InU ^ 4 max,/3, 0. 

Consequently, the real parts of the discrcte-timc 
poles (i.e. :3^e(n, ±y/i,) exp (La,) cos (T/3,)) 

are situated between 0 and exp (7'a,), i.e. 

0 ~ exp (Tor,) cos (jr/2) 
exp ( Ter, ) cos ( 7p ,) 
exp (Tnr,) 

We may therefore conclude that poles of 
dLscrete-time models that emerge from correctly 
sampled, continuous-time systems cannot be 

situated everywhere in C^O, max exp (7’a,)j, but 

in addition, must be situated in d . This 
additional knowledge can be translated into 
inequality constraints on the parameters too. 

Lemma 2. A necessary (but not sufficient, 
except for degree / 2) condition for the /.eros 

of df^ - fry,tTc/„ (Ce<I , d, eM) 

to have positive real parts, is- 

d,^[) (/ = 1.2, . . . , /). 

Proof. See e g C asti (1977). ■ 

Theorem 2 If all poles of the general model (7) 
have non-ncgative real parts, the following 
inequalities hold, trace A, ^ 0 (linear) and 
(-11'*" det A„ ^ 0. In case (7) is scalar (v = 1), its 
coefficients a, satisfy (-l)'a, n. 

Proof. Since by assumption the zeros of the 
characteristic polynomial (8) have non-negative 
real parts, all zeros of the associated equation 
(-2)"^ + fl„._,(-z)"^">+ (-7)^ no = 0 

will have non-positive real parts. Application of 
Lemma 2 yields that ( —1)V/, 2:0, for all 
I 1,2, . . . , nv. In particular, ~ trace A, = 

and (-l)""det A„ = (-l)'’Xn ^0. ■ 

From here on, we will consider the derivation 
of linear inequalities on the parameters of (7) for 


Aino Wiz-o 
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the more general situation in which it is known 
that all poles of (7) are in circle C{m, R), m e R, 

£ 0. This formulation has considerable practi¬ 
cal relevance. Trivially, if one knows that the 
process is stable, m = 0 and R = 1 can be 
selected. If the magnitude of the largest pole is 
known to be below f?max. one can set m = 0 and 
^ = ^m»»- If a lower bound for the damping ratio 
of a stable oscillatory process is known, the 
logarithmic spirals that embrace the po.ssible 
locations of the relevant complex pole pairs can 
be covered by C(m, \ — m) for suitable m>0. 
Furthermore, any combination of a number of 
circle requirements on the process poles can be 
accommodated with the intersection of the sets 
of constraints that each circle requirement will 
be shown to introduce. Of course the “.sound 
poles’’ constraints (Theorem 2) can be added 
too. 

It is convenient to study the €( 2 , 2) ca.se first, 
since the associated parameter constraints are 
relatively easily derived and allow a simple 
generalization to the C(m, R) case. Consider the 
Mobius-transformation z{s)=l/{\-s) which 
maps the imaginary axis in the j;-domain onto 
3C(2, since ||z(j = jo)) - ^|| = | for real to 

and (due to continuity), C~ into this circle. This 
means that the roots of (7) are at or within 
C(i, 2 ) iff fbe roots of the Mobius transformed 
equation (7) are not in C^. Substitution of 
2 ~* = l-5 in (7) yields jj/(1-.v) = / + /4|(1- 
j)-t-^2(1 - + • • • + A„(l - A')". Binomial ex¬ 

pansion shows that the coefficient matrix /)* 
corresponding to the power / is 



(0-A:<at), (8) 

where we have set Aq:= L These matrix 
coefficients constitute a polynomial matrix 
^{s) := D(| + D^s + D 2 S^ 4- ' ■ ■ T Based on 

the above established relationship, the roots of 

(7) strictly are in 6 ( 2 , 2 ) iff roots of 
det2)(*s) = 0 have negative real part. Necessary 
and sufficient stability conditions, characterized 
in terms of the coefficients of ® (e.g. Shieh and 
Sacheti (1978); Shieh et al. (1987)) for the 
general, multivariable case, invariably lead to 
non-linear constraints not suitable for our needs. 
However, in case all Af^ in (7) are either upper 
or lower triangular matrices (and thus all in 

( 8 ) being triangular loo) it is possible to describe 
a smallest, linearly bounded, convex hull of the 
admissible parameter region. The first observa¬ 
tion is that the non-zero off-diagonal tenn.s may 
have any value. They do not affect the position 


of the poles at all, as can be seen from 
det = 0. Secondly, the analysis simplifies 

to the question whether the poles of each of the 
V scalar systems of the form (corresponding to 
each diagonal term of . 2 /( 2 ”’)): 

fl(z'‘) = 1 + 022 '^+ ■ ■ • 4-fl^z"'', (9.1) 

are strictly in i) (for simplicity we have 
dropped the indices of the individual polyno¬ 
mials). Equivalently, wc could ask whether all 
poles of 

d(.r) : = df) T d^s T 4 ■ ■ • T d„s'', 




(9.2) 


are in C~. Using Lemma 2, we may conclude 
that d^ ^0 for Consequently, 

necessary ctmditions for the zeros of (9.1) to be 
in C( 2 , 2 ) 



0 
0 
0 

( 10 ) 

0 

()_ 

or, briefly, i^Axa^O where u := [ I, a,, . . . , a„Y 
is the coefficient vector and lA is an n x n upper 
triangular matrix with ['^ 1 ,; = (- 1 )'({) for each 
non-zero (r, /)th element I ^ n,{) ^ j ^ i). In 
other words, the triangular part simply equals a 
sign-corrected version of Pascal’s triangular 
expansion. The recurrence in (10) allows simple 
expansion, whereas all lower-dimensional cases 
remain available. The 4 1 equations in (10) 
actually describe a convex set in the n- 
dimensional coefficient space, known as simplex. 
This simplex is spanned by all linear combina¬ 
tions of n 4 1 vertices. These vertices can be 
read from the columns of A,, skipping each first 
element because of the leading constant in the 
parameter vector. This can be seen from the fact 
that the /th contribution ( 0 ^- y ^n) corresponds 
to a parameter vector with /th element (-!)'({) 
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for 1 ^ r j and zero for y 4 1 j n. 
Consequently, it represents a system a( 2 '^) = 

having (n - /) poles at z = 0 and y at z = 1 . Since 
these poles arc at the edge of C(i, k), the 
associated yth parameter vector must be on the 
boundary of the admissible parameter region. 
Since this is true for all y, the set of linear 
\.onstraints ( 10 ) constitutes the smallest convex 
hull of the admissible region. 

We will now discuss the simple relation to the 
general case where the poles ot (9.1) arc in 
C{m, R). Note that C{m, R) is the image of 
C(i, 2 ) under the linear mapping q{z) ’= yz p 
where y := 2R and fi .= m — R. Theretore, if the 
poles of a(q) = 0 are in C’(m, R) this implies that 
the poles of fl(yz4^) = 0 are in r(], \). Since 
we have derived necessary conditions ( 10 ) for 
that property, we expand q'‘ci(q) in z, 
introducing coefficients cy. 


to the spanning vertices have all their poles at 
m±R, i.e. at the edge of r(m,/?), (11) 
constitutes the smallest convex hull of the 
admissible parameter region corresponding to 
scalar systems having all poles in C{m, R). 

3.2.3. Example 3: Stability constraints for 
scalar AR system. It is straightforward to derive 
conditions for (asymptotic) stability of (9.1) from 
the (strict) inequalities (11). Since the unit circle 
is the discrcte^time stability boundary, we set 
m = 0 and R = \ , yielding 

For example, x y becomes, for n =3: 


"1 

1 

1 

r 


" K 

0 

0 

()■ 

0 

-1 

-2 

-3 


-12 

4 

0 

0 

0 

0 

1 

3 


fi 

-4 


0 

_0 

0 

0 

- 1 _ 


-1 

1 

- 1 

1 _ 


^ 'jn - / 4 I 


(yz^pya(yz 4/f) 

” (yz 4 py 4 afyz 4 py ’ 4 • • ■ 4 
^ y”lz'' 4 r,z" ' 4 • • • h r„ ,2 4 rj. 

Application of (10) yields f/i* x r ^ 0 where 
(■;-[1, c’l, , , . , (:„J^ On the other hand it is 
easily seen that the /:th coeflicient ( 0 ^ /c ir ai) in 
c equals: 



stating that c = .T X a where 7 is a lower 
triangular matrix with ['/% = P' 'y\L]‘)IY ' 
for non-zero elements. In conclusion, we have 
shown the following. 

Theorem 3. If the poles of 1 4 c/,z ‘ 4 UjZ " 4 
■ • ■ 4 a„z " are in C'(m, R) then the following 
linear constraints on the :fficienls are satisfied: 

.4? X y X [1, fl,, ^2. • ■ - - d, (11) 

where the non-zero elements of the right and left 
triangular matrices and !/' are defined as: 

[n, = {m-Rr '(f~.y.^y2Rr ■ 

Note that since y is linear, the admissible region 
given by (11) remains a simplex. Likewise, its 
4 1 spanning vertices can be read from the 
columns of yM x y) skipping each first element. 
Consequently, since the systems corresponding 


" 1111 " 

_ 3 1 - ! 3 

~ 3 -1 - 1 3 ■ 

_1 -1 I “L 

So. it is seen from the coefficients in the first and 
the last matrix that linear C(\, \) and stability 
opstraints lor the third-order system y^ 4 
^ly’k I + ^zyk : y ^^yk d are: 

I 4 4 cIt d; I + i f/, a ^ 0 , 

“ n, — 2 fi 2 - 3n^ ^ 0; 3 4rii r/ . *0, 

Ui 4 3(i ^ 0; 3 — - 1 - 3ci^ '0, 

— 1 u, 4 ii 2 " \ - d. 

The induced simplices are shown in Fig. 4 
together with their spanning vertices. For 
example, ( 0 , 0 , 0 ), (- 1 , 0 , 0 ), (- 2 , 1 , 0 ) and 
(- 3, 3, - 1 ) span the r’(') simplex. Jury (1974) 
and Peeters (19K7) have treated the stability ease 
directly. However, Theorem 3 is more transpar¬ 
ent due to the x y decomposition and it is not 
restricted to stability. 11 

3.2.4. Constraints on 'itationary f^ains. Often 
not only (asymptotic) stability ot the plant is 
known beforehand, but also admissible intervals, 
or at least signs, for the several static gains in 
such a stable multivariable process. As argued in 
Section 2, “best” identified models often 
reproduce low-frequency characteristics poorly, 
if at all. The significance of a priori knowledge 
must, therefore, not be underestimated. It will 
be shown in the sequel that inlormation about 
the (sign of the) static gain will introduce a 
dramatic increase of overall model quality. This 
can be crucial to satisfactory controller design. 
For instance, Nemir and Kashyap (1986) show 
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Fig 4 Convex hulls of Ihc stability and C(l/2, 1/2) regions 


that in the case of a minimax one-step-ahead 
variance criterion, open-loop operation of a 
stable process is optimal when the model suffers 
from uncertain low-frequency characteristics. 

When the deterministic part of the process has 
an asymptotically stable transfer function (6), 
the stationary gain matrix ^ equals H(])~ 
[.rf(l)]"‘i39(l). Now, suppose is known to be 
between a lower bound and an upper bound 
^, i.e. ^ in an element-wise fashion, 

allowing ±^. If J2/(1) 0, one may conclude that 

^ 9^{\) ^ or equivalently: 

(/ 4* + ■ ■ ■ -f i4„) X ^ 

< (Bi -h ■ • 4“ 

+ ( 12 ) 

Note that .s^(l)^0 (element-wise) is not a 


consequence of asymptotic stability of si^ in the 
general multivariable case, even if M is 
triangular as we assumed before. However, it is 
true if the asymptotically stable M is of diagonal 
shape. This is a necessary consequence of the 
fact that in that case each nh scalar system 
0 ,( 2 "*), read from the /th diagonal element 
of .srf, has to be asymptotically stable. Then the 
first equation in (10) implies (cf. Example 3) 

= ^/,(1) = J] a\>{) for all u where a\ is 

the kih coefficient of a,. For the popular class of 
FIR (Finite Impulse Response) modelling, 
where the autoregressive model part is missing 
(n =0), additional opportunities arise because of 
the straightforward, linear relation with the 
successive discrete-time step respon.se matrices 
!/'ikT), k^ \ . In practice, often some idea about 



Fig 5. For FTR models a tube’ of acceptable step responses can be easily translated into linear Grey-Box 

model constraints 
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a “tube of possible responses’, bounded by 
maiTices and .^ {kT) at discrete-time 

index k, is established over time. This in¬ 
corporates aspects as dead-times, (non)mimmum 
phase properties, rise/setting time and overshoot 
(cf. Fig. .S) yielding for k 


(6 I Q) I Z, namely 

/h|z.u(^^|Z-^06Q) 

I ^ I ^ 6 Q )_ 

/zj(0|o)(^ I I ^ e Q) df^ 


X B,^J^(kT). (13) 

I .m) 

4 BAYESIAN INFERENCE OF GB PARAMETER 

DISTRIBUTION USING PHYSIC AI. KNOWLEDGE 

4.1. Introduction 

In the previous section we have defined a GB 
mode) class (4) with GB parameter set (3), 
containing physically consistent candidate model 
parameters 6. It was shown how important 
physical information could be translated in the 
inequality form (5), see Theorem 2 and 
equations (in-(13). In this Section we will 
adopt a Bayesian approach and regard the 
unknown parameter estimator ot 6 to be a 
random variable (r.v.), denoted 0. Using 
Bayesian inference, we will construct the 
following conditional posterior p.d.f. for a 
realization 0 — 


/h|zo(^^|Z-z,0eQ), (14) 

which is conditioned not only upon the data 
stack Z (Z = z denoting observations) but also 
on the probabilistic Grey-Box statement 0cQ. 
Hereby we express that any realization of 0 
must be consistent with the a prion given 
knowledge, i.e. e y. Our aim will be to 
construct (14) from the likelihood Z and a prior 
p.d.f. which is associated with the GB model 
class. 

4.2. Bayesian inference in Grey-Box estimation 
The rule of Bayes plays a central role in 
Bayesian statistics, e.g. Box and Tiao (1973) or 
Press (1989). Let Pr {A \ B) := ?t (AB)/Pt (B) 
denote the conditional probability of the event 
A, given the occurrence of event B. Since 
Pr (AB) := Pr (A and B) of course equals 
Pt{BA)-Pt{B and A) it follows directly that 
for Pr (B) > 0 


v4)xPr { A) 

^r(B) 

which IS known as Bayes' rule. Setting A = 0 | £2 
and B - Z, the rule can be used to relate the 
posterior p.d.f. (14) of the s.v. (0 | Z, U) := 


Pt(A I B) = 


Pi{B 


to the prior p.d.f. /d|c(0 | 0 6 Q) (which equals 
zero for ^ Q) and to the likelihood 
with data z, given prior parameter 
estimate (0 | £2) = The integral in the 
denominator of (15) is providing consistency 
with Pr ((0 I Z, ^2) e Q) = 1. This is crucial for 
the deiivation of one ot the constrained 
estimators in the next section. However, for our 
needs it is not necessary that the prior or the 
likelihood be normalized. Figure 6 illustrates 
how the combination of the above introduced 
elements i*' organized. 

An appealing feature is that the likelihood 
occurs explicitly in the calculation of (15), 
allowing for calculation of conventional results 
as well. First we will consider the likelihood. 
Given are N output vectors measured at 
successive lime instants and stacked in a 
measured data matrix In addition, a number 
of known input vectors and noise realizations 
i/, (assumed to be correctly reconstructed or 
measured) up to time index N are in matrix 
stacks 11^ and We assume that the output is 
generated by a (with true parameter 

1 ^*) from a general class of linearly parametrized 
regression structures 

Z^ ' |y* ' 1 t/"' ‘I 5"^“']. 

For convenience, 6 is considered to be a r/ vp 
dimensional vector, consisting of the columns of 



Fifj h The Bayesian creation of the poslenor from prior 
and likelihood. 
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the earlier introduced v x p parameter matrix 6. 
It ivS an alternative representation for (2), which 
includes the ARM AX “process” (1). The 
successive residues ^ are assumed to be 
independent, but not necessarily identically 
distributed v-dimensional s.v.s. Then, given this 
independence, N repetitions of Bayes’ rule 
applied to the likelihood of given the 
regressor and parameter 6 will result in 

(Goodwin and Payne, 1977): 

I 6>) = n /s.(l*(e) I y*- Z^-\ 8), 

k = l 

where |*(6^) '= yk ~ x ^ is the pre¬ 

diction error. We will restrict ourselves here to 
the case where the white noise sequence is 
normally distributed with common covariance 
matrix X and mean vector ]U = 0, i.e. ~ 
>^(0, X). Then the above likelihood is Gaussian. 



However, other choices for the likelihood can be 
made when applicable, e.g. from the assjumption 
of a uniformly distributed white noise sequence, 
which case has considerable practical relevance. 
The other branch in Fig. 6 defines the prior 
p.d.f. Obviously, this is chosen to be zero 
outside the GB set t2, sec (3). The choice of the 
shape within Q often is a subjective one. We will 
focus primarily on a prior that is uniformly 
distributed on Q: 


/•7„(0|0en) = 



for 0 £ Q 
otherwise. 


Using this p.d.f. definition it is necessary that the 
integral exists, i.e. Q must be bounded. 
However, with this uniform prior, the posterior 
p.d.f. becomes a to Q “restricted” likelihood: 


r:;,je\z = z,&s^) 



for h ^ Q 
elsewhere. 


(17) 


see (15). Limiting arguments show that (17) is 
also valid when is unbounded if the integral in 
the denominator exists. Contrary to common 
belief, the uniform prior is usually informative 
on Q (it thus adds knowledge, although it is 
constant on Q) but some aspects of it are not 
fully satisfactory (Peterka (1981); Bard (1974); 
Box and Tiao (1973)). In our philosophy, the 


prior distribution must be made as informative 
as can be, which is usually not the case with a 
uniform prior. However, a uniform prior is 
attractive since precise ideas about the likeliness 
of each model parameter in £2 are often lacking. 
Moreover, it allows of analytical derivation of 
the constrained estimators. 

An alternative, and useful novel prior, 
allowing of prior model discrimination within fi, 
is a (continuous) prior Piece-wise Linearly 
distributed on £2. Here £2 has the form (5) and is 
assumed hounded. This prior is chosen to be 
zero outside £2 and also at its edge dQ, and is 
given some global, positive maximum M 
(normalizing constant) at some fixed interior 
parameter which is considered a priori to be 
most likely. Furthermore, at the line segment 
from any parameter 0'^ on 5£2 to 6^^ ^ dQ, the 
prior is increasing linearly from 0 to M 

/^lo((l - A) X 0" + A X I 0 e £2)) = A X A/. 

OsAsl, e''€di2. (18) 

This definition is consistent since the interiors of 
all such line segments are disjoint and in £2 
because of the convexity of £2. If £2 is 
unbounded (this is the typical case in applica¬ 
tions) we could add bounding constraints to (3), 
which can always be chosen such that the 
integral of the likelihood over the ignored part 
of £2 is neglectable. Much more attractive (since 
it is easier and more flexible) is an extension of 
the pi prior with a uniform component in the 
unbounded case. To this end, consider the rth 
facet (9£2, (with 1 f ^ where C is he number 
of constraints) on the edge 9£2 of £2, which is 
defined as the admissible subset with active ixh 
constraint: dQ, {0 e \ al 0 = b,]. Next, a 
polyhedron £2, c £2 can be defined by forming all 
possible linear combinations of O' e i3£2, with 0^\ 
i.e. 

£2, = {6e£2 I fy - (1 - A)r 4- A6/^ 0 ^ A 1; 

0^ €dQ,\0''eQ\di2). 

Note that £2, is a cone with top 6/^' and base 
hyperplane a!6 = b,, containing the facet (3Q,. 
The p/wriform (Piece-wisc Linear with possibly 
Uniform component) prior is defined as (cf. Fig. 

7) 

ri;j8 I 0 e £2) = (a!8- - h,) 

1 

if ^ £2 

if e £2, for some / (1 s: j < f) 

otherwise (i.e. a uniform component). 

Another choice, well-known from Bayesian 
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inference, is a Gaussian prior with mean vector 
and covariance matrix IL In the GB context 
one can consider a Restricted Gaussian prior: 

- for^^eQ 

elsewhere, 
(19) 

where C ^ 1 is Ihe p.d.f. normalization constant. 
Note that il may be unliounded. However, a 
practical drawback of this prior is the non-lrivial 
specification of the prior parameter covariance 
matrix 11, with the possible exception of FIR 
modelling. 

S. BAYLSIAN MAP AI4!» AAP ESTIMATORS 
5.1. Introduction 

In the previous section it was explained how 
the posterior (15) can be calculated. In addition, 
some specific choices ot the prior and the 
likelihood have been presented. Once a 
posterior has been made available, it presents 
information about the “likeliness” of the very 
many potential models, based on data as well as 
prior knowledge. However, this information 
does not imply directly which model should be 
preferred. Thus, since for practical reasons we 
need one particular model, a criterion should be 
chosen in order to define what is “best”. In this 
section we will consider a Maximum A Posteriori 
(MAP) and an Average A Posteriori (AAP) 


c exp ( - 

\/(2jrr'd^Il 

0 


estimator. Both are Bayesian analogies of 
standard (non-Bayesian) estimation concepts, 

5.2. Definition of the MAP and CML estimator 
The conventional Maximum Likelihood (ML) 
estimator selects the parameter that maximizes 
the data likelihood (Eykhoff, 1974; Goodwin 
and Payne, 1977): 

0M, argmax//|H(2 | 0 ), 

ff ^ ‘r 

over an admissible parameter set which is 
usually unconstrained, = We define a 

Maximum A Posteriori (MAP) estimator using 
the same type of crilerion (Box and I'iao, 1973; 
Ljung and Siiderstrdm, 1983; Goodwin and Sin, 
1984) where the posterior replaces the 
likelihood: 

0mai>-= | 0 ) 

Utli 

( 20 ) 

If the prior in (20) is uniformly distributed on Q 
(cf. (17)), this “Bayesian” estimator is equiv¬ 
alent to an ML estimator that is constrained with 
respect to = £2. We will therefore refer to it as 
MAP/un or Constrained Maximum Likelihood 
(CML) estimator. Note that 

otherwise, ’ 

where the latter expression is the most likely, in 
general non-orthogonal projection on o£2 of the 
unconstrained ML estimator, cf. Goodwin and 
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Sin (1984). However, when the prior is 
nonuniformly distributed, this relation is more 
complicated. We briefly treat the interesting case 
where the prior is piecewise linearly distributed 
(cf. (18)) on a bounded, polyhedral set Q of the 
form (4), with Gaussian likelihood (16) and 
linearly parametrized regression model. The 
associated estimator seems to be more attractive 
than the CML estimator, since it is composed of 
a (in general) more informative prior. Moreover, 
it will never be on dQ. This is attractive as it is 
unlikely or even impossible that model para¬ 
meters are on 8^ when Q follows from reliable 
knowledge. With the piece-wise linear prior 
assumption, it is attractive to decompose Q in a 
finite number of almost disjoint cones Q, which 
have the vertex 6^^eQ\dQ as common top, cf. 
(19). Moreover, the prior behaves strictly linear 
on each cone. Each linear prior component 
follows from the fact that it is positive (the value 
is not crucial and can, e g. be chosen unity for all 
indices) at the inner vertex 6'^ and zero at the 
vertices on 3£2. 

Theorem 4. If the maximizing argument of (20) 
is an interior point of some cone 9^ czQ having 
base facet QD 31^ [6 \ a'0 -¥ ft = {)} and 

top we obtain for A := ©KJap 

x[a/A+i (22) 

Proof, If the maximizing argument of (20) is in 
the interior of one of the cones that constitute 
fi, say P, it follows from: 


= arg max 

f}c ^ 


{a'e + P) 


X exp 



(yk 





where is the non-negative linear prior 

behaviour on P (zero on dP), Equating the 
derivative of the posterior to zero yields the 
necessary condition for the unconstrained case: 

exp { - 2 E (n - '{y, - :¥,d)\ 

' k^ I ) 


X 


a - 4- fj) 


2 





from which it follows, introducing 
<p:= S and 3^-.= ^ 

k^l 

0 = 3'~^[a/{a^B 4- jS) + qj] = ^~*[a/A + <p], 

stating the desired result. The real-valued 
quantity A is the non-negative solution (since 
must be admissible) to the quadratic 
problem 

A^ - (a^3F~^q) -h /3) x A — = 0. ■ 


Note that the solution (22) for ar = () reduces to 
the standard ML case, here resulting in a 
Weighted Least Squares (WLS) estimator 




WLS 


2 3^1^- V* I. (23.1) 
k r] J La-1 J 


since 2 is assumed known (Goodwin and Payne, 
1977). We can thus interpret (22) as a shifted 
(corrected) WLS estimator. When it is not an 
interior point of one of these cones, it must be 
on some interior intersection of at least two 
cones, since the posterior equals zero on 3Q. An 
algorithm can be given which systematically 
locates the maximum. Apart from some 
remarks, we will discuss it elsewhere. It has 
some relation with the projection algorithm 
needed in the CML case (uniform prior) for 
Gaussian likelihood and polyhedral Q, which is 
explained in detail by Liew (1976) and Peeters 
(1987). As in that case the parameter condi¬ 
tioned with respect to data is a Gaussian 
distributed s.v. © | (Z - z) - ^"(p, H) where 
(see (23.1)) 


n := 


n;=var6w..s= 


, (23.2) 


since the prediction errors are assumed to be 
linearly dependent on B. In the other cases, we 
consider the normal assumption a reasonable 
one. In order to solve the CML and MAP (for 
piecewise linear prior) problems it is very 
convenient to introduce the linear transforma¬ 
tion B := ^ x(B - p) as it (stochastically) 

orthogonalizes the transformed parameter. Here 
9^ is a Choleski factor of the above covariance 
matrix H, i.e. n = 9A^CA. The above notions are 
for the transformed, two-dimensional situation 
illustrated in Fig. 8. It shows a situation with 
transformed mean |U € Q, respectively ft (Q (the 
circles represent equidensity curves of the 
transformed Gaussian s.v.). In addition, the 
CML projection onto dCi is shown for the 
non-admissible mean. Figure 8 also offers some 
insight into the MAP with piecewise linear (pi) 
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(SITUATION 1| (SITUATION 21 

FiCi. H ML, C'MI, and MAP,7)/u cslimalnrs in translormed two-dimensional space 
(using orlhogoiiiilized <^laiissian Likelihood). 


prior (with centre if) cslimalor. In both 
situations, MAP//?/ is in or on ihc edge ol the 
cone (in this case a two-dimensional simplex, i.e. 
triangle) in which the unconstrained ML or its 
CML projection is located. When it is within this 
simplex, the translormed result is on the line 
segment that starts off with the LMI. estimator 
and which is perpendicular to and directed off 
the edge, cf. Theorem 4. A detailed discussion 
of the merits of the novel MAP//?/n estimator 
and its properties together with an efficient 
algorithm will be published elsewhere. 

5.3. Definition of AAP and ( MV estimufor 
the conventional Minimum Variance (MV) 
estimator is defined as the following conditional 
expectation 

Omv:=[ 

J[Ht ' r 



This coincides with the ML estimator for a 
symmetrical likelihood and minimizes the mean 
parameter error variance (Ljung and 
Sbderstrom, 1983). We define an Average A 
Posteriori (AAP) estimator using the same type 
of criterion, replacing the likelihood by the 
posterior (15): 

0 AAp:= ©efi] 

[ 0X/;.|,e|„,(z lfl)x/e|„((^|0eQ)df^ 
_ Ai_ 

A 2 

Next we will discuss, in some detail, the 
analytical calculation of the AAP estimator 


when the prior is uniformly distributed on a 
polyhedral set e | </)} Similar 

to the situation where MAP for uniform prior 
simplified to restricted ML (CML), we can 
interpret AAP/un in the uniform case as (with 
respect to Q) constrained MV (CMV) estima¬ 
tion, which equals, cf. (17) 

©f'Mv = I 0 ^ I f^) df// 

hi 

X f / 2 t|,h |»)(2 I 0 ) de. (24) 

hi 

The denominator is needed for p.d.f. normaliza¬ 
tion. For notational convenience and to stress 
the generality, we will introduce the abbrevia¬ 
tion A' := 0 I (Z “ z). Thus we are interested in 

©C MV - I X E S2] = I AX < bl 

where A is a t x r; matrix (r/ vp) and h is a 
C-vecior, following from (5). Observe from (24) 
that 

nx\AX^h\^^-^ I 

xf^{x)dxlPxiAX^b), (25) 

writing ^ for the unconditioned mean. 

From here, we assume X to be Gaussian 
distributed with mean vector p and covariance 
matrix ri, i.e. X-'.X{t^,Tl) as we did in the 
discussion of the MAP problem. We will first 
focus on the calculation of Pr(AA" ^b) for the 
case where A is assumed to have full rank 
C To this end, define the s.v. Z :-AX. It is 
well-known that since Z depends linearly on X, 
it is also Gaussian with X{Aiii, AYIA^). Let 
W be a C ^ Choleski factor of cov (Z) (i.e. 
W^ U -AflA^) then, since W~\Z - A^)-- 
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^(0, /): 

Pr(AX^b) = PiiZ^b) 

= Pr (‘r-^(Z - Am) :£ W-'^{b - An)) 

= tl<P 

1 = 1 

[ith element of W^^{b — Am)], (26) 


where 4>[r] := Pr (;f ^r) = J'_„ djr/\/2^ is 

the probability function of x~“'V'(0, 1). Next we 
concentrate on the nominator in (25). Let ^ be 
a Tj X Tf Choleski factor of 11, that is, = fl. 
Then, using the linear transformation y := 
9t~^(x — m) with Jacobian det 9t: 


(x-n)fx(x)dx 

•'Ax Sib 

= 3e^sr[y I yeV]xPr(ye»p), (2?) 

where the rj-dimensional s.v. y:=9t~'(X — 
m) ~ X(0, /), and the set W is the transformation 
of Q: 


»P := 3{-^(Q - m) = {> e R" I A9?V ^ - ^f^)- 

Although the general case is derived along 
similar lines (it is however much more involved 
and will not be treated here), we restrict 
ourselves here to the case where Q is defined 
with a single constraint a'^x ^ b (b e R) yielding 

W={yelR''|r'V<d}; 
c := 9?a; d = b - a'fi- 

In that case we can write for the ith component 
y, of the vector Y in (27), assuming that ith 
component c, of the vector c is not zero: 


^[y, |c^y<d] = 


sign (c,) 

(27r)'''^Pr (c^y:<d) 




where we introduced vectors >»_, and c_, of 
reduced dimension r/ - 1 by skipping the ith 
component from y and c, e.g. cl, := 
[cj, .... c,_,, c,+i, . . . , c,,]. However, a tedious 
derivation (using the Matrix Inversion Lemma) 
shows that the right hand side of the above 
equals 

sign (c,) r r 

(2;r)'’'" J_. ■■■ L 

exp (-i[(y-, - M.)^n,'’(y_, - fi.) - a,]) dy_„ 


introducing 


n,:=(/ + c_,c!:,/cit 
n,-dx n,c_,/cf. 


The above integr al expre ssion equals sign 
(c,) X exp (o,/2) X Vdet n/2jr, since the in¬ 
tegrand is proportional to a Gaussian p.d.f. 
Moreover, det 11, = c?/||c'||“ and we thus con¬ 
clude to 


l?[y I c^'y = 


-c, X exp (a,/2) 
llcIlV^x Pr(c’y-d) ' 


if c, ^ 0, and 0 otherwise as can be directly 
checked. Finally, u.sing c = '^Ha, d - b - a'n and 
(26), we substitute the above in (27) and next in 
(25) to find; 

^X I a'X^b] 

^ X ||,‘^a|| X <t>|(6--a'M)/IK^“IH ’ 

where X ~ X{n. and the rth component 

of the T;-dimensional vector v equals 0 if 
c, = 0 and otherwise v, = c, x exp(((6 - a' nf 
(||c-,||^/||c||‘'- 1/c,^)). Application to X- 
©|(Z = z) with M ^nd from (2.1.2) 

results in the CMV estimator for a single 
constraint (e.g. sign of steady gain) 

0( Mv= I Z = z, «'0-fij. 


Below a simple example is given which illustrates 
several characteristics shared with the general 
MAP/un (CML) and AAP/un (CMV) es¬ 
timators. Next, some properties are highlighted. 


5.4. Example: a CML and CMV estimator in the 
scalar case 

Consider a white stochastic signal (A",, | n = 
1,2, 3 • ■ that is uniformly distributed on 
[-M, mI where m Ell'S ■ If other words, the X„ 
are independently, identically distributed uni¬ 
form r.v.s on [—M, m]- However, the precise 
value of fi is unknown and has to be estimated. 
For example, consider a measuring device 
(having error statistics that are about uniform) of 
which, e.g. the standard deviation p/Vi is 
required. In order to estimate p. we can 
consider the realization (outcome, sample) x„ 
that X„ takes at each time n (i.e. X„=x„). 
Initially we will assume that all we know is 
M ^ 0, based on consistency of the interval 
definition. What is the ML estimator in this 
case? For the general case of N realizations 
x,,...,X/>/ of the mutually independent 
Xi, . . . , Xs (stacked for notational eonvenience 
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in the random vectors, x^y and X^,, respectively), 
one finds for the likelihood L of the event 
X/v = x^, (given a value for n): 

Lpi-N = X,v fl) =/xyl„(X/y) 

= U 

n ~l 

for all n ^ N and 0 otherwise. This expression is 
easily seen to be maximized at fj = 
max(|jri|, kzl. • • . k/vl since if we reduce fi, L 
becomes 0, whereas if we increase L 
decreases. Ihus the ML estimator, using N 
realizations, is (see also DeGroot, 197(^): 

AAf/(N)=-“argmaxL(Xyv|^/) 

f, -n 

= m'Axi\X,\,\X,\,,.. .\X^\y (28) 

This is clear since no sample can have larger 
absolute value than /i so the largest is closest to 
/i. Now consider the case in which prior 
knowledge is available such as that is above 
sonic Ihreshold Mnim^ * We do 

not rule out the possibility that the statement 
it^mm is false; the consequences of incorrect 
knowledge will be discussed too. Note that the 
stancard case corresponds to ^ 0. The 
Constrained Maximum Likelihood (CML) es¬ 
timator is: 

AcAf/i-^) ~ ■fg max /.(X^, I n) 

= max(/^_,|A',|.|.Y,|.|Jr^.|). (29) 

The validity of this result can be seen as follows. 
Since fi it follows that (if mi ^ H.,,,,, w.p. 

1 . In other words, the prior knowledge can be 
viewed as an additional realization, say at time 
U, slating A'o-jiXniin- On the other hand, if 
Am/ (^) ^ tl follows from the definition of 
the ML and CML estimator that = 

Am/.(^)- This combinatioT) yields (29). 

So far the (C)ML estimator has been 
analysed It can be easily checked that, in this 
example, MAP/plu (with continuous prior 
behaving linearly for i)^ = 6^^ and being 

constant elsewhere) coincides with ML. In 
addition, we can relate the Constrained Mini¬ 
mum Variance (CMV) Estimator (24) to CML. 
Introducing the abbreviation M \= 

\x,\, \x,\, .... |jc^l), one can write: 

I ~ Mminl 

^ r / r 

Jm (2 f^r/ L (2/i)'" 

^-/v^i/(-jV + 2)|^ 

>-'^^V(-n + i)|m 


/V-l 

/V-2 A/'-^ 

/V- 1 

= -—r X max [x,!, |as|, -kvi), 

N - 2 

(A/- 3 . 4 . .S- •). 

yielding, for /V 3 

Ac Mv(A') = I X,v; fJ. H^,n] 

(30) 

This shows that the CMV estimator is less 
cautious. In the following, the mean and 
variance of both eslimators will be calculated to 
gain furthei insight into their relative merits. 
Since (30) slates that for fixed /V, CMV is 
proportional to CML, (29) is the basis of the 
following calculation of the lAh moment 
(Z -1.2,3,...)of/i,^,(A^): 


i^lAcML(/V)]' 



xdPr(|2r,|-x, lA'.k .V_ \A\\--x) 

= {—) ^ + I 

N + Lx x^i' 

^ > 

(31) 

(V -A L) 

Note that the first and the second term are the 
unconstrained and the correction part with 
respect to p Mnun- I rom (31) it is seen that the 
relative expected bias /:f( a// ‘d CMI^ is 

/^(-Mi (-V) (•'AcMi - H)ln 

' 

NA\ 

This relation confirms that if p (correct 

information), i^A(M/(^) converges linearly (at 
first-order rale) to p for large iV. Note that the 
expected bias is negative. In addition, using (30) 
it is seen that the relative expected bias ol CMV 
is 

/^C'Mv(^) ^Ac Mv(^) ~ l^)/^ 

(N-uC^—) +2 

(N -I- ])(N - 2) 

showing that if converges 

quadnitically to p for large N. In the CMV case, 
the expected bias is positive but vanishes more 
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Table 1 Rei ATtvE expected bias (in %) as a function of 
N FOR ML, CML AND CMV 

ficMiXN ) Pc MV ( N ) 

ui------ 

samples (N) fi„,„ = 0 /i„„„ - 0.9/^ = 0 p„,„ - 0 9^i 


10 

-9.1% 

-6.2% 

2.3% 

5.5% 

20 

-4 8% 

-4.2% 

0.5% 

1.1% 

30 

-3.2% 

-2.0% 

0.2% 

0 4% 


quickly. Both effects are tabulated below for the 
cases = 0 and fi„,„ = 0.9/i, 

Table 1 clearly shows that the CMV estimator 
is more biased for tighter prior knowledge; in 
the case of the CML estimator it is the other way 
around. Furthermore, using (31), the variance of 
A*cml(^) 

Acml(^) 

var 




V + 2 



N-f2 


V + 2 



V4 I 


unconstrained and the correction part of the 
variance. Since the correction term can be shown 
to be strictly positive and increasing with 
(for all yv>3) if the CML 

estimator accuracy increases as a result of tighter 
knowledge. This is visualized in Fig. 9 for 
(no knowledge), = and 

^min“ 9^/10, respectively. Note that bias as well 
as variance reduce with tighter Further¬ 

more it clearly shows that the impact of prior 
knowledge is most prominent for small sample 
lengths; asymptotically the ''profiV' vanishes. 
The above observations also apply to the CMV 
estimator because of (30), Since ArMv(^) is less 
cautious, its variance is larger, i.e. 

/N - 1\^' 

var ficMvW = ^ AcmiW- 

If the prior knowledge is correct both con¬ 
strained estimators inherit the asymptotical 
unbiasedness of the unconstrained ML 
estimator: 

lim /ic'Mv(^) = lim /icML(^) 

N—*^ /V-+ac 


(N+l)(/V-2) \ fi / 

+ 2 — 

\ fi / _^ 

(^+1)^ jV + 2_' 

Again, the first and the second term are the 


= lim fiMi (V) = 

N—y 

as can be seen from (29) and (30) together with 
Pr (lA'/vl > fi - d)> I - € lor any 6, e > 0 if V is 
sufficiently large. In case of false information 
(i-^- it follows from (29) and (30) that 

the CML as well as the CMV estimator will be 
bia.sed for all N, i.e. /ic mi (/V) = cf. Fig. 



5 10 15 20 25 30 35 40 45 50 

NUMBER of SAMPLES 

Fio. 9. Mean and variances of ML and CML estimator of fji as a function of N 
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9. In the CMV case the bias is even bigger; 
asymptotically the CML level will be met. 

lo avoid such inconsistencies, one should only 
use undeniable prior knowledge. In this simple 
example this could easily be detected but the 
observation applies to the general case too. 

5.5. Some properties of the constrained 
estimators 

It can be shown that asymptotically (when the 
number of data items N —QC ) CML, CMV and 
MAP/plu estimators converge with probability 
one to the conventional ML estimator provided 
the ‘process \s in the admissible model class 
implying that the true parameter 6 ?* e Q That is, 
the constrained estimators are asymptotically 
unbiased and efficient when the a priori 
knowledge is correct. This is a direct conse¬ 
quence of the fact that the likelihood will 
asymptotically become Gaussian distributed, 
centered at 0*, with vanishing covariance matrix 
(c.g. Ljung and Soderstibm, 19<S3). 

On the other hand, one should be careful 
when imposing unreliable knowledge since 
6* ^ Q will generate an asymptotic bias, as is 
illustrated in the previous section. Furthermore, 
the mean squared error of the constrained 
estirrators is less than that of the ML estimator. 
Moreover, this advantage can expect to be 
greater for shorter experiments and when 
non-stationarities or modelling errors play a 
dominant role. These observations are illustrated 
below. 

5.6. Simulation results 

We will briefly discuss a simulation result 
here. More extensive results are in van Aken 
(1987). We consider a scalar, first-order ARX 
process 

yk =apyk-y + + ^k. 

= (32) 

h, = 1 / 10 . 

where is a scalar ‘‘Generalized Binary 

Noise” test signal with unit amplitude and 
non-switching probability /7:=8/l(). Such a 
signal can at each time kT “decide” to remain at 
the actual signal level {u^ - Wa i) or switch to an 
opposite level with probabilities 

Pr (ma = = 

Pr(u* = -u*. ,) = 1 -p. 

0<p < 1. 

Generalized Binary Noise has been shown 
(Tulleken, 1990) to be superior to ordinary 
Binary Noise (PRBS). In particular, it is 


asymptotically (i.e. for large experiments) 
optimal for stable first-order ARMAX processes 
when p equals G+fl^)/2. Since the most 
relevant simulation experiment length is rather 
small, the non-switching probability is selected 
somewhat less than (1-f 9/10)/2 = 0.95. The 
noise ^ is chosen white Gaussian J) of 

which the outcomes arc not measurable The 
variance is chosen such that the signal-to-noise 
ratio is about unity (as much induced signal as 
noise on output) which is often the case under 
realistic conditions in the process industry. 
Candidate models of similar ARX format are 
considered: 

yk= (a^beU). (33) 

The unknown a and b have to be estimated. In 
order to define a GB model class, some crucial 
knowledge is imposed. In this example we 
consider knowledge of open-loop stability and 
hounds on the stationary ^ain § 6/(1 - a)\ 

1/5- ^^^5. 

This results (see (9) and (11)) in a Grey-Box 
model set Q, given by: 

Q 6 E IR I -1 - {i — 1, 

(1 -fl)- 56 .- 25(1 -n)}. 

Note that the process is indeed stable and has 
unity stationary gain, so the true process 
parameter is in Q. Now, using 1(K) Monte-Carlo 
simulation trials in order to scan the dependency 
on the particular data sets realized, the 
stochastic properties of ML (which of course 
equals LS in this case), CML and CMV for this 
particular process are obtained with good 
approximation. The expected values of the 
parameter estimators a, 6 and - d) for 

ML, C^ML and C.'MV are shown in Fig. l()(a)-(c) 
(solid lines). In addition, the successive ap¬ 
proximated 95% probability curves are shown 
using dashed lines. The results are represented 
as a function of the experiment length, expressed 
as a multiple of the 95% process settling time 
(r"'" - [-3/ln 0.9|r - 287). For the process 
industry, the most interesting situation is around 
indices 2, 3 and 4. It can be seen that all three 
estimators asymptotically converge to the correct 
values, indicated with dotted lines. However, for 
short experiments the constrained estimators and 
in particular CMV are more biased than ML. On 
the other hand, the estimator uncertainty (in 
practical terms, the realization dependency) is 
drastically reduced for CML and especially for 
CMV. This improvement is most significant in 
the estimators of 6 and since low-frequency 
characteristics are much better modelled. Note 
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Fig 10 ML, CML and CMV estimators for a, b, gam ^ and respectively 


that ML for short experiments indeed suffers 
from a high probability of selecting non-sense 
models being unstable or having wrong sign of 
static gain. For step response and frequency 
response estimators similar observations can be 
made. Figure 10(d) presents the distribution of 
ML, CML and CMV step response parameter 
estimators for an observation length These 
results suggest that, on average, CMV is much 
more reliable than ML, which can be evidenced 
with theoretical arguments and more compli¬ 
cated simulations as well. Obviously, the 
increased model accuracy has important conse¬ 
quences for the success and robustness of 
controller design. 

5.7. Impact of Grey-Box approach on controller 
design 

Consider the first-order process (32) (with true 
parameters <3^ = 0.9 and ^^^=0.1) and the 
corresponding model (33) (using estimators d 
and (>, taking values d and b). On the basis of 
this identified model we want to design a simple 
fixed, proportional controller law for (32): 

(34) 

It is sufficient for our goal to select the gain ^ 


using conventional pole-placemcnt, depending 
on the identified parameters a and ft, i.e. 

^(d, ft). For notational convenience wc skip 
the arguments if no confusion can arise. The 
model (33) combined with controller (34) will 
yield a closed-loop first-order system having its 
pole d—^b at some desired location A, if 
^:=(fl —A)/ft. Note the extremes of the 
open-loop ('^=0;A = fl) and the minimum- 
variance (dead-beat) case dlb \ k = 0). If the 
model were perfect, the aim would be realized. 
However, in reality the modelling error will 
introduce a more or less deteriorated controller 
performance. The installation of such a control¬ 
ler (34) in combination with the process (32) 
does not even guarantee closed-loop stability, 
since the magnitude of the true closed-loop 
pole Up - %d, b)bp may be greater than one. 
Modern, attractive techniques for robust (e.g. 
^*) controller design have become available 
(e.g. Kwakernaak, 1986; Doyle et ai, 1989, 
Djavdan et ai , 1989) which account for the 
effects of uncertainty in model, process or 
performance requirements. However, these 
methods inevitably lead to conservative and thus 
iftefficient control performance if the uncertainty 
is relatively large. Therefore the efforts to 
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improve upon model identification techniques 
remain essential. Below it i.s illustrated that 
Grey-Box modelling can have considerable 
benefits in terms of improved controller design. 
Since = 0.9 and =0.1, only proportional 
controllers with gain satisfying 

i.e. -1<'^^<19, will stabilize the process. In 
addition, one could ask: when are we performing 
better than open-loop, i.e. when do we decrease 
the asymptotic output variance lim 7>yl with the 

selected controller? Since, using (32) and (34), 
yielding 

hm iyl = cr"/[l - (a^ - '%)i, 

the variance decreases if 1 — -1 - 

i,e. {)<^ />) *- - 18, Note that the 

minimum-variance gain a^Jh^, — *^ is in the 
middle ot this interval, binally, it is relevant to 
see for which sets of diftercntly estimated 
parameters (say a and h) the design generates 
the same gain as when d and /> were being used. 
This IS the case when \^{a,h)-{a k)lb = 
h), yielding equations ot straight lines 
b = (a ~ A)/7/(d, h), which all pass through 
(A, 0). The straight lines shown m Fig. 11 for the 
minimum-variance design case (A = 0) refer to 
the parameters sets yielding closed-loop mar¬ 
ginal stability ('.{!»= - 1 and .0= 19), no variance 
reduction (^.^ = 0 and 18, ‘break-even”) and 
minimum-variance (^.^ — 9). In addition, the 
approximated means and 95%-cqui-density con¬ 
tours ot the ML and GMV estimator ot \d, 


are shown. These results were gained using 
Monte-Carlo simulation where the conditions of 
the previous section applied. Specifically, an 
experiment length of three times the 95% 
settling time (85 samples) was selected with a 
unity signal-to-noise ratio. In the CMV case the 
Grey-Box model set (a, h eU | -1 1; 

(1 - ( 2 ) < 56 ^ 25(1 - fl)}, of which a corner is 
visible in Fig. 11, was used again. Figure 11 
shows the effect of model uncertainty on the 
closed-loop performance. It can be clearly seen 
that the probability of not ending up with an 
unsuccessful or even instable closed-loop process 
(which equals the integral of the density over the 
region between the “break-even” ^S = Q and 
^=18 lines) is considerably lower m the CMV 
case, even when ML-identified models showing 
negative sign of static gain would be skipped in 
advance (which are situated below the horizontal 
axis). This effect is even more pronounced for 
A>(). In that case the lines (of model 
parameters generating equivalent closed-loop 
behaviour) uniformly shift a distance A to the 
right, covering an increasing part of the CMV 
density. 

5-8. Application of CML to the distillation 
column field data 

After the above discussion of theoretical and 
simulation results of Grey-Box estimators, it is 
interesting to see how these methods improve 
the identification of a real process. We will focus 
on the distillation column and associated 
input-output data which was described in Figs 1 
and 2. Specifically, the dynamics from set point 
ot the reflux flow controller 0/^ to the top 
temperature will be investigated; additional 


'I Jf irue parameter 

( 0 ( b^^ak - even vnncjnrp) I (on - 0 9 , bn ~ 0 1 ) 



Fifi. li I’he impiict of Grey-Box icsults on Ihi.' quality of controller design 
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examples are in Peelers (1987). A scalar 
ARMAX(2, 2, 2) model was postulated, i.e. 

yk^(i\yk-\'^a2yk-2 

= byUk-\ + b2Uk-2’^ + Cx^k-\ + 

It was known that the process dynamics were 
open-loop stable having negative stationary gain 
^ (since more reflux cools the top of the 
column), which was certainly above Using 
(10), or Example 3 and (11) for n—2, together 
with the fact that only sound discrete-time poles 
were acceptable (implying fli<0 and cf. 

Theorem 2) the admissible parameter set Q was 
given by 



"fl" 




0 

VI 



, 0<A.<1 



€ R" > 


b2 

1 ± Ai -1- A 2 ^ 0 



— (1 4- Ai -h A2)/2 < -h ^2 “ U 

< 

_C2_ 

J 


Over the full experiment length of 1200 samples 
the transients of three estimators were com¬ 
pared: ML, CML and The last two 

estimators use the prior knowledge that the 
parameters must be in Q, Here CML is the 
“batch” version where for each extended 
experiment length the likelihood is combined 


with a prior uniformly distributed on £2, yielding 
a posterior of which the maximization is giving 
rise to either ML or projected ML results, cf. 
(21). CML'''*' does the same, but uses possibly 
projected, intermediate results for subsequent 
prior definition in the recursive updating. The 
reader may verify that CML'*^*^ is the MAP/rg 
estimator which maximizes, for each experiment 
length, the posterior associated with the 
restricted Gaussian prior (19), where 6^^ is the 
last estimate and 11 the associated variance. In 
Figs 12(a) and (b) the three different estimators 
AI and A^ of AI and A 2 are shown, whereas Fig. 
12 (c) presents the estimated stationary gain 
I +/? 2 )/(l + + ^^ 2 )- Figure 12(d) indi¬ 

cates that the CML"**^ estimator had to perform a 
huge number of projections towards the edge of 
Q (FLAG = I) during the recursive identification 
process because of detected model inconsis¬ 
tencies. Some of them are apparent from Fig. 
12, c.g. most of the initial 100 ML estimates of 
were (way) out of bounds. A more serious, 
persistent deficiency is shown in Fig. 12(b), since 
the ML estimate of — a^ is strictly positive. This 
reflects a physically pathological situation, with 
one of the two estimated poles being negative. 
Since the Grey-Box set Q excludes such models, 
CML had to correct always and CML"*^ often 
(Fig. 12(d)) for that tendency In the CML case 
this results in this simple example m subsequent 
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Fig. 12. ML, CML and CML*^®*^ e.stiniators for reflux to top temperature model 
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pro]eclion onto the relevant constraint 
whereas is penetrating the acceptable 

region, with a tendency towards this edge 
EflectiveH this implies that in this case no 
second-order AR dynamics could be properly 
htted bcc ause of overparametrization As can be 
seen from the CML estimates of -d, (which 
equals ^he discrete-time pole of the resulting 
first-order model) the physical relevance of the 
AR dynamics is enhanced considerably the 
associated lime constant of the underlying 
contmuous-tiriK model is about 20 samples 
instead ol about two in the ML case 

Amongst other things this example clearly 
shows how model reduction of too complicated 
models can be realized in a natural fashion by 
exploiting the physical information Additional 
experiments (see also Peeters (1987)) have 
confirmed that the Circy Box approach is 
capable of enhancing a number of characlcnstics 
of the model in practual cases considerably 
hssential features such as stability sign of 
stationary gains and occurrence ol sound poles 
arc guaranteed with improved rale of 
convergence 

6 ( ()N( 1 DSIONS 

In this paper Grey Box statistical estimation 
is discussed where a linearly parametrized 
dynamic regression model is identihcd which 
explains the data best but is also consistent 
with S()ecified physical knowledge It is shown 
that important physical knowledge such as 
stability (or even an admissible area tor pole 
location) as well as signs or bounds on steady 
gams can be translated into inequalities that arc 
linear in the model parameters Using Bayesian 
inference Constrained Maximum and Average 
A Posteriori estimators are introduced Explicit 
solutions for the special case of Ciaussian 
likelihood and uniform as well a^ plurxform 
(piece-wise linear with uniform component) 
prior are presented, resulting in a CML, ( MV as 
well as MAP/plu estimator with promising 
characteristics Theoretical considerations, a 
simulation result and a practical application to a 
distillation process show the advantage of the 
constrained estimators under non-ideal, realistic 
experiment conditions Considerable variance 
reductions at the cost of a somewhat larger bias 
can be achieved, indicating the potential for 
advanced control applications where the availa¬ 
bility of an accurate model is essential It may 
imply that, e g an adaptive, automated design 
will come more into reach 
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An Integrated Collision Prediction and 
Avoidance Scheme for Mobile Robots in 
Non-stationary Environments* 

K. J. KYRIAKOPOULOSt and G N. SARIDISJ 

A scheme integrating simple and fast collision prediction and a reactive type 
{Jeedback solution) of avoidance for mobile robots in environments 
containing moving obstacles is computationally efficient and appropriate for 
real time implementation. 


Key Words —Mobile robots, motion pUnning, collision prediction, collision uvnidtincc, fWtentiLil 
functions, Lyapunov stability 


Abstract -A formulation that makes possible the micgraiion 
of collision prediction and avoidcincc stages for mobile robots 
moving m general terrains containing moving obstacles is 
presented A dynamic model ol the mobile robot and the 
dynamic constraints are derived Collision avoidance is 
guaranteed if the distance between the robot and a moving 
obstacle is nonzero A nominal trajectory is assumed be 
knowr from off-line planning The mam idea is to change the 
velocity along the nominal trajectory so that collisions arc 
avoided A lecdback control is developed and local 
asymptotic stability is proved d the velocity ol the moving 
obstacle is bounded Furthermore, a solution to the problem 
of inverse dynamics for the mobile robot is given Simulation 
results verify the value of the proposed strategy 

1 INlRODUCniON 

Thl probllm of efficiently planning the motion 
of a mobile robot in an environment containing 
moving obstacles (Fig. 1) is difficult and 
computationally intensive. Although it has been 
stated as early as 1984 (Freund and Hoyer, ’984) 
ad hoc solutions were given (Liu et al. (1989); 
Tournassoud (1986)). Reif and Sharir (1985) 
gave an algorithmic solution to the problem but 
they were restricted to some categories ol shapes 
of objects and their approach is not suitable for 
an on-line implementation. Search based ap¬ 
proaches for solving the above problem have 
also been presented in Fujimura and Samet 
(1990) and Shih et al (1990). On the other hand, 

* Received 1 October 1991, revised 11 April 1992; received 
in final form 9 June 1992 The original version of this paper 
was not presented at any IFAC meeting This paper was 
recommended for publication in revised form by Editor A 
P Sage. 
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Troy, NY 121H0-3590, U S.A. 

t NASA Center of Intelligent Robotic Systems for Space 
Exploration, Rensselaer Polytechnic Institute, Troy, NY 
12180-3590, U S.A 


Kant and Zucker (1984, 1986, 1988) iised the 
decomposition of the motion planning problem 
to the find-path, and move-along-path problems. 
Their proposal is that the avoidance of moving 
obstacles can be done by adjusting the speed 
along the geometric path. The same approach 
was adopted in Wu and Jou (1988) and in 
Griswold and Ecin (1990). The basic idea of this 
approach is utilized in this work. 

To facilitate a fast solution a hierarchical 
decomposition has been proposed Kant and 
Zucker (1988) and adopted and extended m our 
work. The problem is divided as follows. 

• Off-line path and motion planning. 

• On-line motion replanning. 

In the off-line stage, two problems are solved; 
First, path planning, the 'Tind path" problem, 
i.e. the search for a connected curve r(.y) = 
|jr(.y)y( 3 )z(A)]^ on the terrain described by a 
surface ^(jc, v, 2 ) = 0; v is the trajectory para- 
metrization variable (e.g. path length). The path 
should connect the initial and taiget points 
without colliding with the stationary objects 
while satisfying certain criteria. Second, motion 
planning, that is to find a “nominal" motion 
function 3„(/) along this path that does not 
violate the kinematic and dynamic constraints of 
the robot, and some performance criterion (e.g. 
time) is minimized. 

The subject of this paper is the development 
of an algorithm for the on-line stage. This 
algorithm has to act in a supervisory mode 
during motion execution and make sure that the 
robot is going to move from its current state 
(position velocity Uo) to the target one 
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Fig 1- Environment with a mobile robot and multiple moving obstacles 


(position Sfy velocity v^) avoiding collisions with 
those moving obstacles with which collision is 
predicted based on sensory input. The basic idea 
is to alter the velocity along the path r(5) 
without changing its geometry. This is useful 
only if the following fundamental assumption is 
satisfied; 

Temporary obstruction assumption. The mobile 
robot moving along path r{s) can only be 
obstructed during a bounded amount of time, 
i,e, the moving object is assumed not to 
permanently stay on, or move parallel to r(5). 

The new plan must satisfy the dynamic 
constraints and stay as close as possible to the 
nominal plan. 

In our prior developments the objects were 
modeled as convex polyhedra and based on this 
assumption an approach to predict collisions 
(Kyriakopoulos and Saridis (1990a, 1992a)) was 
developed. For the case of a planar terrains a 
real time collision avoidance scheme, the 
Minimum Interference Strategy (MIS) and the 
Optimal Control Strategy (OCS) (Kyriakopoulos 
and Saridis (1990b, 1991, 1992b)), have been 
proposed. There, in addition to collision 
avoidance, time consistency with the nominal 
plan was sought. 

In this paper the potential fields strategy (PFS) 
is presented. Potential fields were originally 
introduced by Khatib (1985) and furthermore 


investigated by Rimon and Koditschek (1989) 
and applied for mobile robots by Kant and 
Zucker. It is a computationally inexpensive but 
local method providing only collision avoidance 
capabilities while performance is not guaranteed. 

Here, a reformulation of the potential fields 
approach is attempted, so that performance as 
well as collision avoidance are sought without 
significantly increasing the computational load. 
Performance is expressed in terms of proximity 
of the final time of the new plan to the final time 
of the off-line plan. Performance is achieved by 
the following. 

• Introducing a simple collision prediction stage 
in order to determine if action should be 
taken. 

• Utilizing the information of the nominal plan. 

In Section 2 the definitions, modeling and the 
mathematical problem arc stated. The theoreti¬ 
cal analysis and the presentation of the Potential 
Fields Strategy (PFS) is done in Section 3. 
Finally, in Sections 4 and 5 simulation results 
and suggestions for future research are pre¬ 
sented, respectively, 

2 PROBLEM FORMULATION 
The mobile robot is a kinematic mechanism 
composed of the body and the rolling wheels. Its 
kinematics and dynamics can be modeled based 
on the necessary assumption that the wheels are 
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ideally rolling The conditions to achieve rolling 
vi^ithoul slipping and skidding aic presented 

2.1 Kinematic modelinf’ 

Although the development of Alexander and 
Maddocks (198M) is based on the assumption of 
planar motion theii analysis holds for the case of 
general terrains with the additional assumption 
that the curvature of the floor is such that 
sufficient contact of the wheels is guaranteed. 
Consider the top view of a mobile robot (Fig. 2) 
moving along a trajectory described in a 
parametric form by r(s) where s is the 
parametrization variable. The world coordinate 
frame {p,,Py,p,) with unit vectors (r,/, A) is 
fixed at O. The body coordinate frame 
(p^^Py>Pz) is attached to the body at point B 
and moves along r{s). 

The motion of the robot is described by r(^) 
and the orientation angle function 0(.v), defined 
as the angle between p, and pf. The instant 
translational velocity of the robot with respect to 
the world coordinate frame is n(t) and its 
rotational velocity is a)(t) = The signed 

angle between i}(t) and p^ is /I 

Two wheels are located at jc,, Xj w.r.t. the 
world coordinate frame and at xf, xf w.r.t. the 
body coordinate frame, a,, are the signed 
angles between x^, xf and p'\ respectively. The 
angles and between their axis of rotation 
and pf, are called steering angles. Vectors 0, 
and 6j represent the angular velocity of the 
rolling wheels. 

The centers c, and of planar rotation. 


corresponding to each wheel are: 

r, -X, ~h . 

(D 


where r is the radius of the wheel. In order to 
have ideal rolling of the robot, the centers r,, c) 
must be identical, and therefore 

u>(x, - X,) = r(i), - (),). ( 1 ) 

1'his is the rolling compatibility condition that 
guarantees rolling without sliding. In Appendix 
A the inverse kinematics are presented based on 
the development of Alexander and Maddocks 
(1989). 

2.2. Velocity bounds 

To avoid skidding, the centrifugal forces 
should not saturate the available friction 
between the wheels and the floor. In order to 
satisfy this condition, a bound tin the 

velocity must be set at each point j. A vehicle 
with wheels moving along path r{s)~ 
[x(j)yCv)z(.v)]^ t on a terrain described by the 
surface g(x, y, 2 )=() is considered. If the 
assumption that g{x, y, z) is reasonably smooth 
is made, the contact points of wheels are planar. 
A coordinate frame is attached to the 

center of mass G of the vehicle (see Fig. 3). This 
is essentially the same with the frame p^, pf, pf 

iThr path will be sometimes denoted as r(Aj = 
|jr(A )y(z)7(j and sometimes f(.v) = jr(.j)i f y{s)} -t- z{s)k. 
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FiCi 3 Forces acting on a mobile robot moving on general tcriain 


of the previous section assuming B = G and 
fi = 0. If p" is tangent to r(s), p‘.' is normal to the 
plane tangent to the surface of the floor, and 
is normal to both then the unit vectors 
corresponding to (p'^, p", p") are 

^ k" X ?’ k^'(s) = 3- 

( 2 ) 

Since coordinate frame p',', p", p” moves w.r.t 
time 


d? 


di 




^ “Til V 'J* . - 

-= W X ( =>-.V = W X I . 

dr ds' 

-^ = (oxf^-^s = o)xj\ 
dr (Is- 

d^" - ro^d^V , 

-= a> X — T = aj X fc 

dr ds 


(3) 


where di is the vector of angular velocity. If 
equations (3) are solved! then S) has the form 

S) = Ci - s. (4) 


t This set of equations is not actually ovcrdelermincd This 
IS because the z(i) of vector r(r) can be determined by x(i), 
yb) and g(x,y, r) = 0 


Similarly, 


270 


d^i 

dr^ 


ddi d?’ d“i'' , 

V^ p.’ _J_r-^ 


ds ds' 


= oi X (a) X + ^ iS' H- 

dt 

do) 

= w X (w X i ) -♦- X i , 


d; ^ drJ) d^'' 

_ dci^ 

= w X (a; x/^) -h — X f\ 
dt 

do> d/f" d^)?’ 

j = 0) X (fti X k )+—Xk — S + J 

' 'fit tb' 


dr" 


dr 

-n d(J) -n 

= a>x (u) X it") + -— X A:", 

dr 


(5) 


and the angular acceleration has the form 

d = ^ = /42(.r)j' + /4|(,s)s'^. (6) 

Unit vector c lies on the (p", p'j) plane and 
points towards the center of curvature or r(j). 

d^r/dr^ 


lld^r/ds^" 


( 7 ) 
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The acting forces on the vehicle are; the 
weight w = mgk, the reaction forces ^ = 
i — 1, - . . , n,^, from the floor to the wheels and 
the friction forces f, =fj> + fj\ , = i 

between the floor and the wheels. When the 


robot moves with a speed .v and acceleration i a 
centripetal force 

F, = m.fy(i)c, 

(8) 

an inertial force 


F, = mx'i", 

('^) 

and an inertial torque of the form 


Mt, = la + d> X (lib) = M^ix)s + Mt(x)x^, 

(10) 


should be exerted. (/ is the inertia matrix of the 
mobile robot around its center of gravity.) 

The motion equations of the vehicle are 

^ + /"i) + + f'i + T/ = 0, (11) 

l(f. + F,)x{x:'-lk') + M„ = {), ( 12 ) 

I 1 

where I is the height difference between the 
contact points of the wheels and the center of 
mass (j. 

The slide constraints for the wheels are: 

\f.\ = VCf:y^JU?^riF„ ( = 1-(13) 

/<-0. /=l,.,../i„,. (14) 

where rj is the friction coetficicnt between the 
wheels and the floor. 

The actuator constraints are 

- ^ W, sin (p'! + cos , 

/ = 1 .( 15 ) 

where (-.'ft. are the brake and accelerating 
bounds of the actuators, respectively. 

The maximum velocity so that skidding is 
avoided is given by the mathematical program 

^skiii(v) = max {i’V(l l)-(f5) are satisfied}. (16) 
If a vector X of unknowns is formulated, where 

then mathematical program (16) has linear 
objective and constraints, except (13) which is 
convex though. 

2.3. Dynamic modeling 
The simplying assumptions made to obtain the 
dynamic model of the mobile robot are: 

• No slipping: rolling compatibility conditions 
are satisfied. 


• No skidding: kCi-)! s 

• The rotational kinetic energy of the rotating 
wheels is not considered. 

• Frictionless motion (w.r.t. the robot’s kinema¬ 
tic mechanism). 

The kinetic and potential energies of the 
moving robot are: 

K = lib) + \ms', (18) 

P^mgz(s). (19) 

where 1 is the moment of inertia of the robot 
around C, m is its mass, g is the gravity vector 
and 2 (.r) is the height of the center of gravity 
w.r.t. a world coordinate frame. 

Since («(.v) = j/(i)c, where /(.v) is the curva¬ 
ture at X, the kinetic energy becomes 

K[x. i) = i/. (i)/%v)i- + (20) 

where I,{s)={c,Ic) is the reflected inertia 
matrix. The Lagrangian is 

Lix, x) = K-P = \l,{x)f\x)x' 

+ {mx^ - mgz(x). 

The Huler-Lagrange equations, assuming no 
nonconservative forces, such as friction, are: 

d 

dt 9x dx 

where u is the steering force of the robot tangent 
to r(.i) at every instant. Using (20), the dynamic 
equation is 

(m + /,(i)/-(.v(r))).v(t) mx)fHs) + J.(s) 

Xf(s(0)f'(x(t)))xi0 -I- mgz(x) = m(0. (21) 

where (•)'= d( )/d.s. An equivalent useful 
formulation may be obtained using parameter s 
as an independent variable instead of time t. 
This is accomplished by introducing 
ii(.s) = ds/dr(A). Therefore 

(m + /,(i)/^(,v))i;(.v)i;'(i’) + (i/:(.f)/^(s) -I- Us) 

+ mgz(s) = u(s). (22) 

The steering force u is the sum of the 
projections of the n^. actuator forces on the 
tangent of r(.y) 

m( 0 = u(A’(t)) = 2 M, sin (0;’(s(O))- (23) 

I 

The Py components of u,s arc absorbed by 
friction. Usually, the steering angles ip]’ of the 
wheels have a limited range of deviation around 
90“. Therefore 

c,slsin((/);V(0))l^l. 


(24) 
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where Ci is a positive constant. From (15), (23) 
and (24) 

t;,, 

where 

t7, = c,nH.^,, 

U2 = ctn^9^2- 

This is an acceptable approximation of the input 
space because it is on the safe side. 


(25) 

(26) 


2.4. Collision avoidance constraints 

Collision avoidance is guaranteed if the 
distance d{s, t) (Gilbert and Johnson, 1985) 
between the robot and the object is greater than 
a safety positive constant d^\ i.e, 

t/(.y, t) = min (||z, - z,\\: z, e C^x), 

t.l 

z,^C\it))^d'\ (27) 

where 

C,(i) = {x/x e 9 A,R;'{s)x s h, 

-ArR-'{s)Us),Are';n'”^ \ (2K) 

C’„(0 = {y/y 3AoR,','{i)y 

A,R: >1.. e !H'"\ b„ € Hi'}. (29) 

are convex polyhedra representing the convex 
hulls of the mobile robot and the moving 
obstacle, respectively. {Ar, h^) and (>t„, b„) are 
the parameters that define the convex poly¬ 
hedron description of the robot and the object, 
respectively, with respect to their fixed coordin¬ 
ate frame. J?,, 7^, 1], represent the rotation 

and translation of the frames of the robot and 
the object with respect to the world frame and 
must be known in order to compute d{s, t), 
Rr{s)» T^s) can be easily derived from r(A). 
/?o(f), 7i(r) are not well known and are 

estimated based on sensory input. Distance 
d{s, t) can be computed by a mathematical 
program of the form; 

£/(/) = min ||jc ->'1U 

Jc.y 

.v.M,(0^(f)<M0- (30) 

A„{t)y{t) < h„(f). 

Normally one would choose the Euclidean 
distance (a = 2) to represent the distance, and 
have to solve a quadratic programming type of 
problem. However \f a = \ or » then a linear 
programming type of problem can be formulated 
and gain in terms of computational efficiency. 


2.5. Mathematical problem statement 
The dynamic equations and the constraints are 
summarized as follows: 


System (equation (22)) 

x(t) = A{x{t)) -f B{x{t))u(s), (31) 

where x{t) = [A(r)u(0l^ with i^(t) = s{() 


A(x) = 


v{t) 

\i:is)fis)-^if{s)f'{s) 
m + 4(A)/%s) 


(32) 


and 


B(x) = 


0 

I_ 

m + l,ix)f-{s) 


(33) 


Initial-final conditions 

J:(^y<)) = l0 V,,)', Jc(.s 7 ) = [frce u,|'. (34) 

Input constraint (equation (25)) 

(35) 

State constraint (equation (16)) 

^ »-’sk,d(-i’). (36) 

Collision avoidance constraint (equation (27)) 

/)-(). (37) 

The final equality constraint (34) can be imposed 
using inequalities Considei as upper and lower 
velocity bounds the two velocity arcs starling 
from (.sy, ly) and going backwards with input 
-IJ 2 . (h uribl they meet (l^ku^(^)» 
respectively. Thus iyn.,x(0 and lyrnnCO are 
constructed. This is demonstrated in Fig. 4. 
Thus, a new set of inequalities 

i)^v„,Js)^\v(s)\-:~v„,M (3K) 

are used to satisfy (34) and (36). 

The problem is to find a feedback control of 
the form w(.i>, l^ f) such that (35), (37), and (38) 
are satisfied. 


3 THE POTENTIAL FIELDS STRAIEGY (PFS) 
The potential fields approach does not 
generically address the performance i.ssue. 
Stability, in the sen.se of collision avoidance and 
convergence to the goal slate, is the main i.s.sue. 
Furthermore, being a local method, it is 
computationally efficient but on the other hand it 
does not handle constraints well. The main 
problem with a naive application of this 
approach is the existence of local minima in the 
overall potential field. A considerable research 
effort has been done Rimon and Koditschek 
(1989) on constructing globally converging fields, 
but this has been achieved only for special 
categories of shapes of objects (e.g. ‘star-like”). 
The above issues become more complicated if 
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the objects move and the system becomes 
non-autonomous. 

In the problem at hands, the application of 
potential fields is done in the (s, r) space since 
the problem has been reduced to a unidimen¬ 
sional one by considering motions only along the 
off-line trajectory r(A). A Lyapunov function 
approach was adopted, in order to aim tor both 
stability and performance, two generic states are 
considered for the case of having only one 
moving object: 

• Alarming state. Collision is possible unless 
action is taken. Ihe inverse of the distance 
between the robot and the moving obstacle is 
penalized. The attractive goal is slate {s - 

Sf, V = Vf - 0 ). 

• Non-alarming slate. The main task is to track 
the nominal plan. Therefore the attractive 
state is i^,(0 

This decomposition dcmi^iiStrates the necessity 
of development of a fast but generic scheme that 
classifies the current state of the system as 
alarming or not. 

3.1. Fast collision prediction—the alarm function 
In our previous developments the Minimum 
Interference Strategy (MIS) and the Optimal 
Control Strategy (OC) Kyriakopoulos and 
Saridis (1990b, 1991, 1992b), have been prop¬ 
osed. Those are global methods and require the 
accurate knowledge of parameters .such as 
collision time (tj, collision point (.s\), etc. In the 
case of PFS, collision prediction is only required 
to test if the local configurations and velocities of 
the mobile robot and the moving obstacle may 
lead to a collision. 


Consider Fig. 5. A mobile robot is moving 
along its Cartesian path and has an instan¬ 
taneous translational velocity Vf^ - i\ + 
while a moving obstacle is detected to have an 
instantaneous translational vclocilyt H- 

Configuration (a) is considered as alarming 
because the shaded area (volume) which is the 
iiueiaction of the swept areas (volumes) of the 
polyhedra along their instantaneous translational 
motion lines lie on the side of positive direction 
of both velocity vectors. Configuration (b) is 
considered as non-alarming because the shaded 
area lies on the negative side for velocity vector 
i\,. A malhemalical methodology to construct an 
alarm function FJ.\, t) indicating the ‘degree of 
alarm” of a configuration is presented below. 

C onsider the convex polyhedral descriptions 
of the mobile robot and the moving obstacle, 
respectively: 

(39) 

(40) 

X,, x„ e Notice that {A,, If) are functions of 
.V, while (A„, /?„) ^tre functions of t. The swept 
areas (volumes) of the mobile robot and the 
moving obstacle under a horizontal translation 
can be parametrized by 

x'r-X, (41) 

x'„ = x„ + nv„, (42) 

where x^, x„ satisfy (39) and (40), respectively. 
Their intersection (shaded area of Figs 5a and b) 

i Obviously I'v, »rc the hon/.ontal components of the 
velocities of the mobile robot und the moving obstacle, 
respectively. 
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Fig 5 (a) Alarming configuration (b) Non-alarmmg configuration 


is given by 




(43) 


Introducing (43) to (39) and (40) the description 
of the shaded area 5 is obtained 


5 = {^6 5H'’3/I.C<B}, 

(44) 

where 

where 

^lA, 0 A.V, 

L 0 A„ -A„v„ A„v, J 

1 

_1 

» 

3.2 F, 


LaJ 


The 


(45) 


The configuration is alarming only if; 

^max - max A > 0, 

(46) 





where 

and 

/^max - max u > 0. 

(47) 


A'e.S 

Therefore, in order to determine fx 

max, two 

«(*) 


simple linear programs should be solved. Notice, 
that Ajnaxj functious of (.y, r) because 

they depend on A^s), B^is), 

In addition to the above, if 




alariTi i 


(48) 


or 


Mmax ^ ^alarm? (^^) 

where Kurmy f^aiarm are large positive constants, 
then the configuration should not be considered 


as alarming. 

Thus the alarm function is defined as 

0 0)^N(^alarm ^niax(‘^» 0) 

^ ^y(Mmax(‘^> 0)^-^ ( jliirm A^ maxi"'» 0). (50) 


WO).)’ "'‘"7*"" ,, (51) 

U) non-alarming state 


--^0. 

d^ ’ 


(52) 


dll 

jt((s) — =-a(s)vHt) +uit), (53) 
at 


M{s)^ m + I,{s)f%s), 


1 (iAlis) 


Consider the proposed Lyapunov function 
candidate 

Y{t, s, V) = iM{s){v - v„{,t)f + i^r(^ - ■‘^(O)". 

(54) 

where j„(0. ^niO is the nominal plan, and 
Kf, > 0. The idea behind this function is to try to 
lead the system as close as possible to the 
nominal plan since there is not any alarm. 
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In general, r(f, n) o (s, u) = yj. 
Furthermore, ^(f, a, v) is continuously 
differentiable and bounded from below by a 
decrescent function. Therefore it is a valid 
Lyapunov function candidate. Its time derivative 


dv dv ar ay 

—— _ —— f —— y -I- y 

dt dt 3s dv 

and if (54) is substituted, then 
AV 

— = (u - u„)l - .v„) - a(.s)i'i)„ 

- Jl{s)ij„ -t-«]. 
For 

u(f, .s, u) = -Kp{s - sjt)) - A„(i; - u„(r)) 

+ a(j)uu„(t) + M(s)i)„. 

(K„ >0) equation (56) becomes 

dr 


result (see Appendix B) 

lini — v„(r)) = 0. 

Thus, from (53) and (57) we deduce that 
hm (i(r)-.v„(f))=^(). 

3.3. = 1; Alarming state 

In this case, both collision avoidance and 
convergence to the final state {Sf, Vf =0) should 
be satisfied. Therefore the proposed Lyapunov 
function candidate is 

r(r, A, v) = \M{s)\r + T cK,^^s^^ 

-f \cM{s) Ai'iJ T G(d{s, r)), (60) 

where tSs — is- Sj), Y{t, s, v) will 

be everywhere positive if 

K„ H K„ + 

{]<r c < -^^ , 

M M 


LJsing a well-known result by Hale (1980) (see 
Appendix B) and the fact that .v,,(r), v^,(r) are 
uniformly bounded in r, 

lim (ii(t) - ii„(t)) =0. 


where Afo ^ M(s) ^ M, and 

I) + 

, (d-D)+ (/) + </„) In-- ^ (l-^D 

G(fl) = V 

0 d> D. 


Notice that 

.r„(t)^i>, i)„(0 = iV,(0 ^ ‘ 

where T is the motion time of the nominal plan. 
Since u(t, s, n) is continuous and uniformly 
bounded in t then n, n arc continuous and 
uniformly bounded, and using a well-known 


G'{d) = ] ~~dTd~ (63) 

0 d > D, 

G{d)^i), (Fig- 6) continuously differentiable, 



Flo. 6 Structure oi Cid). 
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and decreasing (G'(d) ^0) w.r.t. its argument d. 
D is the largest distance for which we are 
concerned, and do is appropriately chosen to 
define G(0). The idea behind such a construction 
of V{t, s, i;) is to try to lead the state of the 
system to this of the desired state, and at the 
same time to penalize proximity with the moving 
obstacle when the configuration of the system 
may lead to a collision. s, u) is continuous 
and bounded from below by a decrescent 
function. Therefore it is a valid Lyapunov 
function candidate. 

In the following analysis ddids, dd/dt are 
required. Distance function d{s, t) is well known 
to be continuous but not continuously 
differentiable Gilbert and Johnson (1985). A 
way to bypass this difficulty is to predict the 
“singularity” points where the derivative is 
discontinuous based on the estimate of the 
motion vector of the moving obstacles and 
interpolate locally with a signoid function. This 
approach was used in the case study and did not 
create any numerical problems. Thus, the 
notation ddids, dd/dt is going to be used rather 
loosely here. 

dr dd 

— = G'(d) — -f a{sy + {K^ + cK„) 

dt dt 

dd 

+ ca(s) Asv^ + ^J((s)v^ + G'{d) — + 

as 

+ \M(s) Asi) + (i> + Aj')j«(5)u = G'(d) ^ 

at 

+ a(s)v^ + (Kp + cK„) Asif + cfl(5) Asv" 
dd 

+ iJ((s)v^ + G'(d) — + M(s)vv 
ds 

+ Asii + (u + if As)(-av^ + v). (64) 

If a control 


then 


{f 




( 68 ) 


It can be easily tested that for /C,,, 
constraint (67) is more constraining than (61). 
Similarly, 


= iK(s)f^(s) + Us)f(s)f'(s)^an 

Q j 

= 1/' ,r f ft _r ^ 

maji ' *tm„,yniaxy maxi *^^0 — ^ ^^0 


= I + ‘5r/mux, (69) 

where I^(s) is the reflected moment of inertia of 
the mobile robot at point jt, = max^/, (i), 
/;„,^ = max> y;(5), f„,.,n-fnaxj(s), /L,-max, 
f'{s) and 6r the diameter of the robots convex 
hull. 

The required analysis is very difihcult because 
the system is nonautonomous, and the distance 
function is not explicitly known as a function of 
{s, t). A local analysis assuming that 


~ Sf^As ^ As, < 0, 
|u|^Wn.a.-max v„^,(i), 


(70) 


will be performed. 

dd 

(A) Stationary obstacle. ~ = 0. 
In this case 


dV 

d7 


\cK„ 


A.k- + 2K,. 



^-G'{d)y^As-a(.s)v\ (71) 


u{t, s, V) = -fl(j)u2 - Kp As - 2K„v - G'(d) ^ , 

ds 


(65) 


is chosen, then 


dr_ 

(cK„ . / „ 

cM\ ,) 


—^ As + ( 2K„ 

-i' 

dr “ 

(2 \ ' 

2 / i 


c dd , ad 

-~G'(d)—As-a(s)v +G'{d) —. (66) 

Z ds dt 


From Lasalles' invariance principle, is 

c dd 1 

-:^G'{d) — As - a{s)v =o|, 


(72) 


then the state of the system asymptotically goes 
to the largest invariant set of W, denoted as 
{W}, i.e. (Sf, Vf) satisfying 


Notice that if 


c ^ 


4A:., , 4K„ 

Kp + M ' K„ + jU’ 


(67) 


u, = 0, 

K,is,-Sf) = -Gidis,)) 


ds 


(73) 
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Obviously, the equilibrium point (r,, v^) e { W} 
depends on the initial state. For example if 

and then 

(B) Moving obstacle. 


The analysis of this case is very difficult for the 
general case and can be only local Khalil and 
Kokotovic (1991). The following analysis is 
based on the assumption that the velocity of the 
moving obstacle is bounded. Thus, 


then 


3d 

-y <_< y 

~ 3t' ' 


G'(0)F. 


(74) 

(75) 


The requirement that 

c , dd , dd 

--G'{d) — ^s-a(s)v' + G'(d)—<l], (76) 

2 os dt 


is satisfied if 


s max V (r, v, J) = - ^ As;’ - ^ <'/''(0)5;,iy 

+ -G-'(())V<(). (77) 

which is true for 


K.. 


«ni'l«-r;'(())F 


2 Aif 


(78) 


and 

K„ 


(ti„ul, G'(0)K)M-4A:„G'(0)5 „.s^ 
2K„ Ar? - fl„nLx + C’(0)V 


(79) 


as it is shown in Appendix C. 

Depending upon the assumptions for dd/dt, 
several conclusions about the stability properties 
can be drawn from inequality (76) ba.sed on the 
development of Khalil a’ld Kokotovic (1991). 
For example, if in addition to the boundedness 
of dd/dl we have ddldt-^i) as /^O then we 
can conclude that (.v(0> iKO)"^ (■^i - *'<■) e { W'"}. 

The algorithm for the Potential Fields Strategy 
is: 

Algorithm for PFS. 

Step 1. Collision prediction 
Solve (46) and (47) for A miiX’ 
respectively; 

Find F„{s, t) from (50); 
if f„(i, f) = 1 G0T0 2; 
else (F„(s, t) = GOTO 3. 

Step 2. Collison avoidance 
Find minimum distance d(s, f); 


Find u{t, s, v) from (65); 

GOTOl. 

Step 3. Tracking .stage 
Find u(t, s, v) from (57); 

GOTO 1. 

3.4. Implementation issues 

The control input u(l, s, u) obtained from (57) 
or (65) may violate the limits imposed by (25), 
t'mttx(‘S) and Un„„(5). This can be avoided by 
imposing 

u{t, s, v) : = 

[max {Ui. u(i, s, u), M(s)v,„^,(s) x n;„„(j:) 

-i a(?)>-’Lx(0}. M(t, s. i')=?0, 

|min{-fy 2 . u{t, s. v), M(s)v^,„(s) x i»;„,„(j) 

+ n(‘0»^mm(j'’)}. u(t.s,v):£0. (80) 

Obviously, the system may approach the limits 
of the feasible space but it will never violate it. 

If tJ(s, r) becomes very small then the input w 
becomes very large. However in reality 
u — U,, indicating that collision avoidance is not 
always guaranteed because of the limited 
capacity of the actuators. 

After those practical constraints are con¬ 
sidered, stability analysis becomes a formidable 
task. However, the stability for the unbounded 
case shows that the proposed strategy: (a) tries 
to classify the moving obstacles as disrupting or 
not, (b) tends to avoid the disrupting ones, and 
finally (c) when avoidance has been achieved, 
tracking of the nominal plan is its main purpose. 
The simulation results of the next section 
demonstrate those stages. 

3.5. inverse dynamics 

The control force u given by equation (80) is 
actually the sum of the projections of the 
individual control forces w,, given their linear 
combination u, is by solving the following 
nonlinear programming problem 

max mm r/'Ff - (/^;,+/;,) 

I I /I. 

s.l equation (I l)-( 15) (81) 

X u, .sin (0"(.v(O)) == M (given), 

r I 

that can be reformulated as 
max w 

A' 

+/?„). / = !,.. 

w > 0 (82) 

equation (11)-(15) 

S M, X .sin (.^';(.v(f))) 

j- I 

= u (given in equation (80)), 
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where X = ^ F\ - ■ ■ F»Js^w]. 

The idea behind this optimization is to try to 
allocate the forces u, in a such a way so that 
saturation of the available friction at each wheel 
is avoided. 

4 SIMULATION RESULTS 
In this section a case study is presented. A 
mobile robot and a moving obstacle with 
geometric shapes, moving in the same environ¬ 
ment (Fig. 7). The shape of both the mobile 
robot and the moving obstacle is rectangle 
with dimensions 0.3 m x 0.52 m and 0.28 m x 
0,28 m, respectively. The scenario is that 
when the mobile robot is about to start 
moving, an obstacle with kinematic param¬ 
eters xo = 4.0m, y() = 7.0m, = 0.04 m sec“‘, 

Vy = -0.075 m sec"', = 0.004 m sec"^, = 
-0.041 m see"", is going to collide with it under 
the current plan. 

The mobile robot has parameters: 

Mass (Af) :60 kg. 

Inertia :32 kg m^. 

Maximum accelerating force (t/i):140N. 
Minimum decelerating force (t/ 2 ): -60 N. 
Maximum velocity : 8 m sec"^. 

Wheel-floor friction coefficient (f;):~0,12. 

It has the task of going from configuration A to 
configuration B within 7 = 12.1753 sec. An 
oflline path planning stage is done and a path 
r{s) with total length —14.60m is 


computed. The parameters of r{s) are indicated 
on Fig. 7. 

Initial and final velocities are zero {v^ = Vb = 

0 ). 

The resulting velocity profile from the 
Potential Fields Strategy (PFS) is plotted with 
solid line on Fig. 8. The dotted (. . .) line is the 

velocity of the nominal plan. Line (-) 

is the maximum velocity Uniax(J^) along r{s). The 
noisy components of the velocity profile results 
from the fact that numerical differentiation was 
actually done to calculate the distance 
derivatives. 

The time functions for PFS (fpfsC^)) and the 
nominal plant (tnomC^)) are plotted on Fig. 9. 
The dotted (. . .) line is the velocity of the 
nominal plan. The final motion time of the 
nominal plan is T = = 12.1753 sec, while 

7pf,= 11.5008 sec. 


5 CONCLUSIONS 

PFS has the numerical advantages of a local 
method. From (57) and (65) the fact that 
computation load is introduced only from the 
calculation of terms f„(A, t), d{s, t) becomes 
obvious. For F^(s, r), it was shown in (46), (47) 
and (50) that two linear programs of six variables 
(eight for three dimensions) and + 
constraints (where m,, the number of the 
sides of the polygons of robot and obstacle, 
respectively) must be solved. For d(s, t) as 



FiCi 7 Environment with a mobile robot and a moving obstacle 
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U 

> 


s.distance along r(s) (m) 

F ill 8 Velocity profile Irom PFS (solid) 



discussed in vSection 2.4, it HH, or 1| ||, norms 
are used then a linear program should be solved. 
Obviously the solution of linear programs of 
such a small number of variables and constraints 
does nol impose any real-time computational 
constraint. Therefore this feedback control ran 
be easily recomputed at every sampling interval 
without any problem. 

Currently our efforts are focused at the 
implementation and comparison of the collision 
avoidance schemes developed in Kynakopoulos 


(1991). It is well understood that the proposal of 
moving always along the same path will not 
always prevent collisions. Our research direc¬ 
tions include the problem of replanning the 
shape of r(.y) by incorporating the nonholonomic 
constraints of the rolling motion. 
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Fig 9 Time funcliom, for PFS (—) and the nominal plan (solid). 
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APPENDIX A. ROBOT INVERSE DYNAMICS 
If It is known that the object has, at instant t, translational 
velocity i;(f), orientation angle O and rotational velocity 
w(0=- 6(0 (Fig- 2). 3’hcn the instantaneous angular velocity 
and steering angle of the wheel i, are (Alexander and 
Maddocks (1989))- 



X \/j,Hr)- + |A,"r- (wTO)- - 2 

X l'’(OI («(()cos (/} - O', -0(i(O)). 




tan 


(A 1) 


I / I'i'Jl'' P - |jrfV<>(() sin (a,J e(i((H) \ _ g, 
\|ir(/)| cos/{-Ir"! ('<(') tos (q, + 0(.v(»)))/ 

(A.2) 


There are the inverse kinematics equations lor a general 
mobile robot, i e. given i;(i), B(/), B(/) equation (A.2) gives 
the angular velocities Oft) and steering angles (p^(t) of every 
wheel i 


APPENDIX B RESULTS NEEDED IN SECTION 3.2 

Result 1 Consider the system 

i ^ f(t, s) (B 1) 

Let r be a l.yapunov function candidate for (B 5) Suppose 
that 

r(t. K)^~W(x)- 0 V/ * a f .li'’. (B 2) 
where Jf^(jr) is a continuous function of r Define 

,7^ U/r(A)-=l)). (B 3) 

if 

Vp F 3 neighborhood .A'(p) 3 Vx e ^ ip), f(t.x) is uni¬ 
formly bounded m i, 
then 

v(i)-i» as I—► ^ 

Result 2 If /(f) and fit) are continuous and uniformly 
bounded then 

lim fit) = lim /(/) = 0 


APPENDIX C: INDICAITON OF THE VALIDITY OF 
EQUATIONS (78) AND (79) 

From (77) and (67) it suffices that 

- A.v; - C'(0),Vv, + - 0. (Cl) 


Solving for c and introducing (67) 






This inequality is satisfied if (78) and (79) are satisfied. 
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Adaptive Control of Systems with Backlash* 

GANG TA01 and PETAR V. KOKOrOVltl 

tor systems with unknown backlash an adaptive ifwerse controller 
promises to significantly improve system performance. 

Key Words Adaptive backlash inverse; adaptive control; backlash; control system desicn; nonlinear 
systems, parameter estimation; stability. 


Abslracl —We develop a mathematical model of backlash 
inverse and give a parameirization ol the error caused by its 
estimate We then design an adaptive backlash inverse 
controller for unknown plants with backlash and prove the 
global boundedness of the closed-loop signals. Simulations 
show major improvements of system performance 

1. IN I RODlJCnON 

Sysilm componfnts, such as hydraulic servo¬ 
valves or gears often have dead-zone, hysteresis 
or backlash characteristics (Krasnosel’skii and 
Pokrovskii, 1983; Mayergoyz, 1991; Nelushil, 
1973; Truxal, 19,^8). These nondifferentiable 
nonlinearities severely limit overall system 
performance, dheir parameters arc often un¬ 
known, and an appropriate control strategy for 
such systems is adaptive. However most adaptive 
control results are for linear plants or plants with 
differentiable nonlinearities and are not ap¬ 
plicable to nondifferentiable nonlinearilies. This 
situation motivated a research program initiated 
by two recent papers which deal with dead-zone 
at the input of a linear part of the plant (Recker 
et at., 1991; Tao and Kokotovic, 1991). In this 
paper we continue this research direction and 
solve a more difficult problem with backlash. Wc 
propose an adaptive backlash inverse design 
which guarantees signal boundedness and results 
in major improvements of system performance. 

We consider a plant with a linear part G(D) 
and a backlash nonlinearity B( ) at its input: 

V(f) = G(D)[w](r). u(t) = Bivit)), (1.1) 
where. D is used to denote, as the case may be, 

* Received 2 January 1992; received in final form 9 June 
1992. The original version of this paper was presented at the 
4th IFAC Symposium on Adaptive Systems m C ontrol and 
Signal Processing which was held in Grenoble, France during 
July 1992. The Published Proceedings of this IFAC Meeting 
may be ordered from: Pergamon Press Ttd, Headmgton Hill 
Hall, Oxford, 0X3 ODW, U.K. This paper was recom¬ 
mended for publication in revised form by Associate lidilor 
C. C. Hang under the direction of Editor P. C . Parks 

t Department of FJectrical and Computer Engineering, 
University of California, Santa Barbara, CA 931(X)6, U S A. 


the Laplace transform variable or the differential 
operator 0|xl(r)—i(f) in continuous time, and 
the z-transforni variable or the advance operator 
D[x\(k) = x{k 1) in discrete time. Both G(D) 
and B(') are unknown. Only the plant output 
^(f) is measured and the accessible control input 
is i'(f). The backlash output u(t) is not 
accessible. 

We first develop a backlash inverse for the 
continuous-time case and derive its discrete-time 
counterpart. We then design an adaptive control 
scheme for the discrete-time version of the 
unknown plant (1.1). In Section 2 wc introduce 
the concept of a backlash inverse BI(') and 
investigate its implementation. If the backlash 
characteristic B( ) were known, then an exact 
backlash inverse BI( ) inserted between a linear 
controller and the plant would cancel the effect 
of backlash. In most applications, the backlash 
nonlinearity cannot be accurately modeled and 
only an estimate of its inverse can be 
implemented. In Section 3, wc derive a 
linear-like parametrization convenient for adap¬ 
tive control. In Section 4, we use a first-order 
example to introduce the adaptive backlash 
inverse design and illustrate performance im¬ 
provement achieved. Our main result is given in 
Section 5 where we design the adaptive backlash 
inverse system for unknown plants of higher 
relative degree. 

2 BACKLASH AND ITS RIGHl INVERSE 

In this section wc develop a backlash inverse 
Bl( ) and present its implementation to be 
cascaded with the backlash B( ) at the input of 
the plant. 

2.1. Backlash characteristic 

The backlash characteristic B^,■) with input 
E(f) and output u(t) is described by two straight 
lines, upward and downward sides of B(*), 
connected with horizontal line segments The 
upward side of B( ) is active when both E(f) and 
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M(r) grow: 

u{t) = m,{v{t) - c,), m,>0, and v{t)>il 

(2.1) 

If v(t) changes sign at t = and i)(r)<0 for 
t>ti, then the upward side becomes inactive 
and the motion for t > proceeds along the 
inner segment where 

u{t) = u(ty) = constant. (2.2) 

The downward side of B(‘) is 

u{t) = mf{v{t)~ci), m/>0, o<c,, 

and v(r)<(). (2.3) 

It is reached at r = ^2 > when 

u(ti) = m,{v(t 2 )-c,). (2.4) 

During the motion on the inner segment with 
u{t) - w(r,), the following condition is satisfied: 


ujtx) 

m, 


•f C/ < 


v{{)< 


u(f\) 

m. 




(2.5) 


Starting with t- ( 2 , the downward side continues 
to be active as long as i)(r)<(). If at t = t ^>(2 
v{r) changes its sign again, and u(r)>0 for 
t>t^, then the downward side becomes inactive 
and the motion proceeds on the inner segment 
where 


u(0 = w(r^) = constant, (2.6) 

until at ( = 1 ^X 7 , the upward side becomes 
active, where satisfies 


u(r,) = m,(i;(f4)-c,). (2.7) 


This narrative description of the backlash 
characteristic is mathematically modeled by: 

uU) = 

ni,v(t) if u(f) >0 and u(t) = m,{v{() - r,) 
rrififit) if v{t) < 0 and u(t) = m,(v(t) — c,) 

if ni,(vU) - rj < u(t) - - o), 

of u(r) > 0 and u{t} ~ m,(v(t) - o), 
or u(0 < 0 and u{i) = - c,), 

or ij(/) = 0. 

(2.8) 


Although this model allows the upward and 
downward slopes to be different ^ m/ 
provided that the intersection of the two lines is 
not in the region of practical interest, we only 
consider the usual backlash characteristic with 
two parallel sides = ni/ = m. 

A typical motion on such a characteristic 
initiated at t = 0 with u(0) = 0 and m( 0) = 0 is 
shown in Fig. 1, where v{t) and u{() are plotted 


along two synchronized orthogonal f-axes. 

It is useful to treat the model (2.8) as a 
first-order dynamical system and consider u{t) as 
its state. With an initial condition m( 0), the 
knowledge of ii(r) and f;(r) uniquely defines u{t) 
for f^O. We will restrict t;(r) to be piecewise 
continuous. Except at points of discontinuity of 
u(/), its derivative v{t) will also be piecewise 
continuous. We note that u{t) is ‘more 
discontinuous" than u(f). For example, even if 
v{t) is twice differentiable, ii(0 may still be only 
piecewise continuous, due to the inner segments 
on which w(f) = 0 even though u(f)=^0. 

Another important observation concerns the 
discontinuity of u{t) caused by an inconsistent 
initialization when the pair u(0), w(0) is not a 
point on the backlash characteristic in the 
(u, w)-plane. After a jump in u(t) the pair u(O^), 
w(0'^) will uniquely define a point on the 
backlash characteristic and v(t), u(() will remain 
on it thereafter. 

Because of the dependence of u{() not only on 
u(r) and m( 0, but also on ]’(r), we think o1 the 
input-output mapping from i^(r) to u(t) defined 
by (2.8) as a description of a "relative degree 
zero" system having a causal right inverse. Next 
we develop such an inverse and use it as a part 
of our new controller structure. 


2.2. Backlash inverse 

The most damaging effect of backlash on 
system performance is the delay coriespondmg 
to the time needed to traverse an inner segment 
of B( ). The ideal backlash inverse BI( ) will 
make the traverse of this segment instantaneous 
and thus cancel this undesirable backlash effect. 
Another unde.sirable effect of backlash is the 
information loss occurring on an inner segment 
when the output u{t) remains constant while the 
input u(/) continues to change (see the 
"chopped" u(l) in Fig. I). These two unde¬ 
sirable effects are eliminated with the backlash 
inverse Bl( ) defined by the following mapping 
from Uj{t) to Lf(r): 


I 

— if d and i^(r) = — -f r, 


--- ujt) it uJt) 0 and v(f ) = - -I- c, 

trif m. 


v(t)^ 0 


if u,((t) - 0 


(2.^) 


Ww(0 


g(r, t) if Uj(t) 0 and v(t) = - 1 - c, 

m, 

-g(t, t) if uJt) < 0 and n(/) = -h c,. 

m. 


In this definition, the inverse of a horizontal 
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segment of the backlash characteristic is a 
vertical jump defined as the time integral of the 
impulse: 

g{T, t) = d(T - ^') ■ 

( 2 . 10 ) 

where (!)(r) is the Dirac O-function. Fhus an 

upward jump in the backlash inverse is 

v(/'^) = v(l-) + g(T,l)dT = v(t ) 

+ 1-^^-) + fr- O 

\m^ mi! 

+ ( 2 . 11 ) 

nir 

The effect of this jump in BI( ) will be to 

eliminate the delay caused by a segment in B( )- 
In a similar manner the use of (2.9) restores the 
information that would have been lost in (2.8) 


We show this by proving that BI( ) defined in 
(2.9) IS the right inverse of B( ) defined by (2.8). 

Lemma 2.1 (Backlash inverse). The characteris¬ 
tic Bl( ) defined by (2.9) is the right inverse of 
the characteristic B( ) defined by (2.8) in the 
sense that 

uAQ - B(Bl(w,(/o)))B(Bl(u,(r))) - uAt). 

( 2 . 12 ) 

for any piecewise continuous u,/(r) and any 

Prooj. Suppose that wj(r)^‘0 for refrn, r,J and 
some ^)>^). First, if v{tA ~ ^'r ^nd 

u{tA ~ — <r)^ then it follows from (2.9), 

(2-8) that w(r) ^ m,v(t) = m,(M^(t)/m,) = uAt) 
for refr,), /]] with u(r()) = w,,(r-,). Hence 
B(BI(M,/(r))) = uAt) for any t e [/o, r,J. Second, if 
= and w(/o) = m/(u(fo) - e,), 

then, according to (2.9), u(0 will have a jump at 
t = so that i;(r) = uAOInir H- c, for t = The 
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jump in u(r) makes u{t) traverse an inner 
segment so that u(td) = mr(i’(f(|) - Cr). which 
reduces to the first case above. 

When Ud(t) < 0 for f e [tn> <ij. a similar analysis 
shows that B(BI(Mrf(f))) = Ud(t) for any t e 
ffo. f,]. If Udit) = 0 for ^], then 

B(BI(m<,( 0)) = MO holds for any t e [r„, r,]. 

If Wj(/) changes the sign at t — ti, then we can 
repeat the procedure, and show that 

B(BI(m^( 0)) = w^(0 for any t ^ to- V 
The mapping (2.9)-(2.11) may not define a 
backlash inverse only if the signal Uj(() is such 
that u(r) and u{t) never leave an inner segment. 
This situation can happen only if i;(0), m( 0) are 
initially on an inner segment and u,f{t) = 0 for 
r ^ 0 or if u^(t) does not change sign but the total 
increment of u^{t)lmr (or decrement of u,/(t)lmi) 
is insufficient for u(0r w(0 to leave the segment. 

As Ua(t) is the design signal at our disposal, 
the above situation can be remedied. If u,t{t) 
does not reach f„ defined in (2.12), then w,/(0» 
v(t) are initialized by 


UdM , Mrf('n) , 

-c, if ir(ro) =-- + c, 

/ -4- I rri/ 

, .. ... Min) 

-+ c, if u(f„) =-+ cv. 

L m, nXr 


(2.13) 


This will always result in = B(BI(Urf(ro))) 

and (2.12) holds thereafter. 

When the exact backlash parameters m,, m,, 
c^, Cl are unknown, we will use the estimated 
backlash parameters to design an adaptive 
backlash inverse. 

Let q(t), ntrit), tv(0 be estimates of m,, 

nir, Cl. cv. and denote the adaptive backlash 
inverse BI( ) as BI(/^,(t), Cr(t), mi{t), c'i{t); ■). 

Graphically, the backlash inverse (2.9)-(2.ll) 
is depicted in Fig. 2 by two straight lines and 
instantaneous vertical transitions between the 
lines, where the downward side is 

M) = ^ + O(0. and M0<0, (2.14) 

m,(c) 

and the upward side is 

>H0 = ^^ + o(0. and MO^O. (2. l-‘5) 
m,(f) 

Instantaneous vertical transitions take place 
whenever u,i{t) changes its sign. On the lines 
i;(r) = 0 whenever w,/(r) = 0. 

In Fig, 2, the motion of v(t), u,i{t) starts with 
^^(0) = n(()) = 0. At r = 0, v{t) vertically moves 
to the upward side. For / e (()» t|], ujit) > 0 does 
not change sign so the motion stays on the 



Fig. 2. Backlash inverse. 




327 


Adaptive control of systems with backlash 


upward side (2.15), For t e (t,. r,), u,(0<0, the 
downward side (2.14) is active. At t = t^, the 
sign of Ud(0 changes causing an instantaneous 
vertical transition from the upward side to the 
downward side. Because of the subsequent sign 
changes of two more instantaneous 

vertical transitions take place; one upward at 
r = tz and another one downward at t = t^. The 
magnitude of the vertical translation of i((/) is 
equal to the length of the estimated inner 
segment of B( ). If an exact backlash inverse is 
i^ed, that is, if m, = m,, c, = c,, = and 

c7=c,, then, after initialization of the backlash 
inverse, the backlash output u(t) is equal to 
Urf(f), that is, u{t) - B(BI(Mf))) = 


2.3. Discrete-time representation 
In most applications the accessible control v(t) 
is piecewise constant, i.e, n(r) ^ vUk)- nik) for 

t e [f*, i). /c = 0. 1, 2.For such discreie- 

time applications the backlash model (2.8) is not 
appropriate because of the discontinuity of 
signals. However, from the same physical 
description of backlash given by (2.l)-(2.7), we 
can obtain the following discrete-time backlash 
model u{k) = B{v(k)): 


u{k) = 


m,{v{k) - (’,) 
•' m,(v{k) - 
Mk - 1 ) 


tor u{k )' Vi 
lor i'(A') ^ II, 
lorn,- n(A) 


where 


i',. 

(2.16) 


u(k -1) 

V, = --+ c„ 

mi 


u(k 




(2.17, 


Similarly, from (2.14), (2.1.5) with true 

backlash parameters, we obtain the exact 
di.screte-time backlash inverse model n(A)- 


BI(M^)) as 

n(A - I) 

uAk) 

v(k)= nil 

ujk) 

-+ 0 

m. 


tor u,i(k) = nAk - 1) 
lo. u,i(k) m,,(A - 1) 

for h,,(A) > n.iik - 1 )• 

(2.18) 


A„. An initialization of m,/(A), ii(A) defined by 


vik„,)={ 


X «a(*o) , „ 

-t- Cr if n(Ao) =-+ Cl 

m, nil 

-1-0 lfii(A„) =-+ 0, 

m/ 

( 2 . 20 ) 


results in w,y(A:d) = B(BI(u,/(/co ))), where the 
jump from t^/co) to n(/cu) is instantaneous. 

Based on the structure of the backlash inverse 
(2.18), the discrete-time adaptive backlash 
inverse ij(^) = BI(M,y(A:)) is 


w(A) = 


n(A-l) 

m,{k) 


i^Ak) 

mAk) 


+ c,(A) 


for uAk) - uAk - 1) 
for u,,(k)<uAk - 1) 

for uAk)'>uAk - 1), 

( 2 . 21 ) 


where mi{k), Ciik), ni^{k), c\(k) are the 

estimates oi ni,, q, m,, di t ~ tk- 


3 PARAMBIRI/ATION 

To develop an adaptive law for updating the 
estimates /n^(/), Cf(t), c,(t) ot the backlash 

inverse parameters, it is crucial to express the 
backlash inverse error u{t) — in terms of a 
parametrizable part and a unparanietrizable but 
bounded part. 

l o give a compact description for the adaptive 
backlash inverse, we introduce two indicator 
functions; 


fi 




for u,j{t), t'(0 on the upward 
side of Bi( ) (3.1) 

otherwise, 

tor uj(t), v{t) on the downward 
side of Bl( ) (3.2) 

otherwise. 


From (3.1), (3.2), we have that 

Xrin^xM=h (3.3) 

XliO = XliO^ Z?(0 = XrU), Xl0)Xr(0 

(3.4) 


The discrete-time version ol Lemma 2.1 states 
that the characteristic Bl( ) defined by (2.18) is 
the right inverse of the characteristic D(-) 
defined by (2.16) such that 

u„(A„) = B(BI(m,,(A„)))2»B(B1(w,,(A))) = uAk), 

Similar to the continuous-time case. A,, is 
reached when uAk) - uAk ~ U changes sign at 


Using (2.14), (2.1.5), (3.1)-(3.4), we express ii(r) 

vit)-{xAt)^i,m>it) = ^ 

mAt) 

X (u,j(t) + rnAOcAO) + 

m,(t) 

X (Urf(f) + m,(t)c,(i)). (3.5) 

Similarly, for the backlash B( ), we introduce 
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three indicator functions XrU)< XiO) and xA0> 
such that 


Xi(0 = 



< 


0 


for u(t) on the upward 
side of B( ) (3.6) 

otherwise. 


1 


X/(0 = 


0 


for i;(0 on the downward 
side of B( ) (3.7) 

otherwise, 


1 for u(0. w(/) on an inner 
A:iO = 'i segment of B( ) (3.8) 

0 otherwise. 

Using the following obvious relationships: 

Xr(t) + X/(0 + X^(0= (-^-9) 

xf(0 = xi(0^ XrO) = Xr(0, Xs(0 = xA0' 

(3.10) 

Xi(0Xr(t) = 0. xiiOxAO = 0. 

xA0xr(0 = 0> (3.11) 

we arrive at the compact expression for the 
output «(/) of B( ): 

m(0 = (Xr(t) + XiO) + X.(0)‘^(0 

= Xr0)'nr(l>0) - Cr) + Xl(0^r(v(0 “ f/) 

+ X<(0«v. (3.12) 

where u, is a generic constant corresponding to 
the value of u(f) at any active inner segment 
characterized by 


4 o-''w(r)<—+c,. (3.13) 

m, 

Multiplying both sides of (3.5) by XiU)^ using 
(3.4), we obtain 

XiUMO = XiiOO’riiUMO - m^(r)c,(t)), (.3.14) 

and similarly we have 

rn,U)UdU) = XrU){mr{t)v{t) - m,(t)c,(f)). 

(3.15) 

Using (3.3), (3.14), (3.15) we get the following 
expression of 

Md(0 = ihO) + Xr(l)Mt), 

= hU)0nii0v(t) - m,{t)c,(t)) 

+ x'rU){mrit)''{t) - (3.16) 

From (3.12), (3.16), we have the following 
relationship between M(f) and 

m(0 = MO + XrOK'nAvil) - O 

- m;(f)w(f) + mrUyCriO) + XiiO 
X (m,(u(0 - c,) ~ 4- m,(/)c,(f)) 

+ M0. (3.17) 


where 

MO = (Xr(0 - £(0)("'.(y(0 - c,)) 

+ (xi(t) - £(0)(Mi'(0 - c/)) 

+ Xs(t)‘^s> (3.18) 

which represents the unparametrizable part of 
the error between u(f) and Uj(t). 

From (3.18), we see that MO 's reduced to 
zero if xAO ~ £(0 = XiiO = MO = 0- 
This condition is satisfied if m^i = mi, rnr-rrir, 
Ci = Ci, and c^ = c^, because, after initialization, 
the motion of v(t), u(t) will not be on any of the 
inner segments, and u(t), v(t) are on the upward 
(downward), side of B( ) if and only if u^(t) and 
i;(r) are on the upward (downward) side of 
BI(). 

When the parameter estimation errors are 
present, the above condition is not satisfied so 
that in general. However, as we show 

next, dn(t) is bounded. 


Proposition 3.1. The unparametrizable part 
Jo(0 of the control error u{t)- Uj(t) is bounded 
for any t > 0. 


Proof, There are three different cases to be 
examined: 

(1) ifX((0 = l. Xr(0 = Xv(0 = ll. Ihcn 


0 for £(r) = i, ■i::(f) = () 

MO = i i'n/- rrirMO - m,Ci 4- c,m, 

for x,(0 = ii. £(0= 1; 

(3.19) 

(2) if XriO = 1. X/(0 = XXO = 1'. then 


MO 


0 for£(0 = 0, £(0 = 1 

•< {nir - m,)v(t) 4- m,c/ - c^rrir 
for£(r) = 1. £(O = 0; 

(3.20) 


(3) if xAO = 1. xAO = XAO = 0, then 


MO = 


~m/v(t) + Cfm/ -h Wj 

for £(0= I, £(0 = 0 

— mrV(t) -h Crtrir + 

for £(0 = 0, £(0 = 1. 


(3.21) 


If the intersection of the two lines of 

B(’) must not be in the region of interest, by 
assumption. Therefore v(t) is bounded and u, is 
given by 

u, = m,(v(t) - c,0, c,,e(c,,c0, 

or 

M, =m,(w(0-c„), c„e(c,, c,). 


(3.22) 
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This shows that in all three cases d (t\ 
bounded. ’ 

For the usual backlash characteristic, m,= 
m, = m, and the expression (3.22) always holds. 
Therefore is bounded even if ir(t) is not 
bounded. V 

Letting ^,{t) = m,{i) = m(t) and mc,(0 = 
m(f)(v(/), mciit) = m{l)ci(t). we define 

- (me,, m, me,) ', 

(^hU) ~ m{t), mciit))'. ^ ^ 

l«,,(t) = iXrU). ~V(t), Xlit))' 

Finally, from (3,17), (3.23), (3.2-J), we obtain 
the parametrized expre.ssion lor the backlash 
inverse error uit) - u,,(i) in terms of the 
parameter error -01 with a bo¬ 

unded disturbance d|i(t): 

uU) - 11 , 1 ( 1 ) = </>/',(Mw^(t) t d„(0. (3.25) 

This parametri/ation holds for the di,screte- 
time case: 

u(k) u,,ik) = <i)l(k )<e,(A:) + d„(A.). (.3.26) 

This cxpressuin will be irnporliint iov our 
aclaplivc clcsij>n. 


Define u{k) = Tkf,u(k) - B{v{k)), where 
fl(ii(/r)) = Tk^m^, Cf, c/, v(k)) is the 

modified backlash characteristic with slopes 
Tk^,mt, Tkf,m,. With the non-unity gain Tkp 
taken care of by the modified backlash model 
and u{k) renamed a^s u{k), and from (4.2), the 
linear part of the plant becomes: 

y(k + 1) = y(A:) T u{k). (4.3) 

In the absence of backlash our design 
objective would be achieved by the controller 

i^Ak) = -y{k) -^y,Ak t l). (4.4) 

In the presence of backlash we use this 
controller along with an adaptive scheme 
designed to update the backlash inverse on-line. 

Since, by assumption, m is known and 
cv=-C/ = c, we let m(k) = m, mci{k) = - 
mcAk) = rni(k). The backlash inverse error 
equation (T26) becomes: 

u{k) - uAk) = (p(k)(i){k) + do(k), (4.5) 

where wik) xA^) - Xi(k), <p(k) fi{k) ~ 0*, 
6(k) = fnc{k), and 6*-me. 

For the tracking error eik) ~ y(k) - we 

obtain from (4.3)-(4.5) 

e(k)=(Hk - \)a){k 1)- 6*(i)(k - 1) 

^dAk-\). (4.6) 

and define the estimation error as: 


4 AN INi KODUC lortY LXAMPl 1 

4hc purpose ot this section is to give an 
introductory example of adaptive backlash 
inverse. I'o focus on the backlash problem, we 
consider that the linear part of the plant in the 
continuous-time form is (i(P) ~ k^lD where k^ 
is a known constant. Foi the backlash charac¬ 
teristic we assume that only its breakpoint 
parameter c\ - -Cf ~ c " 0 is unknown, while the 
slope m *0 is known. 

Our objective is to design an adaptive law to 
update the backlash inverse estimate and a 
control n,/(0 to stabilize the closed-loop system 
and make the plant output y(t) track a given 
reference signal v„,(0 which specifies the desired 
system behavior. 

For a discrete-time control design, the linear 
part of the plant is given by 





(4.1) 


When u{() is piecewise constant and 
k ^ 0, then 


y(k= y{k) + Tk,,iiik). (4-2) 


f(A)-f(A')-h 0{k)io{k - 1) - 0(k -])o){k - 1). 

(4.7) 


This is the implementable form of €(k) to be 
used in adaptive update laws. A simpler but 
unimplemenlable form of e{k) obtained from 
(4.6) and (4.7) is 

e(k) = (p(k)(n(k - l) + ri„(/c - 1). (4.8) 

Using the implementable form of e{k), our 
update law lor Uik) based on a gradient 
algorithm (Cjoodwin and Sin, 1984; Landau, 
1990) with an initial estimate 6(i)) is 


0{k ^ ])^0{k) 


Ya){k ” i)e{k) 

T-f"7ir(fc - T) 


~o{k)0{k). ()<y<L 


(4.9) 


where a(k) is a ''switching-o signal” (loannou 
and Tsakalis, 1986). Its implementation requires 
a priori knowledge of an upper bound M on 

K^l: 



foTmk)\>2M, 

otherwise 


0< a„< ^(1 - y). 

(4.10) 
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The stability and tracking properties of the 
closed-loop system (4.3)—(4.5), (4.9) are: 

Proposition 4.1. All signals in the closed-loop 
system are bounded and there exist ao>0, 
A > 0 such that 


A I 


2 I d(,(k)-h^,„ (4.11) 




*=*,-2 


for any kj ^2, A :2 — 0- 


Proof. Using (4.8), (4.9) and introducing 


e(k) = 


e(k) 

\/l + (o^(k-l)’ 


^ Vr+w^(k^)' 


(4.12) 


we express the time increment of V(k) = (p~(k) 
as 


V(A:+ l)-V'(/c):£-0(,yf-(/:) 

-o„a(kmk) + dMk-l). (4.13) 
This proves that tp(k) is bounded. By definition. 



a)(k) is bounded. Hence e(k) in (4.6) is 
bounded, and so is y(lc). Finally, ujk) in (4.4), 
o(A) in (2.18), u(k) in (2.16), and thus all 
closed-loop signals, are bounded. 

Using (4.7), (4.9), we have that 

e^(k)^2eW + 2io\k - l)(d(k) - (l(k - I))' 

, , j2Y^(o\k-2) 

y p-(k - n 4- 7n^(k - \W(k - liV 




which, in view of (4.12), (4.13) and the 
boundedness of tu(fe), proves (4.11). V 
To evaluate the closed-loop system perfor¬ 
mance improvement achieved by the propo.sed 
adaptive backlash inverse, simulations were 
performed for the first-order plant (4.2). The 
plant parameters were taken as: = 3.7 and 

m,. = m/ = m = 1.3. The parameter cv=-r/ = 
c = 1.25 is unknown to the adaptive backlash 
inverse. The discrete-lime lime step 7 = 0.1 so 
the modified slope is k^^Tm = 0.4625. 
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Fig. 3. Tracking errors for y,„{t) = 10 sin 1.26r 


211 adaptive backlash inverse 


-1 


10 


Fig. 4. Trucking errors for >',„(/) -10 sgn (sm 2.2l) (square 
wave). 
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Three cases were studied for comparison; (l) 
onl\ the controller (4.4) is applied, that is, no 
backlash inverse is implemented; (2) the 
controller (4.4) and a fixed backlash inverse 
(that is, the backlash inverse implemented with 
fixed parameter estimates) are applied; (.3) the 
controller (4.4) and an adaptive backlash inverse 
are applied. 

The .sy.stem re.spomses to y„.(0 = 10 sin 1.2(if 
with c(0) = 1.91 are shown in Kig. 3, and the 
system responses to y,,.!/) = 10 sgn (sin 2.2/) 
(square wave) with ^(0) = 0.59 arc shown in Fig. 
4, where c{() = mr(r)/m(f) and mU) - m. 

The simulation results show that the adaptive 
backlash inverse (case (3)) leads to major system 
performance improvements in all the cases ot 
different initial conditions and dilfcrent re¬ 
ference signals: in addition to the signal 
boundedness the adaptive scheme achieves 
convergence to zero of both tracking error and 
parameter error, while the control error also 
converges to zero because the parameter error 
docs. The simulations results also show that a 
fixed backlash inverse whose parameter was 
either underestimated or oveiestimatecl (case 
(2)) is also useful; the tracking error in reduced 
while it is quite large in case (1) when no 
backlash inverse is used. 


5 ADAPTIVr ( ONIROI DlSK.N 
We are now prepared to address the main 
problem of this paper; adaptive control design 
for an unknown discrete-time plant with 
unknown backlash at its input. Using D to 
denote the z-transform variable or the advance 
operator, as the case may be, the unknown plant 
to be controlled is 


y{k) - G(D)[u\{k), u(k)^ B(v{k)), 


(HD)- 


ZiP) 

IHD) 


1 ) 


Without loss ol generalny, the polynomials 
Z(D) and R(D) are assumed to be monic so that 
the high-frequency gain ot Ci(D) is one, and the 
actual high-frequencey gain of the plant is 
represented by the slope ni of the backlash B( ) 
We make the following assumptions about the 
plant: 

(Al) G(D) is minimum phase; 

(A2) the relative degree rU of (HD) known; 
(A3) the degree n of R(D) is known; 

(A4) m ^ AHd for some known nio ^ 0» ^md 

Our problem is to design an adaptive 
controller to achieve the tracking ol a reterence 
signal y^{k) by the plant output y(A:). To solve 
this problem we need a controller structure 


which not only achieves tracking when all the 
plant parameters arc known but also results in a 
tracking error expre ,sion suitable for adaptive 
control design, 

5.1. Controller structure 

From the usual controller structures used in 
adaptive linear control we borrow the feedback 
part, that is, we pass the output y(k) through a 
linear filter \)^, where a{D) = 

(D " ^ \ , D and 8^ e R'\ The new part 

of the controller is its forward part. It must 
incorporate the backlash inverse and, hence, 
cannot have a linear structure. In c>olving this 
new problem wc still want to preserve the linear 
parametrization of the error equations, which 
will be the main tool of our adaptive design. A 
structure which meets this requirement is 
obtained by passing not only the control signal 
u(A), but also the backlash inverse signals x^(/c) 
and Xi{^) through the filter formed of a{D) and 
adiustablc parameters 
Introducing the four regressors: 

w,(A) = t/(D)[A:,](A;), (Oi(k) = a{D)\xi\(k), 

(5.2) 


o/„(A:) = u(D)|i<](^), 
c./,(fc)-(«'(/)), l)'|,vP). 

which multiply the conesponding adjustable 
parameters, wc propose the tollowing linear-like 
structure ot the nonlinear adaptive controller: 

u,f(k) 0^u),{k) -f Hj(Of(k ) y 0|,^o^,(k) 

y6iw,(k)^yJk-^rC), (5.4) 

This controller is shown in Fig. 5 where XiiO and 
Xf(t) are obtained from the logic block L which 
implemenls (3.1) and (3.2). 

The output u^i(k) of this controller is applied 
to the adaptive backlash inverse in order to 
genet ate the plant control input; 

i;(A)-BI(M/r)). (5.5) 

We now show that for this controller there 
exist matched values of the adjustable para¬ 
meters resulting in exact tracking with internal 
stability. 


pUur 



F If, s I he uduptivc inverse CDnirtiller structure 
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Lemma 5.1. There exist matched values 0*, 
ffy, ffb of 6r, 6,. 6„, 0J,, and dy with which 
the controller (5.4) achieves the closed-loop 
global stability and tracking y{k + n*) = y^ik + 
n*). 

Proof. We first express the matched values of 
the parameters 6 ^, 6, and in the forward part 
of the controller in terms of the backlash 
parameters = {mc^, m, mc,y multiplied by a 
parameter 6 * eR" '. 

9* = —$umCy, = 

(5.6) 

When then define 6 * and the matched value 
6 * of the feedback parameter 6 y as the solution 
of the Diophantine equation: 

d*JaiD)R(D) + ey(a'(D), l)^Z(D) 

= R{D) - Z(D)D’'\ (5.7) 


Lemma 5.2. The controller (5.4) with arbitrary 
6 y(k), 6 i(k), 9 „{k), dy{k), O^ik) re..ults in the 
tracking error consisting of a linear part 
<p^ {k)(o{k) and a bounded part d,(k), that is: 

e(k) = yik) - y„,(/c) = <p''{k - n*)u)ik - n*) 

+ d,(k).<p(k)=6(k)-9*, (5.14) 

d,(k) = d„(k -n*)- eya{D)[d,,]{k -n*). 

(5.15) 

Proof. Recall from (3.16), (3.23) and (3.24) that 
the expression of the backlash inverse estimate 
m ) is uj{k) = ~-6l(k)(Of,ik) so that (3.26) gives 

u{k)^-Ot/w,ik)^d,(k). (5.16) 

Substituting (5.16) in the right side of (5.11) and 
(3.26) in the left side of (5.11), and using (5.S), 
we obtain 


Substituting the matched values in the 
controller (5.4) we see that its forward part is a 
linear parametrization of the nonlinear term 
e:/a{D)[^6toj,\{kf namely: 

e*Ja(D)[-e:co,]{k) = er(Oy(k) 

+ e!^vj,(k)-^d:ya>y{k). (5.8) 

With the matched values and (5.8) the 
controller (5.4) has the form: 

u,{k)=6:aiD)[-e*,a>,]ik) 

+ e;^ia'iD), l)^t>J(k) +y„(k + n*). (5.9) 

The right inverse Bl( ) of the backlash B( ) is 
also matched, and hence 

u{k) = B(BI(n^(A:))) = u^k) = -6:(vdk). 

(5.10) 

On the other hand, when both sides of (5.7) 
are divided by R(D) and then operated on u(k), 
the resulting identity is 

u(k)=^ e:/a(D)\u](k)+ 0:'ia^(D), U^fyKA:) 

+ y(k + n*). (5.11) 

Equating u(k) of (5.11) with Uj{k) of (5.9) and 
using (5.10) prove y(k + n*) = y„(k + n*). The 
closed-loop system is globally stable because 
with the matched values the closed-loop poles 
are zeros of D"-"''* 'Z(D). V 
It is due to the identity (5.8) that the new 
controller structure (5.-1) will lead to a 
convenient linear form of the tracking error 
expression. For a more compact notation, wc let 

e(k) = (d:(k), e',(k), ei(k), ei{k), ei(k)y, 
e* = (er^ e;', e*/. otY, (5. 12 ) 
a)(k) = ia}J{k), (i}J(k), (o^k), o)l(k), u}Uk)f. 

(5.13) 


u,i(k) + (l)!,{k)o}h(k) + d„{k) 

= oya)y(k)-ry,„ik + n*) + e(k+n*) 

+ d;'a)y{k) +0!'(o,(k) +e*'o,Jk) 

+ e:'a{l))\d„\{k). ( 517 ) 

Finally, we substitute (5.17) into (5.4) to gel 
(5.14). V 

We have thus obtained a tracking error 
equation with a linear parametrization and an 
unknown, but bounded, disturbance. 

5.2. Adaptive scheme 

Our major task now is to design an adaptive 
scheme to update the parameters oi the backlash 
inverse (2.21) and the controller (5.4) to 
guarantee the signal boundedness for the 
closed-loop system. This task is achievable with 
the tools of adaptive linear control (loannou and 
Tsakalis, I9K6; ligardt, 1979; Kreisselmeier and 
Anderson, 1986; Narendra and Annaswamy, 
1989; Middleton et al., 1988; Praly, 1990). 

One update law for 6 {k) with an initial 
estimate ^(0) suggested by the form of the 
tracking error equation (5.14) is 


6 {k + l)^6{k)~ 


yo){k - n*)f (A:) 
r^io^ik- n*)(i){k-n*) 


-a(k)0(k)y (5.18) 


where e(/c) is the estimation error: 
e{k) = eik)^ie(k)- 0(k - n*)yo){k - 

(.5.19) 

and a{k) is a “switching-a modification” 
(loannou and Tsakalis, 1986) whose implemen¬ 
tation requires a priori knowledge of an upper 
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bound M on the Euclidean norm ||0*||2 of 0*- 

aik) = 

|a„ for ||e(k)|| 2 > 2 M 

lo otherwise ’ 2(1 ~/)• 

(5.20) 

Although not shown in (5.18) we use 
projection t^ ensure that and 

m^k)^0^mcXk) to implement the adaptive 
backlash inverse (2.21). 

This adaptive control scheme has the following 
stability and tracking properties. 


Theorem 5.1. All signals in the closed-loop 
system are bounded and there exist > 0, 
> 0 such that 

k^ + k2 n" 

2 e\k)^a„ X dl{k) + li„. (5.21) 

k-=ki A i k^~nu 

for no = 2n* -I- n - 1 and any /c, ki^^O. 

Proof. The first part of the proof, which shows 
the boundedness of the update law, is standard. 
Substituting (5.14) in (5.19) results in 


e(k) = (p(k)(i){k - n*) + d^{k). (5.22) 

Using (5.18), (5.22) and introducing 

m- _, 

Vl + U)'{k --n*}<i)(k-n*) 


d,{k) 


dm 

Vl + io^{k - n*)(i){k - n*) ’ 


(5.23) 


we express the time increment of V(k) = 
(p\k)^{k) as 

V{k + l)-Vik)^-a,mik) 

-o„o{k)e\k)d(k) + dm (5.24) 

This proves that (p{k) = 0 {k) — 0* is bounded. 
In view of (5.22), (5.23) and the boundedness ot 
di{k), this implies that e{k) is also bounded. 

In the second part of the proof we use a novel 
technique to show the closed-loop signal 
boundedness. Let us introduce 

u){k) = {u>f(k), (oUk))'. a>„(k) - fl(tl)[Hl(k). 

(5.25) 

It can be shown that there exist bounded 
sequences f,(k) e", 
g 2 {k)eR^"~' and con.stant FjeR^^" such 

that 

(a{k) = Fmd>{k) + gm’ 

(b{k) = FMk)Fg2{k). (5.26) 

Then we use (5.7) and the fact that 


y{k) = iZ(D)/RiD))[u]ik) to obtain 

0 :^(Ouik) + d;^(Oy{k) = u{k)-y{k + n*). 

(5.27) 

Using this equality and the definition of a>(fc), 
we express 

ro{k + 1) = A*a)(/c) + b*y(k + n*)=A*(b(k) 

+ [«](* + '»*). (5.28) 

for some constant matrix A* e and 

constant vector b* e R~'' Since the first 
component of d){k) is M(/c-n + l), it follows 
that fore* = (1,0.0)' eR""~\ 


Z)"''^'u(D) = c*^(DI -A*) 'b* D"‘u(/1), 


R{D) 


(5.29) 


which implies that 


c*{DI -A*) -'h* = , , 

Z(D)L>""" ■' 

det {DI~A*) = (5.30) 


that is, A* is a stable matrix. Using (5.19) and 
the first equality of (5.2S), we have 

G}{k -1- 1) - v4*d)(fc) + h^{yrr.{k + ^*) c(k -h n^) 
- {e{k ^ n*) - 0 {k))‘io{k)). (5.31) 

From (5.30), all eigenvalues ol /I* arc inside 
the unit circle of the complex plane. I’herefore 
there exists a non-singular constant matrix 

I).(2.-1) \\QA*Q '||2<1. 

Define the vector norm ||'|| in ^ by; 

Ikll = IIC^^IIz From (5.26), there exist constants 
c, >0, r, > 0, / = 1, 2, such that tor all k ^ 0 

||fu(A)||--.cJ|ru(^)|b-+ c,, (5.32) 

||u;(/c)||,-(MKu(/c)|| + r,. (5..33) 

Introducing x(k) = \lik n*)\ ||6(A: -I- n*) 

-W(A:)|| 2 , using (5.18), (5.24) and (5.32) it can 
be shown that ther exist constants ^0, ( ^ > 0 
such that 


k\A-k2 A I 4 A 

^ x^{k)t:c^ X 

A-/C, A Ai 


,..l-f.||a)(A)|p^‘’"- 


(5..34) 


It follows from (5.30), (5.31), (5.33), there exist 
constants Oo e (0, 1), Cs > 0, c,, > 0 such that 

||w(/: -1-1)11 <(ao + c,x(A)) ||a>(/c)|| -I- c^,. (5.35) 

Substituting (5.19) in (5.31) and using (5.14), 
(5.26), wc obtain 

wik + \ {A* + h*<(>' {k)Fm)dm + gm> 

(5.36) 
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for some bounded sequence g 3 ()t) e R^~'. Since 
ipik), Fiik) and g-nik) in (5.36) are bounded, 
ci>(^) grows at the most exponentially, that is, 
there exist constants C 7 > 0 , Ch > 0 such that for 
all ^ 2:0 


||ft>(* + l)||<C7||a,(A:)||+c«. (5.37) 

Now we show the boundedness of a)(k) by 
contradiction. Assume that d)(Ac) grows un¬ 
boundedly. Then, in view of (5.37), given any 
(5o > 0 and k 2 > 0 , we can find d e ( 0 , <5,|] and 
Hc] > 0 such that 


ke{kj-n* . ky-l). 


llw(fc)||=-, k = ku 


(5.38) 

(5.39) 


ll"(A:)||^^, ke{k, + l.... ,k,+k 2 + \). 

(5.40) 

Therefore, for ; e {0, . . . ,/cj}, the state 
transition function of ||d)(ilc-I-1)(| = (a„ 
c,jt(/c)) ||<i»(A:)|| satisfies 


'iZ/ / r, * 1 +/ \l 

n (flo + C5x(A:))<(fl„-l--~ 5 x(A:)) 

< ^flo + (c,Vc-,(n* + 1 )) 6 „ + c, 

< (a„ -I- (CfiVc^(n* + 1)) 

X ( 

'fln + c,Vc 3 (n* + 1 )^ 


(5.41) 


With any 60 satisfying a„ + (c,Vc-i(n* -t- 1)) 
60 <1, (5.35) and (5.41) imply that 

||d»(/r,1)11 < (flo-)-C5x(A:)) ||ti)(/c,)||-l-Cv, 

*=*, 

(5.42) 


for some constant C 4 > 0 , and FI (fl() + 

k^k 

c^xik)) < \ for any j > and some 7 , ^ 0. Hence, 
for 


past values. Since this sum contains di{k) and its 
past values, we use (5.15) to express di{k) in 
terms of do{k — n*) and its past values and 
finally obtain (5.21). V 

We have thus shown that the adaptive law 
(5.18) ensures robustness with respect to the 
bounded disturbance d^i{k). Another advantage 
of this update law is that the asymptotic tracking 
is achieved if the adaptive backlash inverse 
converges to the exact one, that is when d()(/c) 
disappears for large /c, see (5.21). 

We have not yet shown that, in general, the 
tracking error e{k) converges to zero. The 
dependence of rf(>(/c) on the parameter error 
suggests that this will be so if the adaptive 
system has sufficiently rich signals. Extensive 
adaptive backlash inverse simulations showed 
that with rich signals the backlash inverse 
parameters converge to their matched values so 
that the asymptotic tracking is achieved. 

6. CONCLUSIONS 

This paper has presented what appear to be 
the first formulation and solution of the adaptive 
control problem for systems with backlash. To 
achieve this, we first demonstrated the right 
invertibility of a general backlash model and 
parametrized the error expression of a backlash 
inverse estimate needed for continuous and 
discrete-time implementation. We then intro¬ 
duced a new linear-like structure for a nonlinear 
controller capable of cancelling the effects of 
backlash. This controller structure makes it 
possible to obtain a linear error equation, with 
the effect of an inaccurate backlash inverse 
represented by a bounded disturbance. From 
this point on, a robust adaptive update law was 
designed to guarantee global signal bounded¬ 
ness. Simulation results showed major system 
performance improvements. 
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(5.42) implies that ||d>(^i + y + 1)|| < 1/6 for any 
e (y'l,.... ^ 2 }, which is a contradiction. Hence 
w{k) is bounded, and so is cj{k). 

Next we use this fact to prove the bound given 
by (5.21). From (5.18) and (5.19), we first 
evaluate a bound on the sum of e^(k) in terms of 
€\k) and a^{k- l)d^(k - 1 ) 0 (A: - 1 ) and their 
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Performance Limitations of Non-minimum 
Phase Systems in the Servomechanism 

Problem*! 

L. QIUJ: and E. J DAVISONS 

^ exists in the achievable transient performance 

which is possible to he obtained in the tracking and disturbance rejection of 
a non-minimum phase system, and this limitation can be characterized 
completely by the number and locations of the right-half plane zeros. 

Key Words I incar npumal regulator, servomechanisms, non-minimum phose systems, transnnssion 
zeros, cheap conirol 


Abstract Ihis paper studies the Lheap regulator problem 
and the cheap servomcLhanisni pioblem lot systems which 
may be non-mmimum phase The stud\ extends some 
well-known properties o| ‘pertcLl regulation’ and the 
‘perfect tracking and distuibance lejcclion of minimum 
phase systems to non-mmimum phase systems It is shown 
that perfect re)eclion to disturbances applied Id the plant 
input can he achieved no matter whether the system is 
minimum phase or non minimum phase, whereas a 
fundamental limitation exists m the .Khievable transient 
pertormance of tracking and rejection to disturbances 
applied to the plant oulpiil lor a non-minimum phase 
system, and that this limitation can be simply and completely 
characterized by the number and locations of those zeros ol 
the system which he m the right half ol the complex plane 
f urthcrmoic, this limitalion provides a quanlitative measure 
ol the ‘ degree ol dilficulty" which is inherent in the control 
ol such non-minimum phase systems 


1 INTRODUCTION 

h MAS lONti HI I N realized that minimum phase 
systems have certain advantages over non- 
minimum phase systems, foi example, right- 
invertible minimum phase systems can achieve 
perfect regulation (Kwakeinaak and Swan, 
1972b; Francis, 1979; Scher/inger and Davison, 
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1985) and perfect tracking/disturbance rejection 
(Davison and Chow, 1977; Davison and 
Scherzinger, 1987). These properties, ht^wever, 
are not possessed by non-minimum phase 
systems. In addition, a non-minimum phase 
system, unlike minimum phase systems, has 
various fundamental limitations associated with 
the achievable closed loop transfer matrix 
(Cheng and Desoer, 1980) the achievable closed 
k/op gam margin (Tannenbaum, 1980) LOG 
loop transfer recovery (Stein and Athans, 1987; 
Zhang and Freudenberg, 1990) sensitivity or 
complementary sensitivity mmimi/ation (Freud¬ 
enberg and Looze, 1985; Francis, 1987) model 
reference adaptive control (Miller and Davison, 
1989) etc. On the other hand, it has been recog¬ 
nized that not all non-minimum phase systems 
behave m the same way; for example, some non- 
minimum phase systems produce results which 
are 'Tilmost as good” as minimum phase 
systems, whereas other non-minimum phase 
systems arc indeed ‘ almost impossible” to 
control. Il is the purpose of this paper to study 
the quality of non-minimum phase systems with 
respect to tracking and disturbance lejection. 
We will show that for each non-minimum phase 
system there exists a fundamental limitation on 
the achievable transient response of the system. 
This limitation has a simple characterization in 
terms of the number and the locations of those 
zeros of the system which lie in the open-right 
complex plane, and provides a quantitative 
measure of the “degree of difficulty” which is 
inherent in the control of a non-minimum phase 
system 

In this paper, we first study the cheap linear 
quadratic regulator (LQR) problem for non- 
minimum phase systems. Consider a linear 
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time-invariant system described by 
X = Ax -f Bu, x(0) = JC(,, 
y = Cjc + DM, 

where u, x, y are finite dimensional vectors 
depending on the time t. Assume that 
(A, B, C, D) is stabilizable and detectable, and 
consider also the associated optimal cost 
functional 

= min f (y'y + e^w'w) dt. (2) 

14 J() 

The cheap LQR problem concerns the limit 
y,) •= lim 4- It is shown in this paper that ^ has a 

e—I) 

very simple expression when {A, B, C, D) has a 
particular special structure. This expression is 
not very important per se since very few systems 
have such a special structure. However, the 
significance of the re.sult lies in the fact that for 
every transfer matrix, there always exists a 
realization which has such a special structure. 
Therefore the result has Important applications 
in control problems which depend only on the 
system transfer matrix rather than on the 
internal realization of the system. The cheap 
optimal servomechanism problem, which is the 
focus of this paper, belongs to this class of 
problems. An expression for Jo involving the 
concept of “zero directions” was obtained by 
Shaked (Grimble and Johnson, 1988) for the 
case when D is a zero matrix and A, B, C arc 
general matrices; this expression, although 
interesting, is quite complicated and is not 
convenient to use in our application. 

Consider now a system with noise corrupted 
input and output: 

X = Ax ^ B{u 4 5), A'(0) = (J, 

(3) 

y=Cx^D{u + ^)^T], 

where ^ is the input disturbance, rj is the output 
disturbance. Wc assume again that (A, B, C\ D) 
is stabilizable and detectable. A control problem 
which often arises is to design a controller for 
system (3) such that the overall system is 
internally stable and such that the output y 
asymptotically tracks a reference signal for 
arbitrary r) and yref contained in a certain class 
of signals. This problem is called a 
servomechanism problem, U is well-known that a 
general servomechanism problem can be treated 
as one with the reference signal being equal to 
zero since the distinguishing role between y,,., 
and r/ disappears if the tracking error y - is 
taken to be the output under consideration. 
Therefore, we will assume that yrcf=b through¬ 
out the paper. In practice, a controller which 


solves the servomechanism problem is also 
required to have a good transient response, i.e. 
it is desired that the closed loop system should 
have a “fast speed of response” without 
“excessive peaking/oscillation” occurring in the 
output y and other system variables, as they 
approach their steady state values. To achieve 
such a response, we seek a controller which 
generates an input to achieve the following 
optimal quadratic cost functional 

7^ — min I (y'y 4-c'w'w) dr, (4) 

n Jo 

where u is a variable associated with the 
transient behaviour of the input u. The problem 
concerning the limit 7o;=lim/t is called the 

cheap servomechanism problem. It is clear that 
Jf and 7o depend on the controller structure used 
since the controller structure will determine the 
physical meaning of u. In this paper, we first 
look at the ideal case; we assume that the system 
parameters arc exactly known and that the 
disturbances are measurable. In this case, a 
feedforward controller structure can be used 
(Davison, 1973). It is shown in this case that7o is 
a quadratic form on the output disturbance only, 
and that a norm of this quadratic form can be 
characterized explicitly by the locaticms of the 
zeros of system (3) which lie in the open right 
hand side of the complex plane. In practice, the 
.system parameters are always somewhat uncer¬ 
tain and it is often impossible to measure the 
disturbances; in this case, the servomechanism 
problem is still solvable but the controller has to 
contain an internal model of the disturbances, 
see eg. Francis and Wonham (1976) and 
Davison (1976). Such a controller is called a 
robust servomechanism controller. We will show 
in this case that 7() has the same characteristics as 
in the teedtorward controller case, i.e. J^^ is a 
quadratic form on the output disturbance only 
and a norm of this quadratic form can be 
characterized explicitly in the same way by the 
locations of those zeros of sytem (3) which lie in 
the open right hand side of the complex plane. 
The significance of the results for the feedfor¬ 
ward controller lies in that it tells us what is the 
best possible result that can be achieved in the 
ideal case, while the results for the robust 
servomechanism controller show that even 
though the controller does not have as much 
information, the limiting performance is identi¬ 
cal to the ideal case. 

The structure of this paper is as follows. 
Section 2 gives some preliminary material on the 
factorization of a system into the product of an 
inner system and a minimum phase systems. 
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Section 3 gives the main result on the cheap 
LQR problem for a special system resulting from 
the factorization given in Section 2. Section 4 
studies the servomechanism problem with cheap 
quadratic cost using feedforward controllers 
Section 5 studies the same problem using robust 
controllers. It is assumed in Sections 4 and 5 that 
the disturbances are constant signals. Section 6 
extends the result obtained in Sections 4 and 5 to 
sinusoidal disturbances. Section 7 contains an 
example. Section 8 contains conclusions. 

2 PRELIMINARIES 

The transfer matrix F of the system (I) or (3) 
is given by 

F{s) = D C{sl - A) ^B. (.S) 

Conversely, a four-tuple of real matrices 
{A, B, C, D) is said to be a realization of a 
proper real rational matrix (transfer matrix in 
short) Fif (5) is satisfied. Throughout this paper, 
the following notation is used to divide the 
complex plane into three parts; = {a e 
C:Re(i)>0}, C’= {a 6 L: Re (.s) - 0}, d - 
{a e r: Re (.v) < 0}. A transfer matrix is said to 
be stable if all of its poles arc contained in 1 , 
and a square constant real matrix is said to be 
stable if all of its eigenvalues are contained in 
C . 

The zeros of a transfer matrix are defined to 
be the roots of the numerator polynomials of the 
nonzero elements of its Smith-McMillan form. 
A transfer matrix is said to be minimum phase if 
it has no zeros in C^; otherwise it is said to be 
non-minimum phase. The zeros of system (1), 
system (3) or simply a realization (A, B, C\ D) 
are defined to be the roots of the invariant 

polynomials of the matrix 

Similarly, we can define the concepts of 
minimum phase and non-minimum phase for 
system (1), system (3) and a four-tuple 
{A, B, C, D), as was done for a transfer matrix. 
Although the zeros of a real rational matrix and 
those of its realization may be different, their 
minimum phase or non-minimum phase property 
is always the same as long as the realization is 
stabilizable and detectable. 

Associated with any realization (A, B, C, D) 
in which A is stable, there are two Lyapunov 
equations: 

AP^PA^ = -BB\ (h) 

A'Q^QA = -C'C. ( 7 ) 

The solutions F, Q to equations (6)-(7) are 
called the controllability grammian and the 
observability grammian of {A, B, C, D), 


A-kl FJ 
. C D\ 


respectively. A minimal realization (A, fl, C, D) 
of a stable transfer matrix F is called a balanced 
realization if the solutions P, Q to equations 
(6)-(7) are diagonal and equal. It is shown in 
Moore (1981) that every stable transfer matrix 
has a balanced realization. Procedures to find a 
balanced realization from any minimal realiza¬ 
tion of a stable transfer matrix are given in 
Moore (1981) and Laub et at. (1987). 

A stable transfer matrix F is called inner if 
F'(~-s)F{s) = 1 . All the zeros of an inner matrix 
must be located in 


Lemma 1 (Glover, 1984). Let (A, fl, L\ D) be a 
balanced realization of an inner matrix F and let 
P, Q be the solutions to equations (6)-(7). Then 

(a) P=0 = /. 

(b) D D = l 

(c) DT: + zr-o, DP' + c-o. 


A transfer matrix F is said to be right- 
invertible if F has full row rank for at least one 
V c C. If (A, B, C, D) is any realization of F, 
then the right-invertibility of F is equivalent to 


the fact that 


A-A/ B 


has full row rank for 


at least one A f C. Therefore, no confusion will 
be caused when we talk about the righl- 
in'ertibility of system (1), system (3) or 
realization (A, B, C, D). 

The following factorization result serves as a 
foundation for our development. It is noted here 
that when the poles and/or zeros of two transfer 
matrices are compared in the following, we 
consider not only their values but also their 
multiplicities in the Smith- McMillan sense. 


Lemma 2. A transfer matrix F can always be 
factorized as b = F, F such that F| is inner, F^ is 
minimum phase and right-invertible, and the 
unstable poles of F arc equal to the unstable 
poles of F. 

The result has been known for a long time but 
its original proof is hard to trace. Readers are 
referred to Qiu and Davison (1990) for a proof 
and Zhang and Freudenberg (1990) for a proof 
of its dual version. 

Given a transfer matrix F, let F- FF be the 
factorization described in Lemma 2. Let 
C), Di) be a balanced realization of F, 
and let {A;, Bj, G, th) be any stabilizable and 
detectable realization of F- Then a stabilizable 
and detectable realization of F is given by 



(-|C| £>,62], D = D,D2. 
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This realization is called a factorized realization 
of F and plays an important role in the 
development. 

The following lemma gives some useful 
properties of right-invertible transfer matrices 
with respect to the factorization described in 
Lemma 2. 

Lemma 3. Let F be a right-invertible transfer 
matrix and let F = F^F^ be the factorization 
described in Lemma 2. Then F| is square, the 
zeros of F^ are equal to those zeros of F 
contained in and the poles of Fi are equal to 
the negatives of the zeros of Fi- 

Proof. Let F be r x m. Then Fi has r rows and at 
most r columns. If Fi has less than r columns, 
then the rank of F is less than r for all s eC, 
which contradicts the assumption that F is 
right-invertible. Therefore F, must have r 
columns, i.e. it must be square. It follows 
directly from the identity Fi^(s) = F{(-\) that 
the poles of Fi are equal to the negatives of the 
zeros of F,. 

Let (y4, F, C, Z)) be a factorized realization of 
F. Those zeros of F which are contained in 
are the complex numbers A e which make the 
matrix 

">t,-A/ FjC. F.D. 

0 Aj-XI 
C, D,r. D,D,J 

reduce rank. Adding times of the third 

row to the first row and multiplying the third row 
by we transform the above matrix into 

Ai-FiDf'Ci-A/ 0 0 

0 A,~A/ F, 

Dr'Cl C. Dz. 

Hence the zeros of F in are the eigenvalues 
of — F,Dr'C,. Notice from Lemma 1 that 

A, - FjD^'r , - A, + F,Dr'D,F; 

= a,4^f,f; 

= A,-(A,^A[)=^A\. 

This completes the proof. □ 

We end this section with a few words about 
the norm of quadratic forms. A (real) quadratic 
form f on V E is a function of the form 
f(v)~v'Qv for some symmetric matrix Qe 
Therefore a norm of Q gives a norm of 
the quadratic form /. The trace norm of Q, i.e. 
the sum of the singular values of Q, is of 
particular interest. If Q is positive semi-definite, 
then its trace norm is equal to its trace. The 


following lemma gives a physical meaning to the 
trace norm. Let f^{') denote the expectation 
operator. 

Lemma 4 (Levine and Athans, 1970). Let 
be a positive semi-definite matrix and 
let u be a random vector in W with = 0 and 
g(im') = /. Then ^^(v'Qv) = tr Q. 

3 CHRAP LOR PROBl.LM 

Consider the cheap LOR problem defined by 
(l)-(2). It is known that = x!)P,Xi) and that the 
optimal control which stabilizes the system and 
which achieves the optimal cost is given by 
w = -(6'/ + D'D) '(FT,+ D'C)x, where P, is 
the unique positive semi-definite solution lo the 
following algebraic Riccali equation (ARE) 

[A-Bic'l^D'D) 'D'C]'F, 

-f F,[A-F(eV + D'D)-'D'C] 
H-C'[/-D(r/ + D'D) 'D jC 
-P,B[€^-I^D^D) ^B’P, = {). (9) 

It is easy to show (Kwakernaak and Sivan 
(1972b)) that F, monotonically decreases as e 
goes to zero, and so the limit P)\— lim P, exists. 

t -n 

The following result was proved in wScherzingcr 
and Davison (1985); the same result for the case 
when D = 0 was obtained in Kwakernaak and 
Sivan (1972b) and Francis (1979). 

Lemma 5. F„ = () if and only if (A, F, (', D) is 
minimum phase and right-invertible. 

For systems which do not satisfy the 
conditions given in Lemma 5, (Francis, 1979) 
characterized the null space of /n, which is 
simply the set of all Xn with and (Saben 

and wSannuti, 1987) gave a complete decomposi¬ 
tion of the state space in terms of the transient 
speed of the state trajectories. In the following, 
we will show that F„ takes on a very simple form 
if (A, F, C, D) is a factorized realization of an 
arbitrary transfer matrix. 

Lemma 6. Let (A, F, C\ D) be a factorized 
realization of a transfer matrix. Then F, " 

r/ 0 1 

0 P y unique positive 

semi-definite solution to the ARE 

[A2 - , 

+ P,i[A.- -{- D^D.) 'DK’;| 

+ Cil/-£).(e7 + DiD2) 'D,]C2 
- PMe-I + D;D2) 'B.P,2 = i). 
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^ Ljc^-I accordingly with the 

partition of A. Then the optimal control is ^ivcn 


The proof of the first statement of this lemma 
is obtained simply by verifying that the given 
solution indeed satisfies ARE ( 9 ). The second 
statement follows by using Lemma L The details 
of the proof are dry algebra and are omitted. 

A direct application of Lemma 6 leads to the 
following corollary. 


Corollary l.If (A,/?, C\ D) is a balanced 
realization oi an inner transfer matrix, then 
P, = / and the optimal cimtrol is u = 0 . 


Since any minimal realization of a transfer 
matrix is similar to a balanced realization, this 
corollary implies that if (A, B, (\ D) is a 
minimal realization of an inner matrix, then the 
optimal control of the system ( 1 ) under cost f 2 ) 
is always zero, and thus the optimal performance 
is independent of e. This is consistent with the 
well-known fact that cheap control asymptoti¬ 
cally puts all the poles of the closed loop syvStem 
to the mirror points of the system’s zeros in i ' 
with respect to the imaginary axes. Since an 
inner matrix already has this property, no 
control is therefore needed to make it optimal. 

Since (An, Bi, Cn, Di) is a stabihzable and 
detectable realization of a minimum phase and 
right-invertible transfer matrix, the following 
theorem is obtained immediately from Lemmas 
5 and 6 . (In the statement of the theorem, we 
assume that P, is partitioned accordingly with 
the partition of A given in |H).) 

Theorem 1 . Let (A, /i, T, D) be a factorized 
realization of a transfer matrix. Then 



4 FELDIORWARD SERVOMECHANISM 
C C:)NTRnLEER 

In the rest of this paper, we apply Theorem 1 
to study various optimal servomechanism prob¬ 
lems with transient performance measured by 
cheap quadratic functionals. Consider the 
servomechanism problem for system (3). Denv)te 
the zeros of (A. P, C, D) which are contained in 
(if any) by A,, Ao, . . . , A/, To achieve clarity 
in the presentation, we assume in this and the 
next section that the disturbances are constant 
signals. The results will be extended to the 
sinusoidal signals in Section 6 . 

In order for the servomechanism problem to 


be solvable, it is necessary and sufficient to have 
the following assumption. 


Assumption 1 . A.ssume that 
row rank. 


B 

D. 


has full 


Assumption 1 implies that system (3) is 
right-invertible and has no zero at the origin. 
Assume that di.sturbances f and r/ are measur¬ 
able. Then under Assumption 1 , the following 
feedforward controller solves the servomechan¬ 
ism problem: 

M = /C.v~i;-[D-(C-I DK){A eBA) 'P]'/;. 

( 10 ) 

where K is any matrix which makes A 4 BK 
stable and [1^ ~ [■l'([-]|']') ‘ (Davison, 1973). 
The inverse involved exists due to Assumption 1. 

Assume controller (10) is applied to the 
system (3). The closed loop stability implies that 
the input and the stale of the system will 
approach constant values in the steady-state. 
Denote the steady-state values of the input and 
the stale by li and X, respectively. I'hen il and x 
must sati.sfy equations 

0 = Ai 4-P(u + ^). 

0 = Cx 4 D(u + ^) 4 t/, 
u ~ Kx ~ ^ 

-fD-(C4DA)(A4PA) *P]'r/. 

Let the transient part of the input and the state 
be denoted by u u - u and x := x - x, 
respectively. Ehen these values arc governed by 
the following equations; 

X =- Ax 4 Pm, X(0) = “i, 

1 ’ — Ci 4 Du, 

14 ^ Kx. 

These equations suggest that in order to have 
a good transient response, we can choose K to 
achieve the following optimal quadratic cost 
functional: 

y, = min I (y'y 4-r-u'w) dr. ( 11 ) 

From the knowledge of the LOR problem, J, is 
a positive semi-definite quadratic form on 
jf( 0 ) = -X, which in turn is linear in S and 17 . 
Hence, is a positive semi-definite quadratic 
form on ^ and rj. Since ./, monotonically 
decreases as c" goes to zero, it follows that 
^;=limy, exists. Ihe following theorem says 

that y„ is a positive semi-definite quadratic form 
on rj only, i.e. is independent of the input 
disturbance, and that a norm of 7,, can be simply 
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given by the locations of the zeros of system (3) 
in C^. 


Theorem 2. for some positive semi 

I 1 

definite H and tr H = 2 £ — . 

fA, 


Prcfof, The nature of the problem setup 
indicates that 7^ depends solely on the transfer 
matrix F of system (3). Therefore, we can 
assume that (A, B, C, D) is a factorized 
realization which is of the form of (8). Let 
F,{s) = D,-^C,{sI-A,)-'B, and 
C 2 {sl - A 2 )~^Bn. Since F is right-invertible by 
Assumption 1, it follows from Lemma 3 that F, 
must be square and that the poles of F^ arc 

It is known that = x'(0)PfX{0) = x'PJe, 
where P^ is the unique positive semi-definite 
solution of ARE (9). By Theorem 1, Po-= 

Let X be partitioned as x = 


lim P^ = 

f ♦() 


L() 0 


JCl" 

Xi- 


according to the partition of A as given by 


(8); then 7^ = 

Now assume that the closed loop system is at 
steady-state. The output of F, must be -rj and 
the output of F 2 is therefore —F^\{))r]. It then 
follows that jf 1 = Ar'fliFi '(0)?j. Let /7 = 
F;\{))B[A\-'A, '£f,F, ‘(0). Then 7o=r;77r/. 
Since the matrix F)(0) is unitary, it follows that 


trW = tr(B;A;-'Ar'fl,)-tr(Ar'fi,fl;Ar'). 
By using Lemmas 1 and 3, wc obtain 
tr/7--tr [A,‘(A,FA;)Ai 'I 
( 1 

--2tr(Ar^) = 2S Y'- 

I 1 A, 

It is not a surprise that the feedforward 
controller (10) produces perfect control, i.e. 
7n = 0, for the case when only the input 
disturbance is present, even if the system (3) is 
non-minimum phase. In fact, the feedforward 
controller generates a signal which completely 
cancels the input disturbance. However for the 
case when the output disturbance is present, 
perfect control cannot be obtained for non- 
mmimum phase systems, and a norm (or an 
averaging effect) of the optimal performance 7, 

/ 1 

IS now bounded from below by 2 H . This 

f 1 A, 

( 1 

result shows that 2 E - can be considered as a 

i- \ A, 

quantitative measure of the degree of difficulty 
in solving the servomechanism problem for 
non-minimum phase systems with constant 
disturbances. This result also emphasizes the fact 


that not all non-minimum phase systems behave 
the same. A system with a small positive zero is 
more difficult to control than a system whose 
zeros in are far away from the origin. On the 
other hand, a conjugate pair of complex zeros 
oc±jp in with or «|j8| will not cause signifi¬ 
cant difficulty in control since its contribution 

Act 

to the limiting performance 7o is 


(X^ -I- p 


2 • 


This 


later phenomenon has been observed in Davison 
and Gesing (1985) in the control design for a 
large flexible space structure. A similar result to 
Theorem 2 was given in Morari and Zafiriou 
(1989) for SISO systems by using frequency 
domain techniques. 


5 ROBUST SERVOMECHANISM CONTROLLER 

In practical applications, the parameters of 
system (3) arc always somewhat uncertain and 
we often do not have access to the disturbances. 
In either case, controller (10) cannot generally 
be used. To overcome these difficulties, a robust 
servomechanism controller has been proposed 
to solve the servomechanism problem (Davison, 
1976). The robust servomechanism controller 
does not require the disturbances to be 
measured, but must contain a servocompensator 
which contains the modes of the disturbances. A 
significant advantage of the robust servomechan¬ 
ism controller over the feedforward controller is 
that tracking and disturbance rejection occur for 
all perturbations in the system provided only 
that the perturbed closed loop system remains 
stable. 

Assume again that Assumption 1 holds. The 
robust servomechanism controller for system (3) 
with constant disturbances can take the following 
form 


2 ( 0 )- 0 , 

u = KuX + Kz, 


where [F„ K\ is chosen to stabilize matrix 


A 

C 



B 

D 




K]. 


On combining the original system and the 
servo-compensator (the integrator), the aug¬ 
mented system with input u and output z is then 
described by the state-space equation: 


X 

— 

A Oipi f/J 

(u + <;) -f 

0 



0 oJLzJ Ld. 


/ 


z = 



^( 0 ) 

2 ( 0 )„ 
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The action of the controller becomes that of a 

state feedback, i.e. u = [K, Define new 

variables: x ~x, z := i, u := u. On noticing that 
z=y, the augmented system then becomes 

[Me :\m:h 

>-|0 


minimum phase systems 
Let 




'■'Ic, 


fl, 

0 

D, 


and 


X 2 

z 


^ I 

= T z 

L-^2 J 


Assumption 1 guarantees that T is invertible. 
System (15) can then be transformed to 


and the controller becomes «= [X,, K] 

A 

Xi 


"a, b, o" 



" 0 ~ 

j. It can be easily shown that system 

X 

2 

A 

_ 


0 0 Cz 

_ 0 0 Az_ 

y 

_^2_ 


D, 


u. 


/ 

A 

0 


B 

1 

C 

0 


D. 


.[0 /j.o , 


IS always stabilizable and detectable under 
Assumption 1 and the assumption that 
(A, fl, C, D) is stabilizable and detectable. 

This suggests that in order to have a good 
transient response, we can choose [K^^ A^] to 

achieve the following optimal quadratic cost 
functional 

J, = min (y'y 4- c'u'u) dr (14) 
U J() 

By the same argument as made in the last 
section, we see that J, is a positive semi-definite 
quadratic form on £ and rj, and that Ji) := lim 

t —0 

exists. The statement of the following theorem is 
exactly the same as in Theorem 2. 

Theorem 3. Jo=r]'Hj} for some real positive 

1 

semi-definite H and tr H = 2 E - 

i 1 A, 


y = [Ci D, 0 ] 




( 16 ) 


and the initial condition then becomes 
'je,(()) B.l). 0 


i(0) 


LD|Zl ;J 


Note lhat (16) is a factorized realization with 

/!<) ('- 

Ml, fli. C), D,) being inner and I 


0 A. 


Dz 

B, 


, [/ OJ. oj being minimum phase and 

right-invertible. Let be the unique positive 
semi-definite solution to the ARE associated 
with system (16) and optimal cost functional 
(14). Theorem 1 leads to 


lim /\ 

f .11 


/ 0 0 
0 0 0 
U 0 0 


Proof. Again 7, depends solely on the transfer 
matrix F of system (3). Iherefore ( A, B, C , D) 
can be assumed to be a Lii torized realization of 
the form of (8). Consequently (13) can be 
rewritten as 


Ai 

B,Cz 0 


ii 


B,lh~ 

0 

A. 0 


^2 


Bi 

c\ 

D.C. 0_ 


2 




Xi 



y = [0 0 


/] 


Xl 


with initial condition 

4 ,( 0 ) BJ)2 

izfO) ^2 

Lz(0)J IhDi 



and y„ = i,'(0)-fi(0) Direct calculation shows that 


i,(0) = |/ 0 0|/' 


' B, 


= -/l, '(O)r;. 



where F|(0) — D, — CjA,'fl,. Let H — 
F[~'iO)B[A[' 'Ai'B^F;'H)). Then = 

The rest of the proof now proceeds in exactly 
the same way as done in the last part of the 
proof of Theorem 2. □ 


Two interesting points can be observed on 
comparing Theorem 3 with Theorem 2. Firstly, 
although we can no longer completely cancel the 
input disturbance when the robust ser¬ 
vomechanism controller is used (as was done for 
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the feedforward controller) perfect control still 
occurs for the case when only the input 
disturbance is present even if the system is 
non-minimum phase. In other words, no matter 
whether the system is minimum phase or not, 
the robust servomechanism controller's reaction 
to the input disturbance can be made arbitrarily 
fast. This result is perhaps somewhat surprising. 
Secondly, we are concerned if the use of the 
robust servomechanism controller sacrifices the 
potential performance of the controlled system, 
in comparison to the use of the feedforward 
controller. Since the variable u in (11) and (14) 
have different physical meanings, it may appear 
that the norms of /o given in Theorem 2 and 
Theorem 3 arc incomparable. However, it has 
been shown in Kwakernaak and Sivan (1972b) 
that if wc denote the outputs of the optimal 
system in both cases by , then 7,1- 
lim J,t v/ Vf dl, i.e. when c is small, essentially 

c^O 

contains only the output term. Therefore, in 
both cases, the limiting transient speed of the 
output, measured by a norm of/(,. are the same. 
In other words, the use of the robust 
servomechanism controller will not lead to a 
significant loss m the potential limiting perfor¬ 
mance of the system. 


(^ SINUSOIDAL DISTURBANCES 
In this section, we extend the results obtained 
in the last two sections to the case when the 
disturbances are sinusoidal signals. We will 
obtain results which are in the same spirit as in 
Theorems 2-3. 

Assume that the disturbances in system (3) are 
now of the following form 

^(r) = Eh sin an + cos wi, (17) 

ViO ~ 1 sin lot -f 7]^2 cos oj(, (18) 

where f,. '= I t ' I '^rid rr. := ' are real 

constant vectors. In order for the servomechan¬ 
ism problem to be solvable for system (3) with 
disturbances of the form (17)-(18), it is 
necessary and sufficient to have the following 
assumption. 


Assumption 2. Assume that 
full row rank. 


A-jcoI B 
C D_ 


has 


Assumption 2 implies that system (3) is 
right-invertible and has no zero at ju). 

First, we consider the ideal case when the 
system parameters (A, B, (\ D) are exactly 
known, and the disturbances as well as their 


derivatives are available for measurement. In 
this case, the following feedforward controller 
solves the servomechanism problem: 

M = /Cjc - ^ - Re {[D + (C + DK) 
x{ja)I-A~BKy'BY}r] 

-~lm{[D + (C + DK) 

CO 

x(j(o/-A-BK) 'BYji/, (19) 

where K is any matrix which makes A -f BK 
stable and [ = [ ]'([ ][ T) ^ (Davison, 1973). 

The inverse involved exists due to Assumption 2. 

Assume controller (19) is applied to system 
(3), and assume that the steady-state signals of 
the input and the state are given by u and x, 
respectively. Then u and x must satisfy 
equations: 

X = Ax -h Bu B^, 

0 - Cx -f Du -f DE + 7], 
u = Kx Re {[D^(C~\-DK) 

X A - BK) 

-- lm(|/) + (('4 DK) 

0) 

x(]iol-A - BK) 7y]')r/. 

Let the transient part of the input and the stale 
be denoted by ii := u - u and x := v - x, 
respectively. Then these values are governed by 
the following equations: 


X = Ax -I- Bu, x(()) = -t(()), 
y - Cx Du, 
u = Kx, 

These equations suggest that in order to have 
a good transient response, we can choose K to 
achieve the following optimal quadratic cost 
functional 


J, — min j (y'y T c'ld'il) dr (20) 

From the knowledge of the LQR problem, is 
a positive semi-definite quadratic form on 
i(0) = -ifO), which in turn depends linearly on 
Ef. and Hence J, is a positive semidefinitc 
quadratic form on and j],.. Since J, decreases 
monotonically as goes to zero, it follows that 
y„:=limy. exists. The following theorem says 

f -0 

that y,, a positive semidefinitc quadratic form 
on 7 ],. only and a norm of 7,, is given by a simple 
expression involving only the zeros of the system 
(3) in 17 
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Theorem 4. 7,, = n'Mr],. for some positive 
definite M and 


semi- 


tr A/ = 



1 

-f —.——. 

A, -I- j{jD 


)■ 


Proof. Similar to the constant disturbance case 
we can assume that {A, B,C.D) is a factorized 
realization of the form of (8). Let f|(.v) = 
/)i + C,(a/ - A^)~'B^ and Fi(s) = Do + V^isl - 
An) 'Bj- Since system (3) is assumed to be 
right-invertible, it follows from Lemma 3 that F, 
must be square and that the poles of F, are given 
by -A,, -Aj, .... -A,. 

It is known that 7, = i'(0)F,^(()) = i'(0)P^^(()) 
where is the unique positive .semidcfinite 
.solution of ARE (9). By Theorem 1, p^ - 


lim P, = 

r-M) 


I 0 
0 0 / 


Let X be partitioned as jr = 


In this case, we see that a system with zeros in 
close to /w, where co is the frequency of the 
disturbances, is rnore difficult to control than a 
system whose zeros in are far away from jw. 

We can see, as in the constant disturbance 
case, that the feedforward controller (19) cannot 
be used cither when the system parameters 
(A, B, C, D) are uncertain or when the distur¬ 
bances are not available tor measurement. The 
robust servomechanism controller does not have 
these disadvantages. The robust servomechanism 
controller for disturbance signals of the form 
{17)“(18) can take the following form tDavison, 
1976) 



^i(O) (21) 

2,(0) _ 


according to the partition ot A given by (8); 

thcn7n = Jfi'(0)i,(0). 

Now assume that the closed loop system is at 
steady-state. I’he output ot F^ must be -rj. I'o 
obtain the state x, of f,. we use /'(•) to denote 
the Laplace transform operator. Then i, is 
related to rj by 

Let L(.s) ^ (.s7 -/4,) Steady slate 

sinusoidal analysis tells us that 

1 

ii = - Re L{jw)r] - Im L{jcL))rj, 

(it 


Hence, 

i,(()) ™ “Re L{j(o)Ti,.2 - liTi 

Let 


A7- 


Im L(jof) 
Rc 


[Im L{j(o) Re L{ju))l 


u = + /C,2| + 

wheie [A"o A", A^] is chosen to stabilize the 
following matrix 

A {) {) B 

C 0 -fj// + D [Ao A, A,). 
0/0 0 


The augmented system (with input u and 
oulput Zn) is then described by the state-space 
equation 


X 

ii 

Z-y 


A 0 
C 0 
0 / 


0 

or I 
0 




”0“ 



D 

(m -t- ^-) + 

/ 


Z|(()) 



_ 0 _ 


_ 22(b) _ 


:,-[0 0 /| 


= 0 , 


Then 7„ — and 

tr Af = tr [L*(ju})L{jw)\ 

= tr[Ai* \ja))B\{}iol - A^)"" ' 

X (yVu/- Ai \j(o)] 

= tr[Bl(iu)l-A,)* 'i}(oI - A,) 'BI] 
= tr [(ytu/— A|) 'BiB[(—jtoJ — A^) ]. 

By using Lemmas 1 and 3, we obtain 
tr Af = tr [{j(ol - A i) ' 


VA,-;ftj 


■ jti) - jto 


Define new variables x .= x + u}^x, f, z, 
-I- u)'z ^, Zn := Zi -t- uy^Zn, and u ii + tu“«. On 
noting that Z 2 + the augmented system 

then becomes 


X 

in 


A 

C 

0 


y = [i) 0 l\ 


0 

-(0^1 

0 

X 

hA 


X 

i\ 

L^2 j 


-I- 


B 

D 

0 


( 22 ) 


A,)-'l / 

'a 0 0 ” 



where 

r 0 -(o-i 

, 

D 

° \ 

0 / 0 


_ 0 _ 


[0 0 /j.O 
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is always stabilizable and detectable under 
Assumption 2 and the assumption that 
(A, B, C, D) is stabilizable and detectable. 

The initial condition of system (22) is given by 


f(0) i(0) 

2,(0) = z,(0) 

2*2(0) J [22(0) 


AO 0 
C 0 -U)~I 


+ D [li(O) + ^(0)1 + 1 7/(0) 

LoJ Lo_ 

Ta 0 0 “I 

C 0 -<o-l 


D [u(0) +5(0)1+ I 7/(0) 


+ 1^ D J(m( 0) +5(0)1+|^/j 7/(0). 

These equations suggest that in order to have 
a good transient respon.se, we can choose 
[K„ /C, K,] to achieve the following optimal 
quadratic cost functional 


= min (y'y + e^ii'u) d/. (23) 

Again the optimal cost is a positive 
semi-definite quadratic form on 5f ^ind 7 /,,, and 

y„=lim7, exists. The statement of the next 

theorem is exactly the same as that of Theorem 
4. 


y = [0 0 0 I\ 


'At + 0)^1 A,fl, B, O’ 

0 0 0 / 

C,A, C,fl, D, 0 

C, D^ 0 0. 


As.sumption 2 guarantees that T is invertible. 
System (24) is then transformed to 

■jr,^ A, fl, 0 O' ,t, 

0 0 / 0 f| 

22 0 -m’/ 0 C. f, 

X. 0 0 0 A. X. 


y = [C, D, 0 


Theorem .s. /„ = 7/,!A/7/,. for some positive semi- 
definite M and 


= S (7--+-)■ 

i-.i \a., - ju> A, + jiu/ 


Proof. Again we can a.ssume that (A, B, C, /)) 
laciuiizcLi icaiizaiion 01 r wnicn is or me 
of (8). Consequently (22) can be written as 


A, ByC. 0 
0 A, 0 

C| D 1 C 2 0 
0 0 / 


System (25) is in the factorized form with 
(A,, By. C|, Dy) being inner and 

0/0 0 

-mV 0 C, Dj . [/ 0 0), 0 

0 0 A. B. 

being minimum phase and right-invertible. Let 

Pf be the unique positive semi-definite .solution 
of the ARE associated with system (25) and the 
optimal cost functional (23). Theorem 1 leads to 


/’(,;= lim P, 

c-Mt 


/ 0 0 0 
0 0 0 0 
0 0 0 0 
0 0 0 0 


and /„ = jE|'(0)i|(0). 



rtnance limitations of non-minimum phase systems 
The initial condition i,( 0 ) can be obtained as; 
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-fi(0) = [/ 0 0 0] 


2|(0) 

^2(0) 

U2(0)J 

= K 0 0 OJT”' 

AiCt 0 0 -| 

I ^ >1.’ 0 0 


^1 j 0 - a>^/ 


(1 

0 1 0 J 


B,Dr 

^2 


“0~ 

0 

7 

D,D, 

[«(()) + ^(0)1 4 

0 




r;(0) 



~B,D,- 


“0“ 

A 


D,D, 

[n(()) 4 &(())! 

0 

/ 

m 


_ 0 _ 


\J)_ 

j 


7, AN EXAMPLE 

An experimental flexible beam system is 
described by the following transfer function 
(MacLcan, 1990) 

F(s) 

= _ 1^ 26s'' - 1 . 661 .-’ - 287fa" + 453 j + 954 00 
5.v'’+ 4.83 a'' + 2312?“ -t- 48Rs’ -I- 60657 j^ + 40.5j ’ 

which has a stale space model of the form (1) 
given by 

-0.996 -463 -97.8 -12131 -8.11 0 


A = 


1 

0 

0 

0 

0 


0 

0 

0 


0 

0 

1 

0 

0 


0 

0 

0 

1 

0 


0 

0 

0 

0 

1 


0 

0 

0 

0 

0 


Straightforward computation leads to: 


= ’(■,)-’ +„r/| ' 

X [M, - BJ), 'r;(()) 

+ «,/), 't)(0)1 

~ ~[(A| — V i)’ -l- (a'/| ' 

X IM, - BJ);'( ,)/!,/), S, ^ 

Let L(a) = (.v/-A, 4 Z?,/),'C |)“'. I'hen 
simple algebra shows that 

I.is) = (sl-A,) */i, 

x[D, + r,(s7 - A,) ^/i,] ' 

’rs). 

and 

ii(0) = -Re [L(ja))]ri^;, - Im \L(j(o)\r|,^. 


Im L(joj) 
-Re L(j(o) 


m L(jio) Re 77 j(o)Y 


Then Jt) — The rest ot this proof is the 

same as the last part of the proof of Theorem 
4. □ 


Again we have two similar observations as in 
the constant disturbance case. Firstly, perfect 
control still occurs for the case when only the 
input disturbance is present, even if the system is 
non-minimum phase. Secondly, the use of the 
robust servomechanism controller does not lead 
to a significant loss in the potential limiting 
performance of the system, compared to the 
feedforward controller case. 


0 

7?“’ 

' 0 


0 

C=[[) L65 -0.331 -576 90.6 19080], 

n = 0 . 

Note that this is not in the factorized form as in 
(H^ 

This system has two zeros in : A, ^ 6.18 and 
A>-17.7. r)ui purpose is to .solve the ser¬ 
vomechanism problem for this system with 
respect to a constant reference signal 
(assuming that there are no external distur¬ 
bances present). As we have discussed in Section 
1, we can consider y - y,,, as the output and 
-\V, as the output disturbance for this problem. 

Let us firs! apply the feedforward controller of 
the form (10) (with D = 0); 

u = [-C{A ^ BK^) ^B] 

where K, is chosen to achieve the optimal cost 

J, = min [(y-y,ef)'(y-yrcr) 

u J() 

4 e^(u - u)'(u - m)] dr. 

Assume that the system has a zero initial 
condition. l.ct x, be the steady state value of the 
state variable; then x, = ~(A + BK^y'Bl- 
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Table 1. Comparison of optimal costs for feedforward 
CONTROLLER CASE 


€ 

4 

_4^ 

4. 

1 

i-2iyL 

0.951yl, 

2.54 X 10- 

10"' 

0,640y?„ 

0.592y?e, 

4My%, 


0.513y?., 

0.499y?„ 

1.39 X 

10 ' 

0.4A8y?„ 

0.462yf„ 

6.70 X 

10 " 

0.448y?„ 

0.445y;„ 

2.62 X 


Table 2. 

Comparison of omMAL 

COmROl Ll R C'ASI 

COSTS F(^R ROBUST 


1, 

4r 

4. 

1 

1.78y;„, 

I.SOy?,, 

2 76x 10 'yf,, 

10 ' 

0 962yr;-, 

0.867y?„ 

y 49y;,, 

10 ‘ 

0 690.v;,| 

0 650y,^, 

4 (X) X 10-yf,, 

10' 

0 56.1yf., 

0 547vj,, 

1.63 X 104;,, 

10 " 

0 506y?„ 

0 497y,’,, 

9.59x104;’,, 

10 ' 

0 472y?,, 

0.447v=„ 

5.4KX 104;,„ 


+ and 4 = 

where is the unique positive semi-definite 
solution to ARE (9) (with D = 0). From 


Theorem 2, we obtain -I-= 

0.437v^cf as e—>0. Table 1 gives the computed 
value of for several different values of e. For 
the convenience of comparison, Table 1 also 
gives 


and 


'^vt • I .VreO yrcl) 

Juc ’= - M)dr, 

^0 


where and u, are the output and control 
trajectories of the optimally controlled system. 
The quantities and can be obtained as 
tr (CL^C') and tv (K^L^Kc), respectively, where 
Lf is the solution of the following Lyapunov 
equation 

(A -h BK,)L, + L,{A F BKJ + i,i; = U. 


It is seen that as c—>0, the computed value of 
is approaching the limiting cost /(, = U.437>';,.f 
obtained from Theorem 2. 

Now let us apply the robust controller 

2 y yrefi 

u = K^^^x + 

where [Ki)^ K^] is chosen to achieve the optimal 
cost 


4 = min [(>'->',.,)'(>'- 3 ',„)-He^u'u]dL 

U 

We know from Section 5 that = [0 -y,^t\P, 
[_ j Ihe unique positive 

semi-definite solution of the following ARE; 


A' 

C' 


A 

0 

. 0 

0 . 


C 

0 . 


+("]|(' 'i-pp. 


B 

OJ 


[B' ()]/", = a 


From Theorem 3, we again obtain that 
> 0.437yrcf as e—►(). Table 2 gives the 
computed value of for several different values 
of €. Table 2 also gives 


:= (y. ->’r.l)'(.V, -yrcl) dt, 

Jll 


and 


/„, := u;«, d/, 

h 


where >■, and u, are the output and control 
trajectories of the optimally controlled system. 
The quantitic.s 7,, and J,„ can be obtained as 
tr((0 /]L,|0 /]') and tr((AC„, K,\L,\K„, /C,)'), 
respectively, where L, is the solution ol the 
following Lyapunov equation 


/A 0 
UC 0. 


B 

LOJ 


[A',„ 


a.))l 



A 

C 



B 

() 


[A,„ 


K. 


I)' 


I" 


Again, It IS seen that as r—^0, the ci)mputed 
value of J, IS approaching the limiting cost 
y„ = 0.437yrti obtained from Theorem 3. 


H ( (JNC L DSION 

This paper considers the cheap regulator 
problem and the cheap optimal servomechanism 
problems for systems which may be non- 
minimum phase. The basic tool used is a 
factorization which factorizes an arbitrary system 
into the product of an inner system and a 
right-invertible minimum phase system. Based 
on this factorization, the study of an arbitrary 
system can be decomposed into the study of an 
inner system and the study of a right-invertible 
minimum phase system. The cheap control 
problem of an inner system becomes easy to 
analyse by exploiting various properties of inner 
matrices, while the cheap control problem of a 
right-invertible minimum phase system has 
already been intensively studied. 




Performance limitations of non-minimum phase systems ^49 


A novel contribution of this paper is the 
establishment of the fact that the number and 
the locations of the zeros of a system m the open 
right half of the complex plane, are trutial 
factors which determine the best attainable 
closed loop performance of the system In 
particular, it is shown that the fundamental 
design limitations on the closed loop perfor¬ 
mance of the servomechanism problem can be 
completely characterized by the number and the 
locations of the zeros of the open loop system 
which he in the open right half of the complex 
plane This design limitation can be used to 
evaluate an open loop system, i e to determine 
whether the system is “inherently hard to 
control", and to assess a given closed loop 
design, 1 e to determine how near the closed 
loop system's performance is from the best 
attainable 
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Discrete-time Loop Transfer Recovery for 
Systems with Nonminimum Phase Zeros and 

Time Delays*t 

ZHIHONG ZHANGt and JAMES S. FREUDENBERG8 

An analysis of the effects of nonminimum phase zeros and delays on the 
discrete-time loop transfer recovery procedure leads to a better under¬ 
standing of the procedure. 


Key Words—Discrete time systems, optimal control, nonmimmum-phitscd systems, delays 


Abstract —The purpose of this paper is to study what 
happens when the discrctc-limc loop transfer lecovery 
([TR) procedure is applied to plants with nonminimum 
phase zeros and with uniform lime delay in all channels 
Explicit expressions are given for the asymptotic behavior ot 
the resulting sensitivity function anti loop trandcr tunclion 
These results yield a belter understanding of the mechanism 
of the discretc-time loop transfer recovery procedure and the 
design limitations due to nonminimum phase zeros and time 
delays 


1 INTRODUCTION 

SiNO THI SPMINAL wofk of Kwakcmaak (1%9) 
and Doyle and Stein (1979, 1981) Ihe loop 
transfer recovery method has received a lot of 
attention and evolved as a formal design 
procedure (Stein and Athans, 1987). Recently, 
extension of the loop transfer recovery design 
technique to discrete-time systems has been 
studied by a number of researchers (Maciejow- 
ski, 1985; Ishihara and Takeda, 189b; Niemann 
and Sogaard-Andersen, 1988; Kinnaen and 
Peng, 1990). Motivation for such an interest can 
be seen as follows. Firstly, guaranteed feedback 
properties for the discrete-time linear quadratic 
optimal regulator or Kalman filter do exist 
(Safonov, 1980; Shaked, 1986; Anderson and 
Moore, 1990) although they arc not as good as in 
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the coiitinuous-time case. Nevertheless, it is 
desirable to have a method of recovering these 
properties. Secondly, the loop transfer recovery 
procedure significantly simplifies the use of the 
LQG methodology. Knowing that it will be 
recovered in the LTR procedure, the dc-signer 
can mainly concentrate on the design of the state 
feedback loop. 

There are two types ot observers for 
discrete-time systems: predicting observers and 
filtering observers (Franklin and Powell, 1980). 
The predicting observer is used when there is a 
significant computation time and the filtering 
observer is used when the computation time is 
negligible. The standard state feedback scheme 
is dual to the predicting observer, but not to the 
filtering observer. This implies that there is a 
fundamental difference between the problem of 
recovering state feedback loop properties at the 
plant input using a specially tuned Kalman filter 
(which is a filtering observer) and the problem of 
recovering the optimal observer loop properties 
at the plant output using a specially tuned LO 


optimal regulator. 

Maciejowski (1985) -studies the problem ot 
recovering state feedback properties at the plant 
output. He shows that if the plant is minimum 
phase and has no time delays and if the cheap 
control regulator is applied to the filtering 
observei then the feedback loop of the observer 
can be recovered. He also observed that 
although It is generally impossible to have 
perfect recovery when the plant is nonminimum 
phase or when the predicting observer has to be 
used a useful degree of recovery is often 
obtained. An interpretation of this phenomenon 
in terms of asymptotic eigenvalue locations was 
also provided in Maciejowski (1985). Due to the 
fact that sampling often introduces nonmini- 
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mum phase zeros (Astrom et al., 1984) that 
computation lime is not always negligible, and 
that many plants contain time delays, it is 
important to understand what happens when the 
loop transfer recovery procedure is applied 
under these conditions. The problem of loop 
transfer recovery for a plant with delays has 
been studied in Kinnaert and Peng (1990) for 
minimum phase systems. Wc shall see that their 
results can be included in our framework as a 
special case. 

In this paper, we shall use the same approach 
as in Zhang and Freudenberg (1990) to study the 
effect of nonminimum phase zeros and time 
delays upon the LTR procedure. In Zhang and 
Freudenberg (1990) we studied loop transfer 
recovery for continuous-time nonminimum 
phase plants. We first factorized the plant into a 
minimum phase part and an all-pass factor 
expressed in terms of plant nonminimum phase 
zeros and their associated directions, and used 
these factors to derive explicit expressions for 
the loop transfer function and sensitivity 
function resulting from the LTR procedure. 
Design interpretations were then obtained from 
those expressions. The results of Zhang and 
Freudenberg (1990) were derived for the 
problem of recovery at the plant input and could 
also be applied, via duality, to the problem of 
recovery at the plant output. Since duality is not 
so complete with discrete-time systems, we shall 
only consider here the problem of recovery at 
the plant output. There are several procedures 
to achieve the loop transfer recovery (e g. 
Niemann and Sogaard-Andersen, 1988; Tsui, 
1989; Saberi and Sannuti, 1990; Chen e( al., 
1991). Our discussion will focus on the one 
based on Riccati equations (Maciejowski, 1985). 

The rest of the paper is organized as follows. 
Section 2 contains definitions, some properties of 
nonminimum phase systems and a generalization 
of the minimum phase/all-pass factorization 
formulas in Enns (1984) and Zhang and 
Freudenberg (1990) to discrete-time syvstcms. In 
Section 3, we derive explicit expressions for the 
limiting values of the sensitivity and loop 
transfer functions when the LTR procedure 
proposed in Maciejowski (1985) is applied to a 
nonminimum phase plant. Interpretations of the 
results are also given. The results of applying the 
LTR procedure with the predicting, rather than 
the filtering, observer are discussed in Section 4. 
In Section 5 we study the application of the 
LTR procedure to a plant with uniform time 
delay in all channels. An example is given in 
Section 6 to illustrate our results and conclusions 
arc found in Section 7. An abbreviated version 
of this paper was presented in Zhang and 
Freudenberg (1991). 


2. NOTATION AND ALL-PASS FACTORIZATION 
In the following, we consider a discrete-time 
system described by state equations 

+ Buk, (1) 

yk = Cxu. ( 2 ) 

where jce/?” is the state, u g is the control 
input, and y e/?"' is the measured output. It is 
assumed that (CM) is observable, (A, B) is 
controllable, C and B are each full rank, and 
G(z) := C{zl - Ay ^B. We shall use superscripts 
T and H to denote transpose and complex 
conjugate transpose, respectively. Notation d 
denotes complex conjugate of a complex number 
a. 

Consider the state feedback control law 

U* = -K.x,. (3) 

Let be obtained by using the LO optimization 
technique with the performance index 

X 

iulRu,+x[Qx,), ( 4 ) 

k -0 

where R ^>0 and Q = C. Then it is known that 
K, is given by 

K, = (B + B'A/B) (.S) 

where M is the positive definite solution ol the 
Riccati equation 

M =A^MA- A^MB{R^ B^MB) ' 

X B^MA + Q. (6) 

Define the loop transfer function of the optimal 
regulator loop 

H,f(z) = KAzl~A) ’B-(R^B'M«) ' 

X B^MA(zI - A) ^B, (7) 

The above control law assumes that all states 
are available for feedback. Typically, not all the 
states are measurable, and the missing states 
must be estimated from the output measure¬ 
ments yi,l:^k. There are two versions of full 
order estimates for the state x^: the filter 
estimate which is based on measurements up 
to and including the current measurement y^, 
and the predictor estimate x^/^ i which is based 
on measurements up to yk-\- Two observers 
commonly used in discrete-time systems are 
described as follows (Maciejowski, 1985; Fr¬ 
anklin and Powell, 1980), 

2.1. Predicting observer 
The predicting observer is described by 

+ flu* + X,,(.v* - Ci*/*_i), (8) 

where the observer gain is chosen so that 
A - KpC is stable. When the predicting observer 
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is used, the control law (3) is replaced by 

W/c (9) 

This results in the predicting compensator 

mz)=-F„{z)Y{z), ( 10 ) 

where 

F,iz) :=K,[zI-A + BK, + K^C] 'K^. (n, 

The predicting compensator is appropriate when 
the compulation time is not negligible, and we 
cannot use the current measurement of the 
output to update the control. 

2.2. Filtering observer 
The filtering observer is described by 

+ i/k Ax _I -l" Bujf "t" A f-fA/A-1), 

( 12 ) 

^k/k= Xk/k^\Y Kfiyt - Cx^,i^ ,), ( 13 ) 

where the observer gain is chosen such that 
A - AK^C is stable In this case, the control law 
(3) is replaced by 

= (14) 

This results in the filtering compensator 

U{z)= --Ff{z)Y(z), (15) 

where 

/V(z) •= z/C. [zl - (/ ~ K,C){A - Bk\ )r'AV 

( 16 ) 

The filtering compensator is appropriate when 
the computation lime is negligible, and we can 
use the current measurement of the output to 
update the control. 

When equation (12) is used alone, we call it 
the ‘predicting" version of the filtering observer, 
since it has the same form as a predicting 

observer with gain — AKf. 

The Kalman filter is a filtering observer with 

Kj = PC'(CPC ' -f V) (17) 
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where P is the unique positive semidefinite 
solution to the dual algebraic Riccati equation 

P = APA' -APC^iCPC^ ^Vy^CPA' -h W, 

( 18 ) 

and W > 0 and K > 0 are fictitious process and 
measurement noise covariance matrices, respec¬ 
tively. Define the loop transfer function of the 
Kalman filter loop (see Fig. 1) 

= c;(zy - A) ^AKf = C(z/ - A) ^ 
XiAPC^CPC' + V)-\ (19) 

Note that the optimal slate feedback described 
by (l)-(6) and the optimal observer described by 
(12)-(13) and (17)-(18) are not dual, but that 
the optimal state feedback loop transfer function 
//,f(z) and the optimal observer loop transfer 
function are dual. Hence, the optimal 

state feedback is dual to the ‘predicting’ version 
of the optimal observer described by (12) and 
(17)-(18). 

Guaranteed feedback properties of the 
discrete-time optimal regulator and Kalman filter 
loops also exist (Safonov, 1980; Shaked, 1986; 
Anderson and Moore, 1990). These properties 
can be derived from the discrete-time Kalman 
equality (or its dual) 

\I + B'<t>(z-')'Kr\{B'^MB + R){1 + 

= R + B^*t>{z-YQ^(z)B, (20) 

where ^(z):={zl — A) These guaranteed 
properties are not as good as in the continuous 
case. In particular, there is no infinite gain 
margin. Of course, this makes sense because the 
zero at infinity is outside the unit circle and thus 
in the unstable region. 

We now introduce some notation. First, let us 
define the following transfer function matrices. 
Observer loop transfer function: H{z) = C{zl - 
A) 'AKf. 

Observer sensitivity function: .^„^,(z) = [/ + 
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Output feedback loop transfer function: L(z) = 
C(z)f(z). 

Output feedback sensitivity function: 5„u,(z) = 

[/ + L(z)]-. 

F{z) = Ff{z) or Fp{z), depending upon which 
observer-based compensator is used. The follow¬ 
ing definitions are also needed in the subsequent 
development. 

Definition 2.1 (Davison and Wang, 1974). The 
transmission zeros of system (l)-(2) are defined 
to be the set of complex numbers a which satisfy 
the following inequality 

[aJ-A 


Factor out the nonminimum phase zeros of 
Giz) one at a time as follows: 

Giz) = Cp,iz)GUz). 

G]„{z) = C„Xz)GUz). 


G‘-\z) = C^X^)GUz), 

where G^z) := CUz/C:=C, and 
for / = 1, 2, - . . , 




= C ~' - {^^^)fl,^r{A +1). (25) 

\ a, 1 / 

The vectors rj, and are solutions of 


The multiplicity of a is equal to its algebraic 
multiplicity as defined in MacFarlane and 
Karcanias (1976). 

Definition 2.2 (MacFarlane and Karcanias, 
1976). Let a be a transmission zero of G(z), .so 
that 

■'']= 0 , ( 22 ) 

has a solution with = 1. Then x is called the 
left state zero direction and w is called the 
output zero direction. 

Definition 2.3. The system (l)-(2) is said to be 
nonminimum phase if at least one of its 
transmission zeros is outside the closed unit disk 
in the complex plane and such zeros are called 
nonminimum phase zeros of the system. 
Otherwise the system (l)-(2) is said to be 
minimum phase. 

Note that we call a system minimum phase if it 
has no finite zeros outside the unit circle. 

It is well known that nonminimum phase zeros 
of a transfer function G(z) may be collected into 
a stable all-pass factor, i.e. G{z) may be written 
as 

G(z) = C„(2)Gj2). (23) 

where CJz) is stable, has zeros coinciding with 
the nonminimum phase zeros of G( 2 ), and 
satisfies Ca(z~ ^)^Ca(z) = I. The transfer function 
G^(z) is minimum phase and is termed the 
minimum phase counterpart of G( 2 ). The 
following lemma gives a constructive procedure 
for performing this factorization. 



aj ~ A 


-C 


i~i 

m 


-B 

0 


= 0 , 


(26) 


with = 1. Then the factorization (23) is 
given by G„Xz) = G'J,z) and C„(z) = 
GaXz)CaXz) ■ ■ • C„,(z). 


Proof. Straightforward by applying a bilinear 
transformation to the continuous-time results of 
Zhang and Freudenberg (1990). Details can be 
found in Appendix B of Zhang (1990). 


The above procedure yields a formula for the 
minimum-pha.se/all-pass factorization which is 
useful for our purpose. Other expressions for 
this factorization include the standard inner- 
outer factorization (e.g. Francis, 1987) and a 
factorization due to Shaked (1990). 

From the above lemma we can see that for a 
given transfer function G{z) = C(zl — A)~^B 
there always exi.st C„, and C„(z) such that (23) 
can be written in the following form; 

G(z) = CJz)C^izl-A) 'B. (27) 

We shall now calculate the limiting value of 
the optimal regulator gain as the control cost 
approaches zero. 


Theorem 2.1. Consider a nonminimum phase 
system (A, B, C) and its minimum phase 
counterpart {A, B, C„), with C„, as calculated in 
Lemma 2.1. Let fC, be the feedback gain 
calculated according to (5) and (6) with 
B = (l/<7^)/. Suppose that CB = CAB = • ■ • = 
CA'-^B = 0 and det (CA'’’fl) 0. Then K,^ 
(C„A‘-'By'C„.A' 


Lemma 2A. Given the transfer function G( 2 ) = 
C{zl — Ay^B that has / nonminimum phase 
zeros fl|, a 2 , . . . , ai, (including multiplicities), 
the factorization (23) can be obtained using the 
following iterative procedure. 


Proof Use duality and Theorem 3.1 of Shaked 
(1985). Note that since C^iz) is bicausal, the 
structure of G( 2 ) at infinity is the same as that of 
G„fz) which implies that the conditions 
CB^CAB^ = det(C/l' 



355 


Discrete-time loop transfer recovery 


0 are equivalent to the conditions C„,J3 = 
C„AB = ■ ■ ■ = C„A'' = 0 , det {C„,A' 'B) =/= 0 . 

Also note that the assumptions in Shaked (1985) 
that A is stable and that the system {A, B, C) 
has no poles or zeros at the origin can be 
removed (for details, see Appendix A of Zhane 
(1990)). 

3. LOOP TRANSFER RECOVERY WITH 
NONMINIMUM PHASE ZEROS 
The development of this section is based on 
the discrete-time loop transfer recovery proce¬ 
dure proposed by Maciejowski (19S5), Wc first 
briefly state the procedure. Consider the 
observer-based output feedback system shown in 
Fig. 1. Suppose that the plant transfer matrix is 
minimum phase and det(r^)^(). The proce¬ 
dure is to design the observer loop H(z) = 
C{zl — A) ^AKj to meet design specifications 
(with augmented dynamics if necessary) and 
then to recover this loop asymptotically by 
tuning the state feedback gain K, which is 
obtained from (5) and (6) with R~{\/q^)f. If 
the filtering compensator t'f(z) is used, then 
perfect recovery can be obtained asymptotically 
at the plant output, i.e. as r/— 

G{z)f}(z)-^H(z), (28) 

(29) 

Here, as in the sequel, convergence of transfer 
functions is pointwise in 2 . 

There are three crucial assumptions in the 
above LTR procedure, namely, the require¬ 
ments that the plant is minimum phase, that 
dct(Cfl)^(), and that the filtering compensator 
is used. Wc study next what happens wheri the 
minimum phase requirement is not satisfied. The 
use of the predicting compensator and the case 
where det(Cfi) = () will be discussed in the 
subsequent sections. The following theorem 
reveals the asymptotic behavior of the loop 
transfer and sensitivity functions when the plant 
is nonminimum phase. 

Theorem 3.1. Suppose the plant G{z) is factored 
as in (27), det (Cfl) ^0, and the LTR procedure 
is applied using the filtering compensator />(-)• 
Then, as the asymptotic behavior c^f the 

filtering compensator is given by 

fKz)-(C;,cI>(2)B) ‘ 

x[/-/y„(z)(/ + w(z)) ‘C(2)]'' 

+ (30) 

where H„(z) := C„{zl - AY'AK, . Conse¬ 
quently, the asymptolic values of the loop 
transfer and the sensitivity functions are given by 

Liz)^[H{z)-FAz)\[I +t:(z)\ ( 31 ) 


5.u.(^)--U + £(^)]5nh(z), (32) 

where E{z):={C — CA 2 )C„]{zl — A)~'AKf is 
called the erroi furction. 

Proof. See Appendix A. 

Clearly, when the plant is minimum phase, 
(31) and (32) reduce to the known icsults (28) 
and (29), and the recovery is seen to be perfect. 
If the plant is nonmininium phase, on the other 
hand, then the quality of recovery at a given 
frequency depends upon the size of the error 
function E{z) at that frequency. An inspection 
of the error function reveals that if, in addition 
to the assumptions of Theorem 3.1, the observer 
loop has the same nonminimum phase zero 
structure as the plant, i.e. if 

Hi^) = CAz)C„,(zI-A) ^AKf. 

then perfect recovery can be obtained. Also one 
can see from (30) that, in the minimum phase, 
case, the compensator constructs an inverse of 
the plant and substitutes the observer loop and 
that, in the nonminimum phase case, an inverse 
of the minimum phase counterpart of the plant 
is constructed. This provides a clear picture of 
hov\ the recovery process works and comple¬ 
ments the pole-zero cancellation explanation 
given by Maciejow’ski (1985). Next, we give a 
formula for calculating the error function E(z) in 
terms of the nonminimum phase zero.s and their 
associated directions. 


Lemma 3.1. Let G(z) have / nonminimum 
phase zeros o,, a., . . . , Define Cu( 2 ) -= L 
C*(z) = C„(z)r„,(z) • • • Q(z), £'-(z) = 0. and 
EHz) = {C Ct(z)Ct,)UI-A) 'AKj for k = 
1,2./. Then 


E(z) = E'(z)^'^ - 

k- \ ZUf^ - 1 

X ct '(z)tY^[AA}, (33) 


with and r„^(2) defined by (24) and (25), and 
and rj^ defined by (26). 


Proof. See Appendix B. 

The following theorem states that perfect 
recovery can in fact be obtained in output 
directions that are orthogonal to those associated 
with the nonminimum phase zeros. 


Theorem 3.2. Suppose that plant G{z) has / 
distinct nonminimum phase zeros a,, U 2 . ■ . 

and that det (Cfl) ^ 0. Let , w., . , hv 

denote the corresponding output zero directions 
defined by (22). Define W to be the subspace of 


win 
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C™ spanned by Wi, ^ 2 , . . . , w,, and to be its 
orthogonal complement in C™. Let be a 
projection onto W ‘. Then if the LTR procedure 
is applied using the filtering compensator Fj(z), 
we have 

(34) 

as ^ ► 00 . 

Proof. See Appendix C. 

An analogous result may be obtained for the 
dual version of the continuous-time recovery 
result in Zhang and Freudenberg (1990). 

For systems that have a single (real) 
nonminimum phase zero, more insightful ex¬ 
pressions can be obtained. 

Corollary 3.1. Consider a plant G(z) that has 
only one nonminimum phase zero, and assume it 
is at z = a. Suppose that det(CB)^0 and that 
the LTR procedure is applied using the filtering 
compensator /y(z). Then the sensitivity function 
of the system satisfies 

[/ + (35) 

L 2fl — 1 J 

as where w^H{a) = w^C{al — A)~^AKf 

and w is the output zero direction determined by 
(22). If the plant is scalar, then 

[ 1 + - - \m.a) S„^(z), (36) 

L sa - \ 

as q~^^. 

Proof. It follows from Lemma 3.1 by setting 
/ = 1 that 

Eiz) = - - 

za - ] 

The limit (35) follows by noting ^(AKf = 
r]lC{aI ~ Ay^AKf^ and r/, = w. For scalar 
systems, w = 1. Hence (35) reduces to (36). 

One can see that if the target feedback loop 
H{z) also has a zero at z = a, with output zero 
direction w, then perfect recovery is possible. 
Otherwise there will be an unavoidable error in 
recovery, whose size depends upon the value of 
\\w^H{a)\\, and thus upon the location of the 
zero relative to the frequency range over which 
the target feedback loop gain is large in the 
direction w. It is clear from the above results 
that if the nonminimum phase zero is far outside 
the bandwidth of the target feedback loop, then 
good recovery can be obtained. This confirms 
the observation by Maciejowski (1985). Also, 


there appears to be a tradeoff between the 
feedback properties of the target feedback loop 
and the quality of recovery. Further quantifica¬ 
tions of this tradeoff may be performed using 
Poisson integral relations similarly to the 
continuous case (Zhang and Freudenberg, 1990). 
This has recently been done by L6on de la Barra 
(1991). 

4. USE OF THE PREDICTING COMPENSATOR 

The recovery procedure in the previous 
section assumes negligible computation time, so 
that it is possible to implement the filtering 
compensator. This may be impractical, since the 
time required to compute the control signal is 
not always negligible. If this is the case, then the 
feedback law (9) has to be used, resulting in the 
predicting compensator /^(z) with K^,=AKf. 
Although it has been shown (Maciejowski, 1985) 
that in this case perfect recovery cannot 
generally be obtained, it is of interest to 
investigate what happens if we try to apply the 
recovery procedure using the predicting 
compensator. 

Theorem 4.1. Suppose that the plant ii{z) is 
factored as in (27), that det(Cfl)^(), and that 
the LTR procedure is applied using the 
predicting compensator F^(z) with K^=AKf. 
Then, as q—^^, the asymptotic values of the 
loop transfer and the sensitivity functions are 
given by 

L(z)^[H{z) - E,(z)][I + f,(2 )]-'. (37) 

^.nu(^)'-[/ + t,(z)R.(z), (38) 

where the error function is given by 

E^{z):=z~^[zC-CAz)C\A] 

X (z/ - A) AKf. (39) 

Proof. Similar to proof of Theorem 3.1, 
straightforward calculation of Cf(z)F^(z) by 
substituting {C,„B)~^C\r,A for K,. 

The error function /?,,(z) can be calculated 
using the formulas for C„(2) and C„, that we 
developed in Section 2. Note that the above 
result still holds without assuming Kp = AKj, as 
long as the target observer loop H{z) is 
interpreted as C(z/ — A)''^Kp in such a case. 

Corollary 4.1. Suppose, in addition to the 
assumptions of Theorem 4.1, that the plant G(z) 
is minimum phase. Then the error function 
Ep{z) reduces to 

Ep(z) = z^yAK^. 


(40) 
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It follows that if the observer gain satisfies 

CAK^^i), ( 41 ) 

then perfect recovery can be obtained. 

As noted by Maciejowski (1985), perfect 
recovery is generally unattainable with the 
predicting compensator even for minimum phase 
plants. 7'he difference between the results 
obtained in this case for the two compensators 
Ff(z) and f,,(z) can be explained as follows. 
Note first that the assumption Act 
implies that the plant has an inherent one-step 
delay in all channels. Suppose that the observer 
loop also has a one-step delay. Then the only 
way that this loop can be perfectly recovered by 
the loop with observer-based compensator is for 
this compensator to have a direct feedthrough 
from the plant output to the control input, i.e. to 
have a proper inverse. An inspection of ( 11 ), 
(16), and Fig. 1 reveals that this is possible only 
if the filtering compensator is implemented. On 
the other hand, suppose that the observer gain 
satisfies (41), so that the observer loop has (at 
least) an inherent two-step dela>. Then it 
becomes potentially possible for the observer 
loop to be recovered using the predicting 
compensator. Unfortunately, requiring such a 
two-step delay would result in a less satisfactory 
observer loop than would be the case it a shorter 
delay were present. This situation is analogous 
to that studied in the previous section, where we 
saw that only nonminimum phase (and therefore 
inferior) observer loops could be recovered. 
Hence, we see that when the computation time 
is not negligible and the predicting compensator 
has to be used, perfect recovery is possible only 
it the target observer loop is constrained to have 
an extra step delay. This observation is 
consistent with that of Ishihara and lakeda 
(1986). 

It is of interest to study how recovery lakes 
place when the filtering compensator is imple¬ 
mented to recover an observer loop with (at 
least) a two-step delay in all channels. From the 
above discussion, we know that such an 
observer loop satisfies CAKf=0. Since the 
asymptotic control gain satisfies K, —> 
(Cfl)”'C/ 4 , it follows that the condition 
CAKf =0 implies that Therefore, as 

the feedback control law 

uu = = i-f A:,(,Yv-a\* i)l 

—> 1- 

Hence, asymptotically the optimal state 
feedback control law docs not utilize those states 
that are updated using the current output 
measurement. Therefore the direct icedthrough 
link in Fig. 1 is not used. 


The role of the assumption that det(C^)=^0 
is also now clear. If it is not satisfied, then the 
plant would have at least a two-step delay in 
some channels, and it would be impossible for 
the observer-based output feedback loop to 
asymptotically recover an observer loop with a 
one-step delay in all channels. 

5. LOOP TRANSFER RECOVERY WITH TIME 
DELAYS 

In the previous sections, the assumption is 
made that Act(CB)i^i). However, this assump¬ 
tion will be violated for systems with at least a 
two-step time delay in some channel of the 
plant. It has been observed (Maciejowski, 1985) 
that perfect recovery cannot generally be 
obtained in this case, but it would be of interest 
to see what happens when the LTR procedure is 
applied anyway. In the following, we consider 
application of the LTR procedure to a class of 
plants whose delay structure is characterized by 

CB^CAB=- =CA^ ^B - 0, det (CA^ '/!) 0, 

(42) 

where / ^ 2, i.e. plants that have a uniform /-step 
delay in all channels. This problem has been 
studied by Kinnaert and Peng (1990) for 
minimum phase systems using the predicting 
c< mpensator in the LTR procedure. The 
following results extend and complement those 
of Kinnaert and Peng (1990). 

'rheorem 5.1. Suppose that the plant (j(^) is 
factored as in (27) and has a uniform /-step delay 
as characterized by (42). Assume that the LTR 
procedure is applied to the system using either 
the filtering compensator Ffiz) or the predicting 
compensator /,,(r) with Kf,-AKf. Then, as 
< 7 —the asymptotic values of the loop transfer 
and the sensitivity functions are given by 

L{z)-^\H{z) - EAz)\[l + EAz)\ (43) 

.S'„.„(z)-|/ + £,(7)].S;,„(z), (44) 

where H{z) ~ C{zl - A) 'AK/ . The error func¬ 
tion fe,,(z) is given by 

EAz)^(c-J_-^CJz)C„.A‘ ') 

x(zJ -^A) 'AKf, (45) 

when the filtering compensator is used, and is 
given by 

£,,(7) = (r-^,c,(z)r„,/4') 

x{zl-A)'K„, (46) 

when the picdicting compensator with K,, =AKj 
is used. 
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Proof. Direct calculation of the loop transfer 
function using the cheap control gain given in 
Theorem 2.1. 

Notice that (46) also holds for a general 
predicting compensator with K^^AKf. Howe¬ 
ver, the definition of H{z) in (43) and (44) has to 
be changed to H(z) = C{zl - Ay'K^ in such a 
case. If the plant is minimum phase, then (46) 
reduces to 

EAz) = Hiz) - p CA'izI - A)-'K„. 

Substituting the above into (43) results in the 
following limiting value of the loop transfer 
function 

Liz) = j, CA'izI ~ A)-'K^[I + Hiz) 

--,CA'izl-A)-'K„]-' 

z 

= C(zl - Ay I + 2 '- ‘C/C,, 

+ ■ ■ ■-^CA'~^Kf,)~\ 

which gives the same result as Theorem 3 of 
Kinnaert and Peng (1990). 

One can see from Theorem 5.1 that the 
recovery error E,j(z) is a function of the 
nonminimum zeros, the time delays, and the 
observer loop gain (Kj or Kp) which implies that 
all those factors can affect the quality of 
recovery. For a given target observer loop, the 
recovery error can be calculated, a priori, using 
(45) or (46) and our formulas for C„( 2 ) and C,,,, 
to determine how much and at which frequencies 
it can be recovered. Since E^iz) is generally 
nonzero, perfect recovery cannot be obtained in 
general and the quality of recovery at a certain 
frequency will depend upon the size of the error 
function at that frequency. However, if the 
observer loop to be recovered meets certain 
constraints, then perfect recovery is possible. 
First, we consider the case where the filtering 
compensator is used. It follows from (45) that if 
the observer loop transfer function //(z) = 
C{zl - A) ^AKf^ i.e. the transfer function to be 
recovered, satisfies 
{\)H{z) = CAz)CMl-Ar^AKf, 

(2) CAKf = CA^Kf = . • . = CA^^^Kf = 0. 
then the error function Ea{z) will be identically 
zero. In other words, if the observer loop is 
chosen to have the same nonminimum phase 


structure and at least as many steps of time delay 
as the plant, then it can be perfectly recovered 
using the filtering compensator. 

For the case where the general predicting 
compensator is used, if the observer loop 
Hiz) = Cizl - A) 'Kp satisfies 
i\)Hiz) = C\iz)C,„izI-Ay'Kp, 

(2) CKp = CAKp = ■■■ = CA'~'Kp = 0, 
then one can show that 

Hiz) = C„iz)C,„A'izI - Ay'Kp, 

which, by (46), implies that the recovery error 
Ed(z) is zero. Hence, to have perfect recovery 
using the predicting compensator, the target 
observer loop has to have the same nonminimum 
phase zero structure as the plant and have at 
least one more step of time delay than the plant. 

From the above discussion, one can see that 
the nonminimum phase plant zeros, plant time 
delays, and controller computation delays all 
impose constraints upon the class of recoverable 
target loop transfer functions. Since the recovery 
error is a function of the total time delay in the 
feedback loop, it follows that time delays in the 
plant and controller have the same effect on the 
loop transfer recovery. This observation is 
consistent with that of Kinnaert and Peng 
(1990). 

Finally, note that our results concerning LTR 
using the predicting compensator could also be 
applied, via duality, to the problem of recovery 
at the plant input. 


6 EXAMPLh 

To illustrate some of our results, let us apply 
the LTR procedure to a nonminimum phase 
plant. Consider a sampled data system composed 
of a zero-order hold, a linear time-invariant 


continuous system G(5) = 


and a sampler 


Cv+l) 

in series (see Fig. 2). 

The corresponding pulse transfer function with 
sampling period T is given by Astrbm et al. 
(1984). 


where 


G(z) = 


-f biZ + 

(z-e 




u(kT) 


ZOH 


G(s) 






y(kT) 


Fki. 2. Sampled data system 
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This transfer function has a real nonmininium 
phase zero for 0<r< 1.8399. According to 
Astrom et al. (1984), nonminimuin phase 
sampling zeros usually lie near the negative real 
axis when the sampling period is sufficiently 
small. For this example, the nonminimum phase 
zero approaches —3.732 as the sampling period 
goes to zero. It follows from Corollary 3.1 that 
good recovery may be obtained for this example 
when the sampling period is small, since the 
nonminimum phase sampling zero is expected to 
be outside the bandwidth of the target observer 
loop. We shall see in the following that this is 
indeed the case. 


A realization of Ct( 2 ) is given by 



0 

1 0 


if 

A = 

0 

0 1 

. B 

0 



"3e 3e-'_ 




C^[h, b. /?,]. 

The observer loop is designed using (17)-(18) 
with weightings V = \ and W - BR\ Results 
obtained by applying the LIR procedure with 
the filtering compensator are given in Figs 3-9. 
P'or the first set ol figures (I’igs 3-5), we choose 
the sampling period 7 =0.05 which results in a 
nonminimum phase zero at z = —3.5949. The 
plots demonstrate that the sensitivity function 
-^.lui(^) indeed converges to the function we 
predicted in Theorem 3.1. The second set of 


figures (Figs 6-9) shows how the length of the 
sampling period affects the location of the 
nonminimum pha.iw zero z = a which, in turn, 
affects the quality of recovery. First, we notice 
that the nonminimum phase sampling zero 
moves away from the unit disk along the real 
negative axis as the sampling period decreases. 
This implies that the nonminimum phase 
sampling zero gets farther away from the 
bandwidth of the target loop transfer function as 
the sampling period gets smaller. In these figures 
we plot the recoverable sensitivity function (solid 
line) and the desired target sensitivity function 
(dashed line). As expected, we observe that the 
recoverable sensitivity function approaches the 
target sensitivity function as the sampling period 
decreases. 

7. rONCLUSlONS 

In this paper, we have studied applications of 
the discrete-time LTR procedure to plants with 
nonminimum phase zeros and time delays. 
Explicit expressions are derived for the asy¬ 
mptotic behavior of the sensitivity function and 
loop transfer function resulting from the LTR 
procedure. The results are given for both 
filtering compensator and predicting compen- 
So.tor cases. For a given target loop, these 
expressions show a priori, how much, at which 
frequencies and in what directions the loop can 
be recovered. From our results, we cun sec that 
the LTR procedure, if used properly, may still 
be an effective design approach for discrete-time 
systems with nonminimum phase zeros and lime 
delays. The plant nonminimum phase zero and 
time delay structures and computation delays 
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essentially impose certain constraints upon the 
recoverable target loop transfer functions. In this 
paper, it is assumed that the plant is square and 
has uniform delay in all channels; further 
research is needed to extend the results to cases 
where those assumptions are not satisfied. The 
general results on singular discrete-time filtering 
problem by Shaked (1985) may prove useful in 
this regard. 
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APPENDIX A: PROOF OF THEOREM 3 1 . 

By Theorem 2.1, we know that iC\„P) as 

( 7 —►ac. Now evaluate the open loop transfer function L{z) in 
the limit as Define 4>(z) = (z/-A) Fj{z) can be 

rewritten as 

/)( 2 ) = zA:,f/ + <^(z)7^A:, E<P{z)K^C(A - BKJ] '<Piz)K^ 
= zX.|/-l-(/+4>(z)X/A) ’<t>(z)BA:, 

-(/ +4>(z)X^CA) >(z)A 7 rfl/Cj ‘ 
x{l-t-ip(z)KfCA) 

Note thal /7,„(z) = f',,,4>(z )A/C,. Since K^—*(C\„B) 
as it follows that 

/y(z)-*z(C^fl) ‘C,„A[yF(/-f (D(z)A',6A) 'aj(^)fl 
X(C,„B) 'r„A - (/+tP(z)A:,(:’A) 'ft>{z)K,CB 
'(/-^ 4>(^)A7CA) 'd>(z)A', 

= z(r^fl) V-^Cr.AU + ^iz)K,CA) ‘(l>(z)/f(c;,/f) ' 
-HJzKI-^Hiz)) 'CB{C\,,B) ■] 'HJz){l3-Hiz)) ' 
= z(c;,fl 3 - C\„A{I ~iPiz)K^CA{l 3 4>(z)A7GA) ')4>(z)/^ 
-H„,(z)(I^H{z)) fJB] ^HJz){IeH{z)) ' 

- z\C^B F C.A4>(z)/I - HJz){I ^ Iliz)) 'C>1c^(^)B 
-IIJz)iI-^H{z)) T/?) '/y^,(z)(/3/y(7)) ' 

= z{CJI^Ait>(z))B-HJz)(I^IIiz)) ' 
xC{f^A0{z))B\ 'H„,{z){l4-H{z)) ' 

= \C\„^{z)B-HJz)U^H{z)} 'G4>(z)/i| ' 

xHJz)il^H{ 2 )) ' 

Note that r<t>(z)7t = ( ^j(z)G^<I>(z)fi, we obtain that 
Ff{z)^{(\,M^)B) *[/-- /C,(z)(/ + //(z)) ‘(;(z)l ' 
x//^,( 2 )(f+ H(z)) 

This completes the proof of (30). By definition of A(z), we 
have that 

xC„U)| 'lUz){l+H{z)) ' 

- ( Jz)H„,(z)\l + H(z) - r„(z)//,„(z)) ' 

= [«(z)-/-:(z)|[/ + p(z)| 

as This gives (31). .Since .V,„„(2) = |/+/.(z)) (.32) 

follows readily. 


APPENDIX B: PROOF OF LEMMA 3 1 

First, let us derive a recursive formula for E*'{z). Using the 
factoriZHlion formulas (24) and (25), we get 


C~C*(z)Ct 

= (C - Ct ‘(z)ct ') + r; '(z)f ) 

. \ fljL 3- 1 / 
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^■^r)U7-iHar+i 


_ “J 


-‘(c-c; '(z)C'i,') + - ■'_ ('*'(,1 

x((z ♦ l)(fl(, + l)n*r)rct, ' +(zat - l)K + 

xM +/)-(z + I 1(0*0, - l)T),i,(/4 -I- /)| 

= {C-Ct '(r)C’*, '1 

(Zfl*-l)(fl, f l)(o, + l)‘'' 

X 1(2 + l)(fl, + l)(fl,/ - /() + (20, - l)(a, + 1) 
x(A + I)- (2 + l)(fl,n, - l)(/l + /|| 

= (t’-r; \z)Cl ') + "*“*- 'r; - A) 

za,^ - 1 

In the above denvaiioii we used the lact that 
^ Now by definition of F*(^) we have 

i‘(2)-(f-c;(iK‘,)(z/-/t) 'AK, 

-(f’-f i '(2)(;„ ')(Z/- A) 'AK, 


. "‘ii - ' 

za. ~ I 


Ct '( 2 )r,,i*MA, 


-L‘ + 'U),u1;!.AK, 

Applying (he above retursiNC formula and noting 
we obtain the result 


APPENDIX C PROOI OF TUFORLM M 
From Theorem ^ I we can see that it suffices to show that 

u^biz) 0 Vn r iV 

First, we would like to show by induction that the vector 
TJ^ as dehned in (26) lies in the subspace W for 
A:- 1.2. ./ 


For k-U It IS obvious since = w, by definition 
Suppose that it is true for /: = /-!, i.c e IV for 
A. == 1, 2, , f - I We need to show that it is also true for 

A = / By Definition 2 2 we have that 

w[C{a,/-A) = wfG(c//)-0 (Cl) 

From I emma 2 I we know that 

0-(fl,) = Cl '(a,)6'7(fl,). (C2) 

where ‘( 2 ) is as defined in Lemma 3 1 It follows from 
(C 1) and (C2) that 

(r3) 

By definition of r}, (see (26)), 

rircL'ia,l-A) = (C 4) 

Since a, is a distinct zero, the left nullspace of '(a,) is 
one-dimensional Thus, (C 3) and (C 4) imply that there 
exists a constant c, such that 

rj! = c,wJCi '(«,) 


Hence, we have 



(r5) 


After expanding the the right hand side of (C S), one can see 
that T); lies in the span of {t/,. r/^ . t// ,, w,] over r"* 

Since ?),. i)>, T}f , arc in W by assumption, wc can 
conclude that t]/ e W 

Now wc have established the fact that vectors 
r/|, r/,, , T], belong to the subspace W Hence, if a vector 

I orthogonal to W, it must be orthogonal to 7 ;^, , r}, 

Prom the above arguments, one can conclude that u e IV^ 
implies that u is orthogonal to , i}, This fact 

together with 1 emma 1 1 implies that 

u'E{z) = i)k 

which completes the proof 
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Direct Control Design in Sampled-data 
Uncertain Systems* 

ODED YANIVt and YOSSI CHAITt 

A new direction for Z-domain design of single input I output, sampled-data 
uncertain systems is developed within the setting of Quantitative Feedback 

Theory. 

Key Word§— Control system synthesis, robust control; sampled-data systems 


Abstract —This paper introduces a new direction for design 
of single inpul/outpul, sampled-data uncertain systems 
within the setting of Ouantilative Feedback Theory (OFI ). 
The control system consists ol a continuous-time uncertain 
plant, a discietc-time conirollei connected via a sample-and- 
hold device and a discrete-time prclilter (or rcicience 
tracking The class of problems considered here includes 
robust stability, robust gam and phase margins, robust 
discrelc-liine tracking and robust coniinuous-time tracking 
The new direction involves a OFI technique where control 
design IS performed directly in the Z-domain It is shown 
that OfH bounds can be computed m the /-domain trom a 
set of quadratic inequalities A numerical example illustrates 
the salient leatures of the developed technique 

INTRODUCTION 

In THIS PAPLR wc consider a class of sampled-data 
control systems (Fig. 1) where the analog plant 
belongs to an uncertain set The uncertainties 
can be both parametric and non-paramctric. 
however, we assume that the set can be 
reasonably approximated by a finite number of 
plants 

n<oc. 

The sample-and-hold device is assumed to be 
zero-order. Both the prefiller, F{z), and the 
controller, G{z), are implemented in discrete- 
time. 

In general, design of a sampled-data control 
system can be performed using either a 
discrete-time or a continuous-time approach. 
The discrete-time approach employs extensions 
of various continuous-time control design tech- 

* Received 5 September 1990; revised 25 March 1991; 
revised 9 June 1992; received in final form 30 June 1992 The 
original version of this paper was not presenicd at any IFAC 
meeting. This paper was recommended tor publication in 
revised form by Associate Editor P. Doraio under the 
direction of Editor H. Kwakernaak. 

t Electrical Engineering-Systems Department, lei Aviv 
University, Tel Aviv 6997K, Israel 

t Mechanical Engineering Department, University of 
Massachu.setts, Amherst, MA 01(X)3, U.S.A. 


niques to the discrete-time setting. Naturally, 
control design objectives can be considered only 
in terms of discrete-time transfer functions. The 
continuous-time approach can consider 
continuous-time objectives (eg. inlersample 
behavior). Several directions have been sug¬ 
gested in order to make direct continuous-time 
design possible for this time-varying system. One 
direction employs frequency domain bounds on 
the time varying functions in Fig. 1 via conic 
sectors (Thompson et ai, 1983, 1986). A second 
direction is based on the following well-known 
approximation: the frequency response of a 
discrete-time signal is ''equal”, up the Nyquist 
frequency, to the frequency response of the 
continuous-time signal divided by the sampling 
time (e g. Morari and Zafiriou, 1989). A third 
direction, referred to as PCT (pseudo- 
continuous-lime), applies Fade approximation to 
replace the sampler and zero-order-hold by a 
rational continuous-time transfer function 
(Houpis and Lamont, 1985). A recent direction 
focuses on computation of induced norms of a 
sampled-data system (Leung et ai, 1989; 
Kabamb and Hara, 1990; Hara and Kabamba, 
1990; Chen and Francis, 1991). 

This paper develops a new direction for design 
of single input/output, sampled-data uncertain 
systems within the setting of traditional OFT\ 
Recent extension of the continuous-time OFT 
technique to sampled-data systems were based 
on mapping of the OFT problem into the 
IV-domain using bilinear transformation (Sidi, 
1977; Horowitz and Liao, 1986; Tsai and Wang, 
1987). They were tested using several interesting 
numerical examples (e.g. Schneider (1986) and 
Hamilton et al. (1989)). The technique de¬ 
veloped ill this paper is based on a direct 
Z-domain procedure, which resolves the warping 
problem due to the bilinear transformation. In 
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^F(s) 


R(5) 



T 


Fig. 1. A two dcgrce-of-freedom sampicd-data control 
system. 

addition to the robust stability and robust 
discrete-time tracking problems investigated in 
Sidi (1977), Horowitz and Liao (1986) and Tsai 
and Wang (1987), we also consider the robust 
gain and phase margins and the continuous-time 
tracking problems. 

This paper is organized as follows. A short 
discussion on the notation is presented. It is 
followed by a statement of the specific control 
design problems. Next, we develop the OFT 
design technique in the Z-domain. Finally, a 
numerical example is presented along with 
relevant discussions on: OF!' bounds at different 
sampling times, correlation between QFI 
bounds for discrete-time and continuous-time 
tracking problem. 


NOTATION 

In Fig. 1, T denotes the sampling time and 
(i)^ = 2jtlT denotes the sampling frequency. The 
output of the A/D converter, modeled mathe¬ 
matically as an impulse modulator, fed by the 
signal jc(r) is a string of impulses 

x*{t)=^x{kT)6{t-kT). 

The Laplace transform of this sequence is 

X*{s)=j^ x{kT)t~^'‘’, (I) 

k 0 

while the periodicity of gives 

= E X(s-jkw,). 

i k 

Using the transformation 2 = e’^ X*{s) is the 
Z-transform of Jt(f), X(z) ^ Z{{x(()}. The D/A 
converter is modeled as a zero-order hold, which 
constructs a piecewise continuous-time signal 
from discrete-time signal and has the transfer 
function 

1 

Z0H(5) =-. 

A 

For simplicity, we use the notation 
PzoH(s)^P(s)ZOHis), 


Using equation (1) and z=e'^, we have 
Pzoh(z) = Z^~^{ZOH{s)P{s)} for each P{s) e 
R*{s) = Zie^^{R{s)}, Y*{s) = Z.T~^{Y{s)} 
and £*(^) = £(z) (i^“‘ denotes inverse Laplace 
transform). 


STATEMENT OF THE PROBLEM 
Consider the sampIcd-data system shown in 
Fig. 1. The controller, C(z), assumed to be 
implemented directly in the Z-domain as a 
different equation. The discretized version of the 
analog prefilter F{s) is derived using the 
emulation F{z) = Z^'~^{F(s)ZOH(s)} (the re¬ 
sults in this paper also hold when F(z) is 
obtained using other emulation methods). 

The following closed-loop objectives are 
considered in this paper: for each P{s)e and 
e [0, (i)J2] (z = 

• Robust stability. 

• Robust gain and phase margins: 

• Robust discrete-lime tracking: 


\R*(joj)] 


5. ddru). 


• Robust continuous-time tracking: 


EU<o) 

Rijiu) 


^ 62(10). 


The control design problem is; given the set 
and the prefilters, F(s) and F(z), find an 
appropriate discrete controller, 0(z), to meet 
some or all of the above objectives. 


Remark 1. Robust stability implies that the 
closed-loop system is stable for each P(s)e,f. 
The inequality |1 + F^,,n(ym)G*(yfo)l - M 
related to robust phase and gain margins as 
follows. Let 1 -I- P zoii(^ )G(z) denote a fixed 
characteristic polynomial of a stable discrete¬ 
time system (that is not conditionally stable). 
Introduce the polar form: L^(yiij) = Pyin\(ja)) = 
G*{ju))lc‘^ and z = e'‘''^ (x) e |(), (x)J2]. The gain 
margin is defined as OM - 1// at A = -18(L, 
hence GM = (1 “ //) '. The phase margin of the 
system is defined as PM = 180'" -f A where A is the 
phase of L*(yw) at \L*{j(o)\ — hence PM = 
18(r-i-2 cos ' (^/2). If this objective is guar¬ 
anteed for each P{s) e then the margins are 
said to be robust. Similar relations for 
conditionally stable systems can also be derived. 
For a basic discussion on gain and phase margins 
for fixed (i.e. without uncertainty) discrete-time 
feedback systems see Franklin et al. (1990). For 
a detailed presentation of Nyquist and Nichols 
stability analysis for fixed plants see Dorf (1989). 
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THE DESIGN TECHNIQUE 
Robust stability 

The notion of robust stability amounts to 
checking stability using one randomly chosen 
nominal loop, and then demonstrating closed- 
loop stability of the uncertain system by some 
argument involving the connected nature of 
On a Nichols chart, this is done as follows; 
at each point, i, on the Nyquist contour, 
the responses of L*{j(o)^ 

Pij(t)) e fill in a neighborhood of the nominal 
response =- Pzo„„Ow)G*(y(u), P^,(j,o) e 

This neighborhood is called a template, which 
is assumed to be connected and simply 
connected. As s traverses the Nyquist contour, 
the union of these templates becomes a 
connected region, which we shall call the Nichols 
envelope. If each P{jo)) e has the same 
number of unstable poles, we can invoke the 
following Nichols robust stability criterion of 
Cohen et al. (1992). The closed-loop system is 
said to be robustly stable it and only if the 
nominal sy.stem is stable and 

0 ) e (0, (/),/2], for each pijto) e .‘f. 

If the se! is approximated by a finite number 
of plants, then it is possible that .some plant, not 
retained after the approximation, violates the 
above condition. To minimize such a possibility 
we replace the above by the more conservative 
constraint 

(i) G [0, a)J2], for crich P(j(o) e 

In essence, rather than avoiding a region defined 
by a single point, we consider a region defined 
by the classical constant magnitude curves. This 
condition is also used to define stability margins 
as follows. 

Robust gain and phase margins 

Traditionally, OFT design involves the 
mapping of the robust margins objective into 
certain bounds on a nominal loop transmission 
ms), followed by loop shaping. These bounds 
divide the eomplex-plane into two domains 
where Lo(y) should lie inside one. We now show 
that computation of these bounds in the 
Z-domain can be performed in an elegant way as 
follows. Note that Sidi (1977), Horowitz, and 
Liao (1986) and Tsai and Wang (1987) employed 
IV-domain bounds. Consider the inequality 

\i + 

0 ) G [0, (U,/2l, for each P(yw) g 


at a fixed frequency. Convert the functions into 
their polar forms 

G*Uco)^ge>^ and P^.n^Uto) ^ pt'*', 

then plug into the above inequality 

|1 -I- gpd*^'' "’I 5: p, for each P(j(v) e 

finally square both sides and rearrange in terms 
of g = |G*(ya))|, 

'^g[p cos (y -I- 6)] -I- [1 - p^] 2 0, 
yG[“2;r, 0], for each P(ycu) e (2) 

This quadratic inequality defines a map from the 
problem data into the OFT bounds. Numeri¬ 
cally, this map is evaluated as follows. At each 
fixed z on the unit circle, a fixed controller’s 
phase y, and for a fixed plant from the set 3^, the 
only unknown variable in equation (2) is the gain 
g = The two solutions of the quadratic 

equation, g„„n(y) and ^>,nnx(y)i indicate the 
allowable range of the controller’s gain, g(y), 
necessary to satisfy the robust margins objective, 
typical solutions of equation (2) and their 
interpretation are shown in Table 1. 

A OFT bound on G^ija)) is obtained by 
varying y in the range [-2;r, OJ while keeping 
the frequency fixed. Similar bounds are comp- 
for each uncertain plant yielding a set of 
bounds. The union of this set of bounds defines a 
single bound on C/*(/w) at that frequency. This 
step is repeated at other frequencies yielding a 
set of bounds on G*{j(i)). Graphically speaking, 
these bounds split the Nichols chart into two 
regions, where G*{j(i)) must lie within one. 

The final step in a OFT design involves loop 
shaping of G*{j(v) to satisfy its bounds. In 
practice, it is recommended that one actually 
perform loop shaping using a nominal loop 
transmission, L*){jo)). The bounds for L*^{j(jo) 
are simply equal to the bounds for G*{j(o) 
multiplied by an arbitrarily chosen nominal plant 
p 7 u\iii^) froiTi the set 3^. If an Lo(ya>) is found 
such that we have nominal closed-loop stability 
and L*{ju)) lies within its bounds, then the 
uncertain closed-loop system is said to have 
robust gain and phase margins corresponding to 
the value of p. 


Tabu 1. Possibu soiUTifJNs or i ouafion (2) and thpir 

INTbRPRPTATION (fl > 1) 


^rnin 


Conditions on g 

Coinplex 

Complex 

s>0 

Real, '-0 

Real, 

g>0 

Real, sO 

Real, >0 

S ~ gmi<x 

Real, >0 

Real, >0 

H “ ^min ’ ^ ^ ^ max 
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The discrete-time tracking objective 
Using basic block diagram algebra, the 
discrete-time tracking objective for the system 
shown in Fig. 1 is given by 


£:*(yw) 

R*U(o) 


F*{iio) - 


F*ijio) - F*{j(o) 


R*ij(o)\ 
1 + 




which can be simplified to 


ll-t-F 


zohOw)G (yw)|i . 

o 1 (aj) 

(JO E [0, a)J2], for each p{j(i)) e 


( 3 ) 


But this inequality has the exact form of the 
inequality arrived at in the robust margins 
objective. Therefore, computation of OFT 
bounds for this objective is done in the same 
manner as described above. 

In general, a successful discrete-time tracking 
design may still result in a continuous-time 
response that exhibits intersample ripple. For 
this reason we now consider the continuous-time 
tracking objective. 


where 




|F0co)i 


This inequality can also be transformed into a 
quadratic inequality in order to map the 
objective into its corresponding OFT bounds. 
Again, let us introduce polar forms, G*{j(j}) = 
and p 7 AMi{j(^) ~ and define 


F*{jco)R*Uo^) 
F{j(jo) R(j(jo) 




Note that the particular class of reference inputs 
must be known in order to obtain QFT bounds. 
Now plug these into the above equation. 


1 

1 




Next, square both sides and rearrange in terms 
of the controller’s magnitude g |C7*(yv)| 






p cos {y — 0) -\- 


a cos (y 4 a) 




‘"'I"-'“''-I' 

y E[-27r, O], for each Piju)) e (4) 


The continuous-time tracking ohjectiife 
For the sampled data system in Fig, 1 we have 

Y(s) = P^au{s)G*{s)iF*(s)RHs) - Y*{s)y 

As expected, because the system is time-varying, 
it is not possible to derive a standard transfer 
function from R{s) to TCv). Several schemes, 
described in the Introduction, can be used to get 
around this difficulty. In this paper, we propose 
a new scheme which does not involve ap¬ 
proximations or conservative bounds. After 
several algebraic steps, the continuous-time 
tracking error in Fig. 1 can be written as 


E{ju)) 


R{jw) 



Fijw) - 


Y(iw) 


Rijio) 


F{jo))G'*(j(a) 


1 + P*,ovU(o)G*iioi) 1 + Fhm{ioi)G*{jm) 


(F*Uu,)R*{ju>) 
F{j(j)) Rijo)) 




G* 




The above can be simplified to 

^F*ija))R*{jio) 

F(j(o)R(jw) 

~ F*nil(yw)| 
1 + Pzou(j<o)G*(jo)) 

< 6i(cu), to E [0, 0)J2], for each P{j(o) e 


Given such a quadratic inequality, the procedure 
for computing OFT bounds was described 
earlier. It can be seen from our approach that in 
order to derive a transfer function from R{s) to 
K(.v), one must specify a prion the class of 
reference inputs. 

Remark 2. The term 


F*()to) R^{jw) 
F(j(o) R(j(o) 




distinguishes the continuous-lime tracking ob¬ 
jective from the discrete-time tracking objective. 
In the limit, as T—►(), equation (4) reduces to 
equation (3). 


Simultaneous objectives 

If the feedback problem consists of simul¬ 
taneous objectives, such as robust margins and 
robust tracking objectives, then the nominal 
loop transmission, iLo(.v), must be shaped so to 
simultaneously satisfy both bounds. Note that 
the OFT procedure for simultaneous objectives 
does not lead to a conservative design since OFT 
bounds are computed separately for each 
objective. 


AN EiXAMPLL 

Consider the sampled-data system shown in 
Fig. 1 with a sampling time of T = 0.01 sec. The 
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analog uncertain plant is described by the set 

For computing the QFT bounds, this set was 
approximated by 

The robust phase and gain margins objective is 

|1 + ^zoh{J(^)0*{j(o)\ ^ i, 

(D e [0, ajJ2], for each P{j(o) e 

which implies PM - 30° and GM = 2 The analog 
prefilter is given by 


and the discretized prefillcr is obtained using the 
emulation F(z)=^Zlf ‘{ZOH(y)Fr^)} The 
discrete-time tracking error bound is 


(i) 


^ ^ 10 75 


The robust stability and robust discrete-time 
tracking bounds, computed from the maps 
defined by equations (2) and (3) are shown m 
Fig 2 For an arbitrary choice of the nominal 
plant 


a synthesized nominal loop transmission Lo(^) = 
which lies within its bounds 




and achieves nominal closed-loop stability is 
shown in Fig 2 The controller is given by 


G(z) = 80 


z-0 92 
z-0 65 


The plant templates at various frequencies are 
depicted in Fig 3 

This direct Z-domain technique offers useful 
insight into the tradeoffs between design 
objectives and controller complexity For ex¬ 
ample, from Fig 2 it is clear that the nominal 
Ll{s) lies very close to its bounds in the range 
a)e[l, 10] This implies that the tracking error 
|£*(ya;)| IS very close to its weight b,(a>) at that 
frequency range This fact is verified in Fig 4, 
where it can also be observed that the objective 
IS actually violated at oj = 9, a frequency at 



Fig 2 Robust margins and discreic-time 
and the nominal loop transmission for T - an 
first-order piefilter 


frequency 

Fig 4 Closed-loop discrete-time tracking error vs weight 
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Fig 5 Figure 2 with the controller replaced by 6(z) = BO 

which bounds were not computed This fre¬ 
quency by frequency design procedure can 
reveal to the designer where changes in the 
weight at some frequency range can lead to a 
more economical design (i e lower bandwidth or 
reduced controller’s complexity) Moreover, 
suppose the designer wishes to quickly test 
whether a controller with a certain structure can 
solve the problem For example, it is easy to 
show that the simple controller G(z) = k cannot 
simultaneously achieve robust margins and 
robust discrete-time tracking to see this, we 
only need to plot the nominal L*)(s) over the 
OFT bounds as done in Fig S (with k = 80) 
Returning to analyse the control design, the 
discrete step responses (corresponding to all 
plants) are relatively “close” to each other, 
however, they are not as “fast” as the step 
response of the discrete prefilter (Fig 6) This 



Fig 6 Closed loop time responses with hrst-ordcr prcfilter 


apparent inconsistency is related to the fact that 
precise translation from time domain specifica¬ 
tions into appropriate frequency domain specifi¬ 
cations IS not possible Any frequency domain 
based design technique is subjected to this 
limitation Of course, certain approximations 
and bounds can be used to add rigor to this 
translation In this example, a tracking weight 
based on a first-order prefilter was chosen for a 
second order plant, complicating the translation 
related inaccuracies A more reasonable prefilter 
is given by 


Fis) 


20 25 

4" 3 15s + 20 25 


The emulated discrete-time prefilter is given by 
F{z) = The corresponding 

OFT bounds for the robust margins and robust 
discrete-time tracking problems along with the 
designed nominal Lo(s) are shown m Fig 7 The 
performance bounds coresponding to the new 
prefilter are tougher around w = 3 requiring 
larger loop gam The controller is given by 


G(z)-= no 


Z--09 

z""^- 0 5S 


As expected, this choice of prefilter results in 
acceptable discrete step responses (see Fig 8) 
The continuous-time tracking bounds tor a 
step input with the weight = t*)|(rM) match 

closely those obtained for the discrcle-lime 
tracking objective (Fig 9) This resemblance 
indicates that, in this particular example, even 
with a controller designed based on the 
discrete-time tracking objective, the system’s 
continuous-time step response will follow closely 
the step response of the prefilter /’(s) with 



Fig 7 Robust margins and discrete-time tracking hounds 
and the nominal loop transmission for 7-0 01 and 
second order prchlter 
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S/i 



Fi(i. 8. C'l(iscd-U)t)p lime respmise.s wiih secoiul-order 
prefiltcr 


negligible intcrsample ripple. Naturally, an 
improperly designed controller which leads to a 
pole/zero cancellation close to 2 == 1 in 
G(z)P/aii{^). may result in an irUcrsample 
ripple (e.g. Morari and Zafiriou, U)89). 

Finally, we have compared the OFT bounds 
for discrete-time tracking at different sampling 
times of 7 =0.01 and 7 = 0.5. Because the 
frequency range w < 5 is far below either 
sampling frequencies, the templates of P*{s) at 
either sampling times have similar shapes. This 
implies that we should expect here similar 
bounds at these have similar shapes. This implies 
that we should expect here similar bounds at 
these two sampling times, as verified in Fig. 10. 
Nevertheless, similar bounds at different sam¬ 
pling times do not necessarily imply that the 



Fig. 9. A comparison between discrete-time and continuous 
time tracking bounds for T =^0.01 



Fig. id. A comparison between discrete-time tracking 
hounds for T -0.01 and 7 =0.05. 

same controller can be used for both cases. 
Indeed, the system with 7 = 0.5 will require 
more phase lead in (he controller compared with 
the 7’= 0.01 system due to the added lag in 

CONCLUSIONS 

A new control design technique for sampled- 
data uncertain systems was developed in the 
spirit of continuous-time, single input/output 
QFF. Using this technique, control design is 
performed directly in the Z-domain for the 
robust stability, robust gain and phase margins, 
robust discrete-time tracking and robust 
continuous-time tracking problems. It was shown 
that Z-domain OFT bounds can be easily 
computed from a set of quadratic inequalities. 
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The Generalized H-, Control Problem* 

MARIO A. ROlEAt 

The problem of finding a controller such that (he closed loop gain from 
LzfO, oo) to L^[(), is below a specified level is solved by conr^erting it into 
a convex optimization problem over a finite-dimensional space. 

Key Words Control system synthesis, convex propramminj*, dislurhance rc|cction, linear optimal 
contiol 


Abstract— In this paper we consider the problem ol finding 
an internally stabilizing controller such that the controlled or 
regulated signals have a guaranteed maximum peak value m 
response to arbitrary (but hounded) energy exogenous 
inputs More specifically, we give a complete solution to the 
problem of finding a stabilizing controller such that the 
closed loop gain from ^.[0. x) lo x) is below a 

specified level Wc consider both stale-feedback and output 
feedback problems. In the state-feedback ca^e if is shown 
that if this synthesis problem is solvable, then a solution can 
be chosen lo be a constant state-feedback gam Nece.ssary 
and sufficient conditions for (he existence of solutions as well 
as a formula lor a stale-lcedback gam that solves this control 
problem arc obtained in terms ol a tmitc dimensional convex 
feasibility program. After showing the separation properties 
of this .ynthesis problem, the output feedback case is 
reduced to a stale-feedback problem It is shown that, in the 
output feedback case, generalized /y, controllers can he 
chosen lo be observer based contioilers The theory is 
demonstrated with a numerical example 

1. INTRODUniON 

CoNSiDhK THh finite-diincn.sional linear time- 
invariant feedback sy.stem depicted in Fig. 1, 
where denotes the plant and % the controller. 
The signal w denotes the exogenous input 
vector, while 2 denotes the controlled output 
vector. The signals u and y denote the control 
input vector and the measured output vector, 
respectively. Let denote (he closed loop map 
from the exogeneous input w to the controlled 
output 2 . 

Many important control problems can be 
shown to be equivalent to the problem of finding 
a dynamic controller such that the feedback 
system is internally stable, and the closed loop 
map is small in a suitably defined sense. 

''Received 15 February 19M1; revised 17 September IWI; 
revised 7 May 1992; received m final form 5 June 1992. The 
original version of this paper was presented at the 1st IT AC 
Symposium on Design Methods of Control Systems which 
was held in Zurich, Switzerland during September 1991. The 
Published Proceedings of this IFAC Meeting may be ordered 
from; Pergamon Press Ltd, Hcadington Hill Hall, (Oxford, 
CW3 0BW, U.K „ , 

t School of Aeronautics and Astronautics. Fuidue 
University, West Lafayette, IN 47907, U S A 


Theories such as Hn, Hr ,, and Z., optimal control 
are some of the concrete and well-known 
examples ol this point of view. In these theories, 
the goal is to find a controller that minimizes, or 
maintains below a desirable level, a given norm 
on Such a formulation of the synthesis 

problem not only captures many practical 
disturbance rcjcction/tracking specifications, but 
also some robust stability specifications. 

Recently Wilson (1989) has introduced a 
number of interesting ‘'system gains”, See also 
Corless et ai (1989). Suppose that the feedback 
system in Fig. 1 is internally stable, and let 
denote the closed loop transfer matrix from w to 
2 . From the results of Wilson, it lollows that, if 
7'^^ is strictly proper, is a bounded operator 
from L 2 [ 0 , '^) to L,{i), ^), and its induced norm 
is given by 

II (jioyruju,) iiw). (1) 

The function /(.) is either the maximum 
eigenvalue or the maximum diagonal entry, 
depending on the “spatial” norm used on the 
controlled signal z(/) (for a precise definition of 
spatial norms see Section 2). Note that when 2 is 
a .scalar signal, (1) reduces to the familiar ZZ., 
norm of T^. On the other hand, if 2 is a 
vector-valued signal this induced norm is no 
longer the standard ZZ. norm. Furthermore, due 
to the presence of the maximum eigenvalue or 
the maximum diagonal entry, HH is not 
differentiable. In principle, this lack of 
differentiability could complicate the optimal 
control problem resulting from minimizing 
11:7^^11 over all stabilizing compensators. On the 
other hand, the lack of this desirable property 
will not affect a sub-optimal approach. 

Let Y denote a positive constant. 1'he synthesis 
problem considered in this paper is the 
following. 


.47.3 
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Fig. 1. The synthesis framework. 

Generalized H 2 control problem. ‘‘Find (if 
possible) an admissible controller such that 
the closed loop system is internally stable and 
the induced norm (1) (the gain from L 2 to L^) 
satisfies ||r^H-ll < 7-” 

The importance of this synthesis should be clear. 
It guarantees that whenever the exogenous input 
w has bounded energy, the controlled output z 
has a specified maximum peak value. As in the 
standard H 2 control problem, the performance 
measure (1) is a function of the integral of the 
“square” of the closed loop transfer matrix. This 
motivates us to call the above synthesis problem 
the generalized Hj control problem. 

Previous research on this problem includes the 
work of Wilson (1990), Grimble (1990), and Zhu 
and Skelton (1991). Wilson (1990) showed that 
the stationary Kalman filter is optimal when 
filtering performance is measured by the system 
gain (1). In this case, should be interpreted 
as the map from noise to estimation error. The 
“control” interpretation of this result is that the 
Kalman filter gain is optimal with respect to the 
cost (1), when “full output injection” to the state 
equation is available. Grimble (1990) established 
a connection between the standard LOG 
problem and a “dual” version of the generalized 
Hn control problem. In our context, the results 
of Grimble show that an LOG controller is 
optimal for the generalized H 2 control problem 
for a plant with “larger weighting” in the 
regulated variables. Grimble considered only the 
case in which the maximum eigenvalue is used in 
the definition of the induced norm. Zhu and 
Skelton (1991) also approached the generalized 
Hj problem via the solution to an LOG problem. 
Based on necessary conditions for optimality, 
they proposed an iterative scheme for finding 
“LOG weights” so that a solution to the LOG 
problem also solves the generalized H 2 problem. 
Their approach assumed that optimal controllers 
exist and have order no bigger than that of the 
generalized plant 

When the maximum diagonal entry is used in 
the definition of the performance measure (1), it 
may be shown that the generalized Hj control 
problem is equivalent to a multicriterion H 2 
control problem. This is because in this case the 
performance requirement is to keep the H 2 norm 


of every single row of below y. This 
multiobjective H 2 control problem has been 
considered by Boyd and Barratt (1990); and 
Khargonekar and Rotea (1991a). In these two 
references the resulting multicriterion problem is 
reduced to a search for LOG weights without 
assumptions on the structure of solutions. Boyd 
and Barratt have also provided an algorithm that 
unambiguously determines the feasibility of this 
multiobjective H 2 problem. 

In this paper we give a complete solution to 
the generalized Hj control problem. More 
specifically, we show that: 

(i) When the state of (or the state and the 
exogenous input) is available for feedback, 
mcmoryless state-feedback gains offer the 
best possible performance for this synthesis 
problem. That is, the generalized control 
problem with dynamic full information 
feedback (i.e. plant state and exogenous 
input are available to the controller) has a 
solution if and only if the generalized /Yp 
problem with memoryless (constant) state- 
feedback is solvable. 

(ii) In the case of state-feedback, a nece.ssary 
and sufficient condition for the solvability of 
the generalized Hi problem, as well as a 
formula for a solution, may be obtained via 
a finite-dimensional convex feasibility pro¬ 
gram. This means that there are efficient 
numerical algorithms to solve this class of 
problems. 

(iii) In the case of output leedback, the 
generalized Hi problem is solvable if and 
only if a generalized Hi slate-feedback 
problem, for a suitably constructed auxiliary 
plant, can be solved, A solution to the 
output feedback problem, when ii exists, 
can be chosen to be an observer-based 
controller. The observer gain is given by a 
standard Kalman filter, while the state- 
feedback gain IS a solution of the general¬ 
ized Hi synthesis problem corresponding to 
the auxiliary plant. As a result, the output 
feedback case is no more difficult than the 
state-feedback problem. 

In addition, we formulate the convex program 
corresponding to the generalized Hi control 
problem with state-feedback in a convenient 
form for finding numerical solutions. In this 
paper, the resulting convex program is solved 
with the ellipsoid algorithm. Due the convexity 
properties of our approach, this algorithm 
cannot fail in determining whether the general¬ 
ized H 2 problem is solvable. The algorithm may 
also be used to isolate, to any desired accuracy, 
a controller that nearly optimizes the generalized 
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Hi performance measure. Finally, we give a 
numerical example that illustrates the theory. 

The notation used in this paper is fairly 
standard. For a given matrix A, A' denotes its 
transpose and >1* its conjugate transpose. If 
A = A*, i.e. A is hermitian, its maximum 
eigenvalue and diagonal entry are denoted by 
ant* respectively. If A and B 

are hermitian matrices, A '^s B (resp. A > B) 
denotes A - B positive scmidefinite (resp. 
definite). Given an hermitian matrix A, the 
function 



^,,...,('4) ifr^2 


is extensively used throughout this paper. Note 
that this function is nondecreasing. That is, 
A2:B implies f,iA)>-f,(B). Further, /(.) is 
convex. Linear time-invariant systems arc 
described by state-space models and they are 
denoted by “script” .symbols, white the eoi- 
responding transfer matrices arc denoted by 
italics. For instance, denotes a system with 
transfer matrix G. The Hardy space H, consists 
of matrix-valued functions that are square 
integrable on the imaginary axis with analytic 
extension into the right half plane. The norm on 
this space is defined in the usual way and it is 
denoted by IHl;. 


2 THE OF.NERAl IZED //, CONTROL PROBLEM 
In this .section we define the generalized fL 
cost and give a time-domain characterization ol 
the generalized IG performance measure that is 
useful for establi.shing some of our results. We 
conclude this section with a precise formulation 
of the generalized control problem 


2.1. -4 nalyxis 

Consider a finite-dimensional linear lime- 
invariant system described by the state-space 
model; 


y/: = 


X = Fx + Gw. J.(0) — 

z = Hx ^ Jw. 


( 2 ) 


The matrices F. G. H. and J are real and of 
compatible dimensions, Suppose that J is 
internally stable, i.e. F has all eigenvalues in he 
open left half complex plane. Let F.,, cno e 
transfer matrix from w to 2- ^ 

may define 


2i7r J—nt. 

The generalized H, perionnance 
eost) for the linear time-invariant system J that 


will be considered in this paper is defined as 
follows: 






\f r = 2 
if 


( 4 ) 


Note that Also, if )) 

finite, wc must have 7 — 0 in (2). As in the 
definition of the norm, fr{S(T^n)) defined in 
terms of the integral (3). This motivates the 
name ‘‘generalized Hj cost'’. 

Wilson (1989) showed that the performance 
measure defined in (4) gives rise to the following 
system gains: 

sup |l„,,/||»v|| 2 ;iveL 2 [ 0 , «>). H-=)t0} 

= Vfr{SiT,^)). ( 5 ) 


where 




, :^sup {||z(r)|l,:/ e [0, ■»)}, (6) 

\i.= \ w'(f)H'(r)di. (7) 

^ ■'0 


In (6), the r-spatial norm is given by 


if r = 2 


|2(0llr: = 


max{lz/0|:;= I. dr = ^. 

( 8 ) 


where s is the number of components of z. Note 
that the subscript ‘r’ in (4) stands for the spatial 
norm used on the output z(l). 

When fr(S(T,J) is finite (i.e. in (2), 7 = 0) it 
can be easily computed in time domain as 
follows. Let L, denote the controllability 
gramian of the pair {F, G). That is, L, is the 
unique solution to 

FL, + L,F' + GG' =0. 

Then 

which implies that/.(5(n„,))=/r(//L.W'). The 
following result provides an alternative charac¬ 
terization for the generalized Hi performance 
measure that will be useful for establishing some 
of the results in this paper. 


Lemma 2.1. Consider the system .‘7 defined in 
(2) and let denote the transfer matrix from w 
(o z. Let r>0 be given. Then, £/" is internally 
stable and /,(5(7;„.)) < Y if and only if 7 = 0 and 
there exists y = T' > 0 such that 

FY + YF'+ GG'<0, (9) 

fXHYH')<y. (10) 

The proof of this result is simple and it will be 
omitted. Details may be found in Rotea (1991). 
For a proof of a similar result, see Lemma 2.1 in 
Khargonekar and Rotea (1991b). 
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2.2. Synthesis 

Consider the finite-dimensional linear time- 
invariant feedback system depicted in Fig. 1, 
where is the plant (including frequency 
dependent weights) and % is the controller to be 
designed. In this paper, and % are given by 
state-space models. The transfer matrices of the 
plant and the controller are denoted by G and 
C, respectively. We denote the closed loop 
transfer matrix by 71^.. 

A controller is called admissible (for the 
plant iS) if % internally stabilizes the plant 
The set of all admissible controllers % for the 
plant is denoted by Here "'.4" stands 

for admissible- Note that if and only if 

is stabilizable from u and detectable from y. 

Let y>0 be given. The generalized 
controller synthesis problem is defined as 
follows: 

“Find (if possible) a controller such 

that fXS{TJ) <' y (r 2 or 

where S{.) and /,(.) are given by (3) and (4), 

respectively. 

Note that solves this synthesis 

problem if and only if the closed loop system is 
internally stable, and there exists a positive 
number e such that the following disturbance 
attenuation condition holds. Given any h e 
L 2 [(), and zero initial conditions, 

IU-IU,^(y-c)||K'||2. 

This is a simple consequence of the signal 
interpretation of the generalized /L cost (cf. 
(5)“(K)) given by Wilson (1989). 

The objective ot this paper is to show that 
conditions for the solvability of the generalized 
//. synthesis problem, as well as the computation 
of a solution (when one exists) can be obtained 
in terms of a finite-dimcnsional convex feasibility 
program. 

In Section 3 we will be interested in 
memoryless (i.e. static) controllers for solving 
the above synthesis problem. In such a case, the 
generalized synthesis problem is defined in 
the following way. Define first the set of 
memoryless stabilizing controllers 

{€e.4 (: C 6 , 

where q = dim (w) and p ~ dim (y) (see Fig. 1). 
Then, the memoryless synthesis problem is: 

“Find (if possible) a controller Jf e.4 such 

that/,(iX7; J) < y (r = 2 or 

in the .subscript stands for 

memoryless controllers. (In this case it is 
certainly possible to identify 7. with C and think 
of as a set of matrices. However, for 

consistency of notation, we will use 7/ when we 
want to think of the static gain as a system, and 


C when we want it to be interpreted as a 
matrix!) 

Remark 2.1. Consider the feedback system 
shown in Fig. 1. Given a plant ^ and an 
internally stabilizing controller the closed 
loop transfer matrix is a function of the 
individual transfer matrices G and C Thus, in 
the sequel we use the notation 

T:JG,C), and S{G, C): = SiT^JG. C)), 

to indicate on which plant and controller these 
closed loop quantities depend. 

3 STATE-FEEDBACK AND FULL INFORMATION 
PROBLEMS 

In this section we consider the case in which 
the plant to be controlled is given by a 
state-space model where either the state vector 
or the state and the exogenous input are 
measured (full information structure). We show 
that the generalized //, synthesis problem with 
dynamic full information feedback has a solution 
if and only if the generalized iL synthesis 
problem with memoryless (constant) state- 
feedback has a .solution. This result is analogous 
to the corresponding results for the LOR 
problem (Kalman, 1960) the standard //, control 
problem (Khargonekar et u/., 1988; Doyle et at., 
1989) and more recently for a cla.ss of mixed 
/A///, problems (Khargonekar and Rotea, 
1991b). This section concludes with a convex 
optimization approach to the constant state- 
feedback problem. 

Consider the generalized //> synthesis problem 
defined in Section 2 for the following plants (see 
also Fig. 1). 

(1) State-feedback plant. The plant is given 
by the state-space model 



X = Ax f W, w T 


• - 

z = Cx T Du 

^y=x. 

(11) 

(2) Full information plant. The plant ' 

is 

given by the state-space model 

X = Ax -h W + B2U 



z = C Jt T Du 
- \x'w'\’. 

(12) 


All the matrices introduced in (11) and (12) are 
real and of compatible dimensions. As before, 
we let G,f and denote the transfer matrices of 
(11) and (12), respectively. The subscripts 
and “//“ denote state-feedback and full informa¬ 
tion structure, respectively. Note that the 
state-feedback and full information plants both 
have the same dynamical equation. The only 
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difference between them is in the measurement 
equation. We assume that the pair {A, B-,) is 
stabilizable. This assumption is clearly necessary 
for the existence of an admissible controller. 

Theorem 3.1. Consider the systems and 
defined in (11) and (12), respectively. Let y>0 
be given and consider the sets of admissible 
controllers and Then, the 

following statements are equivalent: 

(1) there exists ^ e such that 

fXS{G,.C))<y- 

(2) there exists e ) such that 

/.(5(G.„ K))<y. 

Theorem 3.1 shows that, for full information 
or state-feedback, the generalized /■/. cost cannot 
be improved upon by the use of dynamic 
compensation. The proof of this theorem is very 
similar to the proof of Theorem 4.1 in 
Khargonekar and Rotea (1991b). Indeed, the 
techniques used by Khargonekar and Rotea also 
work in our context, provided that the 
(nondecreasing) function /,(.) is used in the 
definition of the performance measure. A proof 
of Theorem 3.1 may be found in Rotea (1991). 

Next, we will develop a convex programming 
approach for solving the state-feedback problem. 
More specifically, given the plant defined in 
(11), we give a necessary and sufficient condition 
for the existence of an admissible state-feedback 
gain K such that K))< y. This condi¬ 

tion is given in terms of a set of inequalities. We 
al.so show that this set oi inequalities defines a 
convex subset of a finite-dimensional space of 
real matrices. Hence, the problem ot checking 
whether this set of inequalities has a solution is 
readily solved via convex programming tech¬ 
niques. Furthermore, the construction of a real 
matrix K such that fAS{G,i, K))<y is straight¬ 
forward from our necessary and sufficient 
condition. 

With reference to the plant defined in (11), 
let n=dim(jc) and i/ = dim(M), Let (W, y)e 
f^q^n ^ j^nyn gjven and define 

L(W, y): = Ay+ YA' ^ B.W + 

(13) 

If y is nonsingular, we define also 

Y) \={CY ^ DW)Y '(Cy -h DW)'. (14) 

Theorem 3.2. Consider the system % defined in 
(11) and let ) denote the set of admissible 

memoryless controllers for this plant. Let G,, 
denote the transfer matrix of %f. Let y>() be 
given. 

(1) There exists such that 


fAS(G^f, K))<y if and only if there exists 
(W, y) eXsuch that the following 
condition is satisfied: 

y = y' > 0 

^ L{W, y)<0 (15) 

JAM{W^ Y))<y. 

Moreover, if this condition holds, the state- 
feedback gain Ku:=WY ’ satisfies 

and /^V(G.,,/f,,)) < 7- 

(2) If Im B 2 c Im B, and (A, B,) is controllable, 
then (15) may be replaced by the condition 

y = y' X) 

L(W. Y) = i) (16) 

JAM{W, Y))<y. 

The significance of Theorem 3.2 is explained 
by the next result. 

Theorem 3.3. Let 2 denote the set of symmetric 
n by n matrices, and Q := {(W, y) e x 
2:y>()}. Consider the mappings L:R'^'''x 
2—^2!, and Af:Q-->2 defined in (13) and (14), 
respectively. Then, L is an affine mapping and M 
is a convex mapping with respect to the cone of 
pcjitive semidefinite matrices. Consequently, the 
function /.(A/):fi—and the sets defined by 
conditions (15) and (16) are convex. 

This theorem clearly shows the advantages of 
using condition (15), or (16), to determine 
whether the generalized /L control problem is 
solvable. Note that finding a pair of real matrices 
{W, y) that satisfies either one of these 
conditions, is equivalent to a convex feasibility 
problem. This means that there are efficient 
numerical algorithms to find such a point, or 
determine that none exists. On the other hand, 
the more direct approach of searching for a 
state-feedback gain s such that 

fAS{G,,, K)) < y gives rise to a nonlinear 
programming problem which need not be 
convex. This is simply because the set of 
admissible static gains tlocs not have 

guaranteed convexity properties. 

Note that while is a subset of 

the set defined by condition (15) is contained in 
a space of dimension qnn{n - l)/2. On the 
other hand, when the technical assumption in 
part 2 of Theorem 3.2 holds, the dimension 
reduces to qn for this is the dimension of the 
kernel associated with the linear equation 
L{W, y) -0 in (16). This fact will be used later 
in Section 5, where the numerical solution to the 
generalized problem is considered. 
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Remark 3.1, In the next section, it will be seen 
that the generalized Hj synthesis problem in the 
case of output feedback can be reduced to a full 
information/state-feedback problem similar to 
the one considered before. More specifically, 
given R = R'^Q and letting ^ denote a full 
information or state-feedback plant, we will 
need to solve the following problem: 

“Find (if possible) a controller ^ e such 

that/,(R-f5(G. C))<f\ 

It is fairly easy to show that Theorem 3.1 holds 
true for this “more general” synthesis problem. 
Moreover, Theorems 3.2-3.3 are still valid 
provided that the following modification is 
made; redefine the map introduced in 

(14) according to 

MiW, y): = R + {CY DW)Y \CY DWy. 

Thus, the approach developed in this section is 
directly applicable to the more general output 
feedback problem. 

We conclude this section with the proofs of 
Theorems 3.2 and 3.3. The key idea for 
establishing Theorem 3.2 is to replace the search 
over K e by a search over the set of 

matrices defined by conditions (15) or (16). Tliis 
is done by using Lemma 2.1 in combination with 
the change of variables K = WY~\ where T is a 
solution to the linear matrix inequality (9). A 
similar result was established in Khargonekar 
and Rotea (1991b). For the use of this change of 
variables in robust stabilization problems, see 
Bernussou et al. (1989) and Packard and Becker 
(1990). 

Proof of Theorem 3.2. Let (W, K) denote a real 
solution to (15), or (16). Since Y is positive 
definite we may define the real matrix 
Ko‘==WY \ Wc will show that the state- 
feedback gain Ku satisfies 

and US{G,f,K,))<y. 

Apply the control law u = to The 
resulting closed loop system is given by 


where F: = A and H :=C-^DK(). It is 

now easy to show, using (13) and (14), that 

TF Y), 

HYH =M{W, Y). 

Now, suppose that {W, Y) is a solution to 
(15). Since Y > 0 satisfies the conditions of 
Lemma 2.1, we conclude that and 

fAS(G,f, ACo))<7- 

If (VV, Y) is a solution to (16), we proceed as 


follows. First, note that Imfijclmfii and 
{A, Bi) controllable imply that, given any K, the 
pair {A + B 2 K, Bj) is controllable. Hence, 
{F, fli) is controllable. Since F > 0 satisfies 
L{W, Y) = 0, we conclude that F is asymptoti¬ 
cally stable. Note also that Y is the control¬ 
lability gramian of the pair {F, B^). This fact, 
together with fr{HYH') =fXM(W, F)) < y, 
imply 5ir(, e and fXs{G,,, K,,)) < y. 

Now we establish the necessity of (15). 
Suppose that there exists ^ e ,rf,„(‘5,yr) such that 
/.(S(Gj^, f())< y. With this controller the closed 
loop system is given by (17), where F: = 
A -f B2K and H : = C -h DK. From Lemma 2.1 il 
follows that there exists K = K' > 0 such that 

FY+YF^ ^B,B\<i), (18) 

fXHYH)<y. (19) 

Define W:=KY. Then, from (18)-(19), it 
immediately follows that (W, T) satisfies (15). 

The necessity of (16) is proven in a similar 
way. Indeed, given ^ 6 .qi/^,(G,/), we may 
conclude that the controllability gramian, say Y, 
of the pair (A + B.K, B^) is positive. Using an 
argument similar to the one above, and the 
condition f,{S{Cf^, K))< y, the necessity of (16) 
is established. 

Theorem 3.3 is similar to Theorem 4.3 in 
Khargonekar and Rotea (1991b). The main 
difference is that Khargonekar and Rotea have 
shown that the scalar valued function trace (M) 
is convex, while here wc show a more general 
result. Namely, the convexity of the matrix¬ 
valued function M. 

Proof of Theorem 3.3. The fact that the map 
L(.) introduced in (13) is atfinc follows from its 
definition. The convexity the map M follows 
from Marshall and Olkin (1979). Indeed, the 
domain £2, of definition of Af, is clearly convex. 
Note also that M is the composition of two 
maps, the linear map {W, Y)—^CY ^ DW and 
the map {X,Y)—^XY ^X'. The convexity of 
this latter map is in Marshall and Olkin (1979). 
We may now conclude that M is convex. The 
convexity of the function f^M) follows from the 
convexity of M and the fact that /^(.) is 
nondecreasing and convex. Finally, the con¬ 
vexity of the sets defined by (15)-(16) follows 
from the properties shown for their defining 
functions. 

4. OUTPUT FEEDBACK 

In this section we consider the generalized FI. 
control problem defined in Section 2 for the case 
of output feedback. The main result of this 
section (Theorem 4.1) shows that the output 
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feedback problem reduces to a state-feedback 
problem. More specifically, Theorem 4.1 shows 
that a generalized controller, if it exists, can 
be chosen to be a combination of a standard 
Kalman filter and a state-feedback gain for the 
generalized H 2 synthesis problem of a suitably 
constructed auxiliary plant. 

Suppose that the plant in Fig. 1 is given by the 
following state-space model: 




i ~ Ax B^ BjU 
^ — C\X D\U 
y = C^x -h Diw, 


( 20 ) 


where all the matrices in (20) are real matrices of 
compatible dimensions. As before, we let G 
denote the transfer matrix of (20). Note that 
there is no feedthrough term from the exogenous 
signal w to the controlled output z. Similarly, 
there is no direct feedthrough from the control 
input u to the measured output y. Although it is 
possible to include these termvS, wc have chosen 
not to do so in order to keep the presentation as 
simple as possible. In addition, we will make the 
following assumptions. 

Assumption 1, The triple {(\, A, B,) is stabi- 
lizable and detectable. 

Assumption 2. D 2 has full row rank and 

D2[B[ Z)') = |0 /]. 

Assumption 3. The pair {A, fi,) has no 
uncontrollable modes on the imaginary axis. 

Clearly Assumption 1 is necessary, otherwise 
the set of admissible controllers is empty. 
Assumption 2 is a standard assumption in 
filtering theory. It ensures that there is full 
measurement noise and that there is no 
correlation between process and measurement 
noise. Finally, in the view of this, Assumption 3 
together with (C\, A) detectable constitutes a 
necessary and sufficient condition to guarantee 
that the Kalman filter, that estimates the state of 
is stable. Note that the full rank property of 
D 2 implies that the orthogonality assumption can 
be made with no loss of generality. Indeed, a 
preliminary output injection transformation, will 
enforce D 2 [B\ D 2 ] ~ [0 /]. 

It is well known that, under the above 
assumptions, there exists a (unique) real 
symmetric matrix Q such that 

AQ^QA'- QC^C.Q -^B,B[=(\ (21) 

and A-QC 2 C 2 is stable. Furthermore, Q^O 
and if the pair (A, fli) is controllable then 
G > 0. In order to give the main result of this 
section we define the following auxiliary system. 

{x=AxA QC2r T B^u 

%f{Q)^. = \v = C^x + D^u ( 22 ) 

yy=x, 


where the matrices A, C,, D,, B 2 are as in (20). 
Let G,f(Q) denote the transfer matrix from 
(r, u) to (v, y) in (22). Here the notation %fiQ) 
means that this auxiliary plant depends on Q and 
has state-feedback structure. The main result of 
this .section is given next. 

Theorem 4.1. Consider the feedback intercon¬ 
nection of Fig. 1, where the plant 'S is given by 
(20). Suppose that As.sumptions 1-3 hold. Then, 
there exists a unique (real symmetric) matrix Q 
that satisfies the ARE (21) and such that 
A - QC 2 C 2 is stable. Moreover, C—0- Let 
y>0 be given and consider the sets of 
admi.s.sible controllers and 

Then the following statements are equivalent. 

(1) There exists such that 

aS{G.C))<Y. 

(2) There exists 'JC e such that 

/.(C,(?C; + i'(G,;(C). A:))<y. 

Moreover, given any real matrix K that satisfies 
Condition 2, the dynamic output feedback 
controller 

_ t = (A-QC^C2 + B2K)^ + QC:2y 

satisfies 

and S(G,C\,) 

= C,QC\+S(G,f{Q), K), 

which implies that /.(5(G, Go))< y. 

Theorem 4.1 shows that the generalized H 2 
synthesis problem with output feedback has a 
solution if and only if the generalized H 2 
synthesis problem for the state-feedback plant 
%fiQ) has a solution. Of course, this auxiliary 
state-feedback problem is readily solved with the 
tools developed in Section 3, see Remark 3.1. 
Moreover, if the auxiliary state-feedback prob¬ 
lem is solvable, a generalized H 2 controller for 
the plant can be chosen to be an 
observer-based controller. In this controller, the 
observer gain is the Kalman filter gain —QC 2 , 
while the stale-feedback gain is a solution to 
the generalized H 2 synthesis problem cor¬ 
responding to the auxiliary plant %riQy 
Although Theorem 4.1 resembles the separation 
structure of the standard LQG problem, there is 
a difference. Here, the state-feedback gain K of 
Condition 2 could depend on the stabilizing 
solution Q to the filtering ARE (21). 

The proper way of using these results for 
solving the generalized H 2 synthesis problem in 
the output feedback case is summarized in the 
following conceptual algorithm. 

(i) Find the (unique) stabilizing solution Q to 
the ARE (21). 
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(ii) Using Theorem 3.2, with the modifications 
indicated in Remark 3.1, determine whether 
the state-feedback problem for the auxiliary 
plant %f{Q) is solvable. If a solution K 
exists, the controller (23) solves the 
generalized H 2 synthesis problem for the 
plant Otherwise, the generalized H 2 
synthesis problem for the plant does not 
have a solution. 

To prove Theorem 4.1 we need a few 
intermediate results and definitions. First, note 
that Assumptions 1-3 imply that the stabilizing 
solution Q to the ARE ( 21 ) exists and satisfies 
Q^O. Equipped with this matrix Q, we may 
now define the following auxiliary system 


jc = Ajt + QC^r -f B2U 
X{Q) =^ v = C,x-^D,u (24) 

[y = C2X+r. 


Let H(Q) denote the transfer matrix from (r, u) 
to (u, y) in (24). 

Observe that the plant ^ defined in (20), and 
the auxiliary system ^( 0 ), share the same triple 
(A.fi., CJ. Thus, Wc will 

now show that the generalized H 2 control 
problem for the plant ^ is equivalent to the 
generalized control problem for the auxiliary 
plant ^{Q). Theorem 4.1 will then follow by 
solving this auxiliary output feedback problem. 
This reduction of the problem is obtained from 
the following lemma. 


Lemma 4.2. Suppose that Assumptions 1-3 
hold. Let Q denote the stabilizing solution to the 
ARE (21), and consider the system 'S and >^{Q) 
defined by ( 20 ) and (24), respectively. Then, 
rd[M{Q)). Let an admissible controller 
^G^C^) be given, and consider the feedback 
systems shown in Fig. 2. Then, C) e H 2 if 

and only if C) e H 2 . In this case, 

S(G, C) = C,QC\ + S{H{Q), C). (25) 

The significance of the separation property 
enjoyed by the integral S(G, C) (which holds for 
any admissible controller %) cannot be overem¬ 
phasized. Note that (25) generalizes the 
well-known separation property of the scalar- 
valued LOG cost trace (5(G\ C)), used in 
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frequency-domain derivations of the LOG 
solution, to the matrix case. 

Even though Lemma 4.2 is an easy conse¬ 
quence of well-established techniques (see for 
example, Doyle (1984) and Doyle et al. (1989)), 
it appears that the separation property of the 
matrix S{G, C) given above has not been noted 
before. For the sake of brevity, the proof of this 
lemma will be omitted. For further details, the 
interested reader may consult Rotea (1991). 


Proof of Theorem 4.1. First note that Assump¬ 
tions 1-3 imply that the stabilizing solution Q to 
the ARE (21) exists and satisfies Q^O. 

We will now use Lemma 4.2 to solve the 
generalized H 2 control problem for the plant 
Applying the function fX) to both sides of (25), 
wc may conclude that a controller ^ satisfies 
Condition 1 in Theorem 4.1, if and only if Sfo 
satisfies 


and f(C\QC[ 

+ S(H{Q)X'))<r ( 26 ) 


Hence, to solve the synthesis problem for the 
plant it suffices to solve the generalized Hn 
problem (26) for the auxiliary plant Next, 

we show that this auxiliary output feedback 
problem may be solved if and only if the 
state-feedback problem in Condition 2 of 
Theorem 4.1 is solvable. 

Suppo.se that Condition 2 in Ihcorem 4.1 
holds, we will show that the dynamic output 
feedback controller defined in (23) satisfies 
(26), and thus Condition 1. Note that '60 is an 
“observer-based” controller for the auxiliary 
plant ^((2) Moreover, since both A T B 2 K and 
A — QC 2 C 2 are stable matrices, it follows that 
Note also that Co) = 

L,riG,f(Q), K), for this observer-based control¬ 
ler leaves the closed loop transfer matrix 
invariant. Hence, we obtain that 


SiHiQ), C„) = .S(GV(C). K). 


Adding C)CC'| to both sides of this last 
equation, and applying the function J,, from 
Condition 2 wc conclude that CO satisfies (26). 

Now, suppose that there exists a controller C 
that satisfies condition (26). Define the following 
full information plant 




X = Ax QC^r -I- B2U 
V = C\x + D,u 
y = Wr']\ 


and let GfXQ) denote its transfer matrix. Then, 
from the definition of 3if((2) (cf. (24)), it follows 
that dynamic full information controller C 0 (v): = 
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[C(j)C2 /] satisfies 

and MC\QC[ 

+ S{Gf,{Q), Cf,)) < y. 

From Theorem 3.1 and Remark 3.1 it now 
follows that there exists 

Xe such that fr(CiQC[ 

+ ‘Ha,(0).K))<y. 

Hence, Condition 2 must hold and the proof is 
complete. 


.S. NUMERICAL -SOLUTION OK THE 
GENERALIZED H, CONTROL PROULEM 

In this section wc reformulate the convex 
program corresponding to the generalized //, 
control problem in a convenient form for finding 
numerical solutions. From the previous section it 
follows that it suffices to consider the state- 
feedback case. Below, we use the notation of 
Section 3, and to accommodate the case of 
output feedback, we will incorporate a positive 
-semidefinite matrix R in the definition of the 
problem, see Remark 3.1 and Theorem 4.1. 

Con-sider the plant introduced in ( 11 ). As 
shown in Theorem 3.2, and with the modifica¬ 
tion indicated in Remark 3.1, the generalized 
problem of finding a controller J{ e .such 

that f,(R + S{(j,f, K))<^ Y may be reduced to the 
convex feasibility problem (15). Using well- 
established methods from the convex optimiza¬ 
tion literature, it is po.ssible to develop effective 
numerical methods for solving this convex 
feasibility problem. Sec, for example, Boyd and 
Yang (1989) for the use of Kelley’s cutting-plane 
method and Shot’s subgradient method, in a 
convex problem similar to (15). However, in 
practice, it might be more advantageous to 
consider the following (perturbed) version of the 
generalized Hj control prob'cn. 

Let /!>() and R=^R'=^() be given, and 
consider the state-feedback plant 




X = Ax -K [By V^/Jh' -K ifiU 
z - a -I- Du (27) 

ly^x. 


Find (if po.ssible) a controller .K ^ 
such that /,(R + 5(0',;(/f), K)) < y. 

The main advantage of adding a small 
tolerance ft is that the (perturbed) generalized 
control problem may be solved using the 
convex program (16) instead of (15). Note that 
the convex program (16) has dimension qn, 
where q denotes the number of control inputs 
and n the state dimen-sion. On the other hand, 
(15) involves a search over a space of dimension 


qn - 1 - n{n — 1)12. This significant reduction in the 
dimension ot the search space is a consequence 
of the second part of Theorem 3.2. Note that, 
since the perturbed exogenous input matrix 
[fl, VjS/] has full row rank, the technical 
assumption in this part of the aforementioned 
theorem holds. Of course, if the matrices A, Bx. 
and B 2 satisfy this assumption, one can set /J = 0 . 

It is obvious that, given any 

Moreover, since we 
consider strict sub-optimality, the existence of a 
controller 'X that solves the problem with /J = 0 
guarantees the existence of a (sufficiently small) 
^max >0 such that, given any /? e [ 0 .J, jfC 
also solves the generalized Ih problem for the 
perturbed plant Thus, if we set fi to a 

sufficiently small number (i.e. machine preci¬ 
sion) wc arc certain that the two problems are 
equivalent for all practical purposes. 

We now reduce the convex program (16), 
corresponding to the generalized 7*/^ problem for 
the plant (27), to a simpler program in which 
there are no equality constraints. Let > 0 be 
given and consider the affine mapping L(W, Y) 
defined in (13). Let denote a particular 

solution to the linear equation 

L(W, y)-f/^/-O. (28) 

Note that such a solution always exists, for 
{Ay Bj) is stabilizable. Moreover, (W,,, Y,,) may 
be found by computing a particular stabilizing 
gain .TT,, e and setting 

y,, = controllability gramian of the pair 

Let ((t/|.V,)_(t/,.K,)}, where U, = t/,'e 

and y, 6 /?'^^'*. denote a basis for the 
A-dimensional (generically s = qn) linear 
subspace 

{(W, y) X : y = y' and 

AY-\- YA' ^ B.W -\-W'B2 = 0). 

Let = [fl| a 2 - ‘ u,y e R' be given. Define the 
affine mappings 

^l{a):=Yp + '^a,U, and <l)(fl) : = W,,-K 

i-l /- 1 

(29) 

It now follow-s that (W, Y) satisfies Y = Y' and 
L(W, Y) + pi = 0 if and only if there exists (a 
unique) vector a e R' .such that y = 'i'(fl) and 
W = D(fl). 

Using Theorem 3.2, condition (16), we may 
conclude that there exists X e such 

that f,{R '-S(G,j{p), K))<y if and only if there 
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exists aeR" such that 

U,(fl) :=/,(« +A/(<t»(a),'l'(fl))) < y. 

where the mapping M(., .) was defined in (14). 
Furthermore, if solves (30), then /C«,i: = 
*I*(“soi)'F“'(as„i) is a solution to our problem. It 
is important to note that the functions tp and (pr 
are convex on their domains of definition. Thus, 
(30) is a convex feasibility problem over R'. This 
is a consequence of Theorem 3.3. 

In some cases, one may be interested in 
computing the optimal generalized 

performance 

V, := inf iUR + 5(G,, (^), K )): Xe 

and a (nearly) optimal controller. This can be 
accomplished by solving the convex optimization 
problem 

“ min ^ R such that (a) 0}. (31) 

where \p{,) and <pri ) are defined in (30). Note 
that the constraint set Q = {a e R': < 0} is 

nonempty, for 0e&. Also, since the 

infimum is attained in 0. This is because, at a 
solution, the controllability gramian is positive. 

Most methods for solving the convex problems 
(30) or (31) require not only the evaluation of 
the functions \p{.} and (pX )^ but also the 
evaluation of subgradients for these functions. 
Closed form expressions for a subgradient of 
ip{a), at a point a e R\ and a subgradienl of 
0^(fl), at a point a e 0, are easy to obtain using 
standard tools for calculating subgradients. A list 
of some of these tools may be found in Boyd and 
Barratt (1990). For brevity, details are omitted. 

The algorithm we have implemented (algo¬ 
rithm GH2) to solve the convex programs (30) 
or (31) is a "‘deep cut” version of the “ellipsoid 
algorithm with constraints” described by Boyd 
and Barratt (1990). Roughly, this algorithm 
produces a sequence of volume decreasing 
ellipsoids in R^ that contain feasible points or 
optimizers. The algorithm GH2 determines in a 
finite number of iterations (provided that y ^ v,) 
whether the convex feasibility problem (30) has 
a solution. When solving the constrained 
optimization problem (31), GH2 determines in a 
finite number of steps an exit point that satisfies 



Fig. 3. The feedback configuration o( the numerical 
example. 


and <^r(floxii) V,. F £, where e is an 
arbitrary positive constant. Hence, the general¬ 
ized Hj cost with the controller /fcxit' = 
is within € of the optimal 
generalized H 2 performance v^. To run the 
algorithm GH2 an initial point no £ /?' and a 
symmetric positive definite matrix 
must be specified. The pair (ao, ^n) defines the 
initial ellipsoid £(,:= [a e R ': {a — ao)'^() ~ 

flo):sl}. The initial ellipsoid £0 should contain 
the feasible set defined by (30) to guarantee 
convergence (Boyd and Barratt, 1990). Even 
though it seems possible (based on the structure 
of our problem) to find (adp Ao) so that the initial 
ellipsoid contains the feasible set (30), we use 
flo = 0 and Ao = kI, where k denotes some 
positive constant. 

We conclude this section with a numerical 
example. This example is taken from Boyd and 
Barratt (1990). Consider the single-input-single¬ 
output feedback system shown in Fig. 3, where 
the plant is given by 


P(^) = 


1 10-j 
10 H- 


The controlled output vector is chosen to be 
z = [elYi e/yi «/y,]', where e denotes the 
regulation error, u the control input, and the 
constant weights yj, y^, yi will be specified 
later. The exogenous input vector is given by the 
process noise r;,,, and the measurement noise 
r)„,. That is, w = \i]^ 

Let denote the 3x2 closed loop transfer 
matrix from w to 2 corresponding to the 
controller y?, and define 

5(6') = -- f 

2JI J-r 

First, wc consider the problem of finding (if 
possible) an admissible controller such that 

fXS(C))<l r=:2oroc. (32) 

The second synthesis problem is to compute the 
optimal generalized H 2 performance 

v, : = inf {fXS(C)):% admissible}, (33) 

and a controller that comes as closed as desired 
to the optimal performance v,. Two sets of 
constant weights arc used for this example. 
Specification 1, y, =0.07, y 2 = 0.04, y, = 0.15. 
Specification 2. = 0.0698, y 2 = 0.04, y^ = 

0.149. 

Note that the two specifications are very close to 
each other. This is important to determine the 
sensitivity of our method. 

Before performing any calculations there are a 
few important things to observe. First of all, due 
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Table 1. GENERAi.tzFo H, costs or LOG comTiouERs 
ANDGENEHAL IZEI) W, cost S of FILT ERING 

Quantity Specification 1 Specfication 2 


Yi, 72. 73 
Trace (5(C, )) 


niC,Q(\) 

AC^QCl) 


0.(n, 0 04, 0.15 
2.1255 
1 3310 
I 2297 
0,7371 
0.7071 


0.0698,0.04,0.149 
2.1333 
1 3341 
1.2307 
0.7373 
0.7071 


the inequality 

^ ^ trace (.), 

it is obvious that if the standard LQO controller, 
say satisfies trace (5(C,J) < 1, then this 
controller is also a solution to the feasibility 
problem (32). Secondly, the condition 

fr{C\QC[)<l 

where C ^ is the output matrix corresponding to 
controlled output z, is necessary for the 
existence of a controller 'i that satisfies (32), see 
Theorem 4.1. The quantity friC\QC\), which is 
computed from the ARE (21), is coordinate 
independent and it represents (he generalized Hn 
“cost of filtering’". The numerical values of the 
quantities just discussed are given in Table 1. 

From Table 1 it follows that neither the LOG 
controller satisfies the specifications nor the costs 
of filtering violate the specifications. Thus, we 
have no choice other than running the ellipsoid 
algorithm GH2 to determine whether the 
specifications can be met. 

The results of running the ellipsoid algorithm 
GH2 to solve the feasibility problem (32) are 
shown in Table 2. The parameter introduced 
earlier was set to machine precision, in this case 


/8 = 10“^^ The particular solution to (28) was 
chosen to be the LOR gain. The initial ellipsoid 
was chosen according to the rule described 
earlier with a constant k = lAx Ur\ 

In Table 2, and (p^ are as defined in (30). 
Note that the number of subgradient evaluations 
for the objective function <p^ is much larger than 
those corresponding to the constraint function 
jp. Thus, in this example, the algorithm spends 
most of its time iterating on the objective 
function 0^. The quantities and 0“^ denote 
the lower and upper bounds on the objective 
function 0^, used to stop the algorithm. See 
Boyd and Barratt (1990) for further details on 
stopping criteria. For the case r = i.e. 
fr{) = we can compare the results of 

Table 2 with those of Boyd and Barratt. Their 
method took 59 iterations with 47 evaluations of 
their cost function to determine that Specifica¬ 
tion 1 is feasible, while 34 iterations with 22 
function evluations were required to find that 
Specification 2 is not feasible. 

Table 3 shows the results of running GH2 to 
compute the optimal performance v, with an 
absolute error of 10"\ The parameter fi and the 
initial ellipsoid are as before. On exit, the 
algorithm also gives a nearly optimal controller 
that satisfies /,(5(C,„)) ^ v,-h 10 \ Of 
course, on exit, the algorithm just produces a 
state-feedback gain. The filter gain is precomp¬ 
uted according to Theorem 4.1. The row labeled 
improvement over LOG controller corresponds 
to the quantity 


l(K)x 


/r( 5 (C,x,)) 


- 1(K). 


To give an idea of the possible differences 
between generalized Hi synthesis, and standard 


Tablf 2 Ri sun oi runnini; ihf algorithm GH2 lo dlifrmini the 

I I ASIHII IT\ OHHk SPFC TEK ATIONS 


Quantity 


Specification 1 Specification 2 


Norm if,) 

Iterations 

Suhprad evaluations V' 
Suhgracl. evaluations (p, 
Exit condition. 0 and 
Feasible? 






14 

42 

14 

74 

5 

4 

5 

4 

9 

38 

9 

70 

0^ >l 

1 

0, 1 

0'J’ 1 

No 

Yes 

No 

No 


Tabie 3. Rlsult oe running the algorithm GH2 to rOMPUTh THE UFITMAl. 

GLNI RAl IZED C OSTS 


Ouaiiiity 


Specification 1 


Specification 2 


Norm ifr) 

Iterations 

Subgrad evaluations; yp 
S^grad. evaluations: (p. 

Improvement over LOG 


'‘■inuK 

R 4 

5 

79 

1.0835 

6.51% 


102 

4 

9H 

0.9997 
10 92% 





no 

5 

4 

82 

106 

J 0848 

l.tXKU 

6.47% 

10.92% 
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frequency (nid/sec) 



FlO. 4, Frequency responsc.s of optimal controllers corresponding to Specification I Solid' LUO, dashed' 

r = 2. dash-dot' r - <*- 


LOG synthesis, the nearly optimal generalized 
H 2 controllers generated by the ellipsoid 
algorithm, and the standard LQG controller, 
corresponding to Specification 1, are given below; 

C^g= '-24.9091 

_(j -l-0.3441)(j + 10) 

(j + 3.1443 ± y3.3804)(,r + 10.823^) ’ 
Ce,..,2= -46.9815 

(j+ 0.1773)(.f + 10) 

^ (j + 3.4866 ±y2.7734)(.r + 23.2865) ’ 

= -96.8849 

(.y + 0.1447)(.; + 10) 

^ (j + 6.1952 ±/1.3494)(.v + 21.4151)' 

The fact that the three controllers share a zero 
at j = - JO is not mere coincidence. This zero is 
fixed by the Kalman filter. Indeed, since the best 
estimate for the state of the “all-pass"’ factor is 
the model itself (i.e. the filtering gain for this 
state is zero) ,y = -10 is always a pole of the 
Kalman filter. Moreover, due to the special 
structure of this example, this pole becomes a 
zero of the controller for any state-feedback 
gain. The frequency responses of these control¬ 
lers are shown in Fig. 4. Note that the three 
controllers perform more or less the same 
classical task—lead compensation. The main 
differences are the low-frequency gain, and the 
amount of phase lead introduced. 


6 CONCLUSIONS 

In this paper we have considered a (sub- 
optimal) generalized Hj control problem. This 
synthesis problem is well motivated since it 
represents a problem of disturbance attenuation, 
as measured by the peak-value of the controlled 
outputs, in the face of unknown energy 
exogenous inputs. 

We have shown that when the state of (or 
the state and the exogenous input) is available 
for feedback, static slate-feedback gains offer the 
best possible performance for the generalized H^ 
synthesis problem. That is, in the state-feedback 
case, dynamic compensators buy nothing extra 
as far as the generalized synthesis problem 
concerns. Further, necessary and sufficient 
conditions for the solvability of the generalized 
H 2 synthesis problem with state-feedback, as 
well as a formula for a solution, are obtained via 
a finite-dimensional convex program. In the case 
of output feedback, it is shown that generalized 
//? controllers can be chosen to be a combination 
of a standard Kalman filter, and a state-feedback 
gain that solves the generalized synthesis 
problem for a suitably constructed auxiliary 
plant. Due to the convexity properties of our 
approach, a solution to the generalized tU 
problem may be readily found. 

The results in this paper may be extended in 
several ways. Firstly, they can be readily 
modified to solve a multicriterion generalized Hj 
control problem. That is, the problem of finding 
a stabilizing controller such that several closed 
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loop transfer matrices are jointly small as 
measured by their generalized //, costs These 
extensions are trivial. Secondly, \he results in 
this paper may be combined with those in 
Khargonekar and Rotea (1991b) in order to 
solve the synthesis problem of finding a 
controller such that an “upper bound” for the 
generalized cost of one closed loop transfer 
matrix and the norm of some other closed 
loop transfer matrix are below desirable levels. 
This multicriterion control problem is extremely 
important since it represents a problem of 
nominal performance, as measured by a 
generalized H, cost, with robust stability, as 
enforced by the constraint. The solution to 
this problem may be found in Rotea and 
Khargonekar (1991). The extension of this 
mixed problem to accommodate more than one 
generalized performance measure i.s relatively 
straightforward. Extensions to discrete-time 
problems are al.so possible by combining the 
results in this paper with those in Kaminer el al 
(1993). 
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Abstnicl —We consider the problem ol “stabilizing” 
uncertain, singularly perturbed, linear time-varying, systems 
whose fast dynamics can be unstable We propose a class ol 
nonlinear composite controllers which assure global uniform 
ultimate boundedness of the trajectories ol the closed loop 
system, provided the singular perturbation parameter is 
sufficiently small Lhese controllers consist of a fast 
controller which stabilizes the last dynamics and a slow 
controller which vields the desued stabililv properties for the 
slow dynamics We consider the structure of the fast 
controller to be simpler than stnictures previously proposed 
m the literature lo obtain these controllers, wc hist develop 
some new results for singularly perturbed systems under 
output feedback In particular, it is shown that the rnaUhinf* 
assumption, which deals with the manner m which the 
uncertainties enter the system, and is made on the 
reduced-order system, is invariant under linear feedback of 
the fast variable 

1 INTROnUCniON 

UNCtRiAiN MODEi parameters and muUiple-lime 
scale behavior of the plant under consideration 
are common problems encountered in the 
synthesis of control systems. Since if is not 
always possible to measure all the quantities 
which affect a plant, because ol technological. 


* Received 19 October 1990, revised June 1992, received 
in final form 2K August 1992 A preliminary version of this 
paper was presented at the IFAC Symposium on Nonlinear 
Control Systems which was held in Capri, Italy during June 
19K9 The Published Proceedings ol this IF'AC Meeting may 
be ordered from Pergamon Press I id, Headmgton Hill Hnll, 
Oxford, 0X3 UBW, U K This paper was recommended for 
publication m revised form by Editor T Ba^ar 

t School of Aeronautics and Astronautics, Purdue 
University, West Lafayette, IN 47907, USA 
t Supported by the U S National Science Foundation 
under Grants MSM-87-06927 and MSS-9()-57079 

,li Dipartimcnlo di Intormatica e Sistemistica, University 
degli Studi di Napoli “Federico 11”, via Claudio 21, 1-80125 
Napoli, Italy 

I'This work was performed during a stay at the School of 
Aeronautics and Astronautics, Purdue University, supported 
by Consiglio Nazionale delle Ricerche, Italy, under Grant 
203.07.17 


economical or practical constramls, it is usual 
engineering practice to consider a nominal plant 
during the design of the control system, and then 
check a posteriori the satisfaction of the design 
specifications, either directly on the plant or via 
detailed numerical simulations. A similar proce¬ 
dure IS often used when dealing with plants 
characterized by multiple-time scale behavior; in 
that case, only the slow dynamics arc considered 
in the design stage and the full dynamics are 
taken into account m the validation phase. 

In recent years, sounder mathematical 
foundations for these engineering practices have 
been sought, so that, while maintaining control¬ 
ler design sufficiently simple, some a prion 
knowledge of a more complex system than the 
nominal one can be embedded m the early stages 
of design. Good examples of this approach are 
the deterministic control of uncertain systems via 
Lyapunov’s second method and singular pertur¬ 
bation theory. 

In the former (Corlcss and Leitmann, 1988; 
Leitmann, 1989a contain a comprehensive list of 
references) the uncertain quantities are assumed 
to range in a priori known sets and to enter the 
dynamics of the plant via given subspaccs, but, 
apart from some mild technical conditions, 
nothing is assumed to be known on their actual 
time history. Lyapunov’s second method (Kal¬ 
man and Bertram, 1960) is then used to 
construct stabilizing controllers. 

Singular perturbation theory (Kokotovi6 et a/., 
1986; Kokotovic and Khalil, 1986) permits the 
decomposition ot a two-time scale system into 
two subsystems of lower order, one for each 
time scale. This, in turn, divides the full control 
system design into two parts and, moreover, 
yields an a priori evaluation of the robustness of 
the procedure, i.e. of how different from each 
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other the time scales must be in order for this 
approximate design to work. 

The combination of the two approaches in 
order to obtain robust controllers for a singularly 
perturbed system subject to uncertain parameter 
variations, has been studied in a number of 
papers. More specifically, in Leitmann et al., 
(1986), Ryan and Corless (1991), Corless (1987, 
1989), Leitmann (1989b) and Corless et al. 
(1990) the method of singular perturbations is 
used to assesvs the robustness of feedback control 
laws with respect to fast dynamics neglected in 
the design stage. In Garofalo (1988), Garofalo 
and Leitmann (1988), Corless et al. (1989), 
Garofalo and Glielmo (1989) and Garofalo and 
Leitmann (1990) the fast uncertain dynamics are 
not assumed to be stable, and the proposed 
controllers are composite controllers in the sense 
that they consist of a “fast” controller 
“composed” with a slow one; roughly speaking 
the fast controller stabilizes the fast dynamics 
and the slow controller yields the desired 
stability properties for the slow system 
dynamics. Similar ideas have been successfully 
used in the framework of adaptive control; see, 
e.g. Taylor et al. (1989). 

In this paper, as in Corless et at. (1989), we 
consider a class of linear time-varying uncertain 
singularly perturbed systems, whose boundary- 
layer subsystem may be unstable, and we 
propose a class of nonlinear composite control¬ 
lers which assure ultimate boundedness of the 
trajectories of the closed loop system for 
sufficiently small values of the singular perturba¬ 
tion parameter. We emphasize that, although 
the design philosophy is that of composite 
control, the structure of the fast controller is 
simpler than the structures previously proposed 
in the literature. This simpler structure is 
achieved by exploiting the linearity of the fast 
dynamics. 

In order to develop our new controllers, we 
first obtain some new results on singularly 
perturbed systems subject to output feedback. In 
particular we show that, for the reduced-order 
system describing the slow dynamics, properties 
such as controllability, observability, and match¬ 
ing of uncertainties are invariant under well- 
posed output feedback laws. 

I’he results are illustrated by application to a 
simple flexible mechanical system. 

2 PROBLEM STATEMEN r 

We consider uncertain singularly perturbed 
systems described by 

•^(0 = ^ Ii('7(0)'<c(0+ 12 ( 9 ( 0 ) 2(0 

+ B,(q(t))u(t) + b,{q(t)), (la) 


fiz{t) = v42i(9(0)jf(0 + ^ 22 ( 9 ( 0 ) 2(0 

+ B2(9(0)«(0 + M9(0), (lb) 

where t e J = [fo, “) is the “time”, x{t) e R" and 
2 (/)e/?"' are state vectors, u{t)eR^' is the 
control input, q{t) e is the vector of uncertain¬ 
ties, Bj{') and b,{‘) are continuous matrix 

functions of appropriate dimensions, and jU > 0 is 
the singular perturbation parameter. Note that 
b](q{')) and bjiqi )) can be interpreted as 
disturbance inputs. Regarding the uncertainty 
q{') we make the following assumption. 

Assumption 1. There exist a compact set Q 
and a positive constant both known, such 
that and q is continuously 

differentiable with ||^(/)|| ^ on .T.t 

The next assumption deals with the depend¬ 
ence of some of the matrices of equation (1) on 

Assumption 2. The functions 42i( ). zf 22 ( ). 
B 2 {') and fcp( ) are continuously differentiable. 

Note that we do not require continuous 
dilTercntiabilily of all the A,j{') since we do nol 
require an exact decomposition of the system 
dynamics into a slow and a fast part; compare 
this with Kokotovic et al. (1986). 

We also assume that the singularly perturbed 
system (1) is in standard form (Kokotiviti et al., 
1986) as follows. 

Assumption 3. The matrix is invertible on 

Q 

Letting /x = () in equation (1), the associated 
reduced-order system is given byt 

Xo = /4()(9)Xn + B^^(q)u 4- 6o(h^), (2a) 

z^^-= u, q), (2b) 

with 

/i(xo, M. q) 

= “-^22 (9)1'^2i( 9)-^o 3" ^2(9)*^ "1“ ^2(9)]- (^) 

In the above equations, XnE R" is the slow time 
scale approximation of state x, e is the 
“quasi steady state” component of the state z, 


\ If '/is a set, by “on .T' wc mean “for all s e !f'\ 
t In what follows, arguments of functions will be omitted 
when no confusion is likely to occur. 
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and 

'^ 0 ( 9 ) - ^ 11 ( 9 ) ~ ^uiq)Ai 2 {(])A:iy{q), (4a) 
B„{q) = Bt{q) - Au(q)A 22 (q)B 2 iq). (4b) 

M9) = fci(9) - A, 2 {q)Ay 2 'iq)b 2 iq). (4c) 

We introduce the following definition. 

Definition 1. A system of the form (2) has 
uncertainties matched with the input around a 
pair (An, Bn). An e W""''. Bn e if 

• the pair (Ao, B^^) is stabilizable; 

• there exist matrix functions £, C7, wi-h 
G{q) nonsingular on Q, s.t. 

Aniq) = An+ BnE{q). 
Bn(q)-BnG(q), 
bn(q) = Bngiq), 

on Q. 

Regarding the reduced-order system we 
assume the following. 

Assumption 4 (uncertainly matching assump¬ 
tion), There exists a pair (Ao, B(,), AoeR"'"'’, 
fl„es.t. system (2) has uncertainties 
matched with the input around (A„, R(,). 

It is worth noting that in Garofalo and 
Leitmann (1990) and (iarofalo and Gliclmo 
(1989), matching of the uncertainties is assumed 
on each of the two subsystems (la) and (lb); 
this is different from here. 

In this paper we consider the problem of 
''stabilizing’’ system (1) via a memorvless state 
feedback controller. Since system (1) is uncer¬ 
tain, the controller must assure the stability 
property independently of the particular form 
the uncertainty q() takes. I’hc continuous 
controllers we propose guarantee uniform 
ultimate boundedness of the trajectories of the 
closed loop system fo- ill p > 0 sufficiently small. 
Loosely speaking this means that the trajec¬ 
tories will be uniformly bounded and, after a 
sufficiently long time, be confined to a ball 
containing the origin of the state space. 

We point out that we do not assume the 
matrix A 22 (q) he stable on (9; in other words, 
the boundary-layer system! associated with the 
full order system (1) is not assumed to be 
asymptotically stable. This, from a composite 
control point of view, implies that a controller 
must be designed to stabilize the boundary layer 
system. Usually this is accomplished by feeding 
back the boundary layer state z - z^. This is not 

t A precise defimlion is given in the next .seetion. 


possible here since z„ depends on the uncertainty 
q. However we note that, since the fast 
subsystem is linear, the boundary layer system 
can be made asymptotically stable by simply 
feeding back the state z. This type of controller, 
however, changes the reduced-order system. 

In Section 4 we analyse the effects of linear 
feedback of the variable z on the reduced-order 
system and on the matching assumption. To do 
so, we first develop some new results on 
singularly perturbed systems under linear output 
feedback; these are presented in the next 
section. 


3 LINEAR OUTPUT FEEDBACK AND THE 
REDUCED-ORDER SYSTEM 

Output feedback control of singular perturbed 
systeiiis has been considered by many authors. 
Some of them (Chemouil and Wahdan, 1980; 
Fossard and Magni, 1980; Khalil, 1985; 
Moerder and Calise, 1985, 1988; Calise et ai, 
1990) have studied the design of output feedback 
controllers for singularly perturbed systems, 
others (Khalil, 1981, 1984; Vidyasagar, 1985a, b; 
Corlcss and Gliclmo, 1993) have dealt with the 
robustness of output feedback controllers de¬ 
signed on the basis of the reduced-order system 
(i.e. for ^ = 0) but applied to the full order 
system (for p 0). 

Here, we analyse the effects on the reduced- 
order system, of a linear output feedback 
applied to the full order system. 

Suppose system (1) is endowed with a 
measured output of the form 

y == C\(q)x T C.iq)! + c{q), (5) 

where y{t)eR\ and f’i(’), r 2 ( ) and c( ) are 
continuous matrix functions. 

If system (1), (5) is subject to a linear output 
feedback controller of the form 

u{l)^ Fy(l) + au), (6) 

then the resulting closed loop system is 
x=At,(q)x + At 2 U/)z + B^(q)u + h^iq), (7a) 
fiz = A 2 \iq)x + A 22 (q)z + B2{q)r4 + ^?(^). (7b) 

with 

A„^A,iB,FC,. (8a) 

b,^b, + B,Fc, (8b) 

for i, j -\, 2. 

Clearly, system (7) is in standard form, i.e. the 
equation 

{) = A2,iq)x^,^ A2n(q)z^, + R2(q)u + 

has a unique solution z,), if and only if the matrix 
nonsingular on Q. Under this 
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hypothesis we have 


Zq = ~ A22{q)[^2\iq)^o + ^ 2 ( 9 )“ + ^ 2 ( 9 )] 

(9) 

Substituting (9) into (7) we obtain 
reduced-order system 

the 

Xn = Ao(q)Xo + Bu(q)a + b„{q). 

(10a) 

y = Co(q)x„ + D„(q)u + Co(q), 

(10b) 

with 


A,) = Aii ~ Ai2A22A2\, 

(11a) 

B„^B,-A,2A2fB2. 

(lib) 

bo~ b\ — Ai2A22^2> 

(lie) 

C„^C,-C2^-2U'2,. 

(lid) 

Do^ -C2A2fB2, 

(lie) 

Cq — C ^2^22 ^2* 

(Ilf) 


We introduce the following definition. 


Definition 2. The reduced-order system (10) is 
well-defined if the matrix nonsingular 

on Q. 

System (10) is obtained by first applying the 
output feedback law (6) to system (1), (5) and 
then letting (i = 0. We now reverse the order of 
these operations by applying controller (6) to the 
original reduced-order system associated with 
(1), (5). This is given by 

x„ = Ao(q)Xo-^ B(,{q)u + b„(q), (12a) 

y = C„(q)Xn +D^fq)u + c^yiq), (12b) 

where A,,, B„, and b„ are given by (4), and 

Cu = CI — C 2 A 22 A 21 , (133) 

D^^= —C 2 A 22 Bt, (13b) 

C(, - c - C2v4^'f)2- (I3c) 

If system (12) is subject to controller (6), then 
y = Coiq)x„ + Duiq)Fy + D^fq)u + Cffq). (14) 

For the above equation to have a unique 
solution for y , we require I, - D„F to be 
invertible on Q. Under this hypothesis we can 
solve equation (14) for y and obtain the 
following closed loop system 

Xo = A^lq)x^, -I- K{q)u + ^n{q), (I5a) 

y = A)(gK) + f>niq)u + C»{q), (1.5b) 


where 

A. = (//-A)F)- 'Dc„ (16b) 

A(, = Ai) + B(|FC(i, (Ibc) 

^0 - Bf)(/p + fAi). ( 16 d) 

^11 ~ (4 ■” 'co, (Ibe) 


(16f) 

The following definition is then justified. 

Definition 3. The closed loop system (15) is 
well-defined if If — Di^(q)F is nonsingular on Q. 

Hence, systems (10) and (15) are closely 
related: the first one is obtained by computing a 
reduced-order system after applying the output 
feedback control law (6); the second one is 
obtained by closing the loop directly on the 
reduced-order system (12). In other words, the 
operations “closing the loop” and “letting 
are applied in different orders. The 
natural question arising is whether they comm¬ 
ute: although the answer is intuitively 
affirmative, it is not obvious. We formalize it in 
the next theorem. 

Theorem 1. Suppose Assumption 3 holds and 
consider systems (10) and (15). 

(1) If one of them is well-defined, the other one 
is well-defined. 

(2) They are identical. 

Proof. System (10) is well-defined iff A 22 is 
nonsingular, and (15) is well-defined iff 7/- 
A)f = If-^ is nonsingular. Actu¬ 

ally, these conditions are equivalent, as can be 
readily proven using the matrix equality (sec for 
example Kailath (1980)) 

det(/ + /tB)-dct(/4 BA), (17) 

where A and B are any matrices of suitable 
dimensions. Indeed, one has 

det(// DoF) = det (7/H-C,A..'T^.f ) 
-del(/,, ^AllB.FC.) 

= del (A 22 ) det (A.pJ^ (*^) 

hence 7/ ~ AiF is invertible iff A 22 is invertible. 

As regards the identity of the two systems, this 
is equivalent to the identity of corresponding 
“tilde” and “hat” matrices in (10) and (15). To 
demonstrate the latter, we first notice that 

^\ 2 A 22 ~ {Ai 2 ~~ A \ 2 A 22 ^ 22)^22 

= FoFC,A2V, 

and, similarly, 

^22 ^2\ ~‘^22 ^ ^22 P2hC{), (20) 

^ 22 ^ 2 " ^ 22^2 ~ ^22 ( 21 ) 

From this, equations (11) yield 

B,, = Bo%-FC 2 Ai^Bf), (22a) 

Q) ~ (A “ ^2^22 (22b) 
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A„ = A„ + B„FC„ (22c) 

= A„ + B„FQu (22d) 

c„ = (/, - C.A^JB 2 F)co, (22e) 

b„ = b„ + (22f) 

= h„ + B„Fc„. (22g) 

On the other hand, the matrix inversion 
formula (Kailuih, 1980) gives 

(/, - D„F)-' = I,- C 2 A 22 'B.F. (23a) 

4+FA, = (4-FZ)o) ' (23b) 

~ ~ BC 2 A 12 B2- (23 c ) 

By applying the above equalities to (16) and 


comparing with equations (22) and (lie), one 
can derive the desired result. 

In view of Theorem 1, we shall refer only to 
system (10) and call it the reduced-order closed 
loop system. We will call system (2) the 
reduced-order open loop system. Moreover, we 
can introduce the following definition. 

Definition 4. The output feedback control law 
(6) is well-posed for the singularly perturbed 
system (1), (5) if the reduced-order closed loop 
system is well-defined. 

Remark 1. In view of Theorem 1, the wcll- 
posedness of a feedback law is equivalent to the 
requirement that one (hence all) of the matrices 
A 22 ,1/ ~ D^)F, - FD^^ be nonsingular on Q. 

Then, the useful formulas (23) apply. 

Remark 2. Since corresponding “hat” and 
“tilde” matrices arc equal, we can use either 
(16) or (22). The form and symmetry of these 
equations suggest some interpretations. Looking 
at system (10) from the control input point of 
view, we write 


B„ = B,{l„-FD„y\ 

(24a) 

All = A^) + B^)FC 1 ), 

(24b) 


(24c) 


Thus, the effects of a well-posed output feedback 
law on the original reduced-order system (2) are 
equivalent to 

• right-multiplying the input matrix by the 
nonsingular matrix (/^, - FDo)‘‘; 

• feeding back, through the new input matrix, 
the quantity 

F(C,hCo + c,,) = f (CiJCn + C 2 h{x„, 0, q) + c]. 

A dual interpretation, from the output point 
of view, is obtained by considering 

Q = (/,-Dof)--'C,. (25a) 


Co = (4-/;of)"‘co, (25b) 

A() — Ai) BdFCq, (25 c ) 

+ Bofco- (25d) 

A further consequence of equations (24)-(25) 
is the following theorem. 

Theorem 2. Suppose system (1), (5) is time- 
invariant (i.e. ^ is a constant function), satisfies 
Assumption 3, and the output feedback law (6) 
is well-posed. Then, the reduced-order closed 
loop system is stabilizable (detectable) iff the 
reduced-order open loop system is stabilizable 
(detectable). 

Returning now to Assumption 4 of the 
previous section, wc notice that, using (24), one 
obtains 

— B^)Gp, (26a) 

Au = An + B^^{E Gf FC„) 

= Ao T BoEf,, (26b) 

{2fK) 

where Gf,, Ef and gf depend on q. We point out 
that, if the output feedback law is well-posed, 
the matrix G^{q) in (26a) is nonsingular on Q. 
The above equations lead to the following 
theorem which, for the particular scope of this 
paper, can be considered the main result of this 
section. 

Theorem 3. Consider system (1), (5) and 

suppose Assumption 3 holds and the linear 
output feedback law (6) is well-posed. Then, the 
reduced-order closed loop system has uncer¬ 
tainties matched with the input around the pair 
(A^. ®o) iff the open loop reduced-order system 
has uncertainties matched with the input around 
the same pair, 

4. STABILIZAITON OF IHE BOUNDARY-LAYER 
SYSTEM 

At the end of Section 2 wc mentioned that 
our intention is to use linear feedback of the fast 
variable z to stabilize the boundary-layer system. 
To this end, consider a feedback law of the form 

u{t) = Fz{t)^a(t), (27) 

where the control u is to be used in the design of 
a controller for the reduced-order system. 

Clearly, (27) is a special case of (5) (6) with 

I = m, (28a) 

c{q)^0. (28b) 
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C,(( 7 )^ 0 , (28c) 

C2(q)^L- (28(1) 

Thus, we can utilize the results of the previous 
section. 

In particular one has 

G,=G{/„ + FA22'ff2)\ (29) 

ftn = Bn(/, + FA,,'B2r' 

= B„Fc,, (30) 

C„=-A.2'A2,, (31) 

A 22 = A 22 + BjF. (32) 

Consider now any nonsingular matrix M e 
and define a new input vector 

v = M-'a-, (33) 

the input matrix of the reduced-order closed 
loop system relative to this new input is 

= (34) 

with 

C = G(/p + M.Vfl2) 'M 
= G^M. (35) 

We note that the matrix Liq) depends on 

matrices F and M, These are specified in the 

next assumption. 

Assumption 5. There exist a z-feedback matrix 
F and a matrix M s.t. on C?, 

(1) A 22 iq) is asymptotically stable, 

(2) L(q) is positive definite. 

Roughly speaking, Assumption 5.1 assures us 
that (a) the feedback law (27) is well-posed, and 
(b) the boundary-layer system associated with 
the singularly perturbed system (7), (8) and (28) 
is asymptotically stable in the fast time scale 
T = for any uncertainty q( ) and any 

“frozen” time instant Of course, a stronger 
condition than mere stability could be chosen, 
such as constraining the eigenvalues of Ajziq) to 
a prescribed sector of the left half complex 
plane, thus ensuring sufficient damping of the 
fast time scale dynamics. 

Assumption 5.2 will enable us to establish a 
feedback control law generating 1 ; (essentially a 
feedback “direction”) which stabilizes the 
reduccd-ordcr system. In order for Assumption 
5.2 to be satisfied a necessary condition is that 
the matrix Gf.{q) be nonsingular on Q\ this is 
guaranteed by Theorem 3. 

Remark 3. If Assumption 5 is assured with some 
matrices F and Af^, then a whole class of 
matrices M will assure Assumption 5.2 and some 
sort of optimization turns out to be possible. 
One possibility is that M be chosen so as to 


solve the following optimization problem: 

inaxminA,n,„[L(<7) + L(9)^], (36) 

where M is defined as 

{Me:||A/|| = 1}. 

It will be clear in the following that this 
contributes to decreasing the gains of the 
proposed controllers. 

Remark 4. If G{q) is positive definite on Q, and 
A 22 (*) and Bni ) are constant over Q, i.e. they 
are not affected by uncertainties, then Assump¬ 
tion 5.2 is certainly satisfied for some M. In 
particular a “natural” choice for M is 

A/= 4 4 FA 22 B 2 . (37) 

With this choice L(q) ^ G(q). 

Remark 5. In the scalar input case, i.e. p = l, 
and G{q) > 0 on (7, Assumption 5,2 requires 
essentially that the scalar 1 H- ^A 2 :‘(^)^ 2 (^ 7 ) docs 
not change sign over Q. In this case, 
1 4^ FA 22 B 2 = det (4 4- FA^'B.) = det {A 22 ) det 
(/422) (employing (17)); hence, noting that 
det[A 2 ?.( 9 )l docs not change sign over since 
A 22 (q) is asymptotically stable on Q in view of 
Assumption 5.1.t we obtain that a necessary 
and sufficient condition for Assumption 5,2 to 
hold is that del|i4 22 (< 7 )] does not change sign 
over Q. This, in turn, is guaranteed by 
Assumption 3, provided the set Q is connected. 

Remark 6. We notice that, in view of (28) 
equation (9) can be written as 

Zo = (L ^ A 22 ^(q)B 2 (q)F) ^h(x,, u, q), (38) 

from which we see that the effect of z-feedback 
on the slow part of the z state is equivalent to 
multiplication by the “gain” matrix (4,4- 
A22^B2F) V 

Since z-feedback modifies the reduced-order 
system (see equation (26)), it is desirable to 
reconsider the choice of the nominal dynamical 
matrix An of the reduced-order system, so as to 
minimize the magnitude of the uncertain terms. 
From equation (26b) we have 

A„(q) = A^. + B,Mq), (39) 

where 

A^.^A„+B,,K*, (40a) 

EAq)^E^{q)-K\ (40b) 

and the matrix K* € R’’"’' solves the following 

1 Del (-v422t</)) IS equal lo ihc constant term of the 
characteristic polynomial of 
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problem 

min n\'dx\\Ef,{q)- K\\. (41) 

The stabilizability of the pair (/in, So) is 
equivalent to that of S„). 

5 A NONi INEAR CONTROLLER 
In this section we utilize the properties of 
linear z-teedback ♦o propose a nonlinear 
composite controller which guarantees ultimate 
boundedness for the trajectories of system (l).t 
For the sake of readability, we first summarize in 
the following the form of the matching 
assumption for the reduced-order system after 


z-feedback; 

(42a) 

B,,(q)M = B,Mq). (42b) 

liufi/iq)- (42c) 

rhe controller has the form suggested in (27), 

u{t)^FzU)^u{t). (43) 

and we now specify 

(44) 


thus realizing feedback of the slow state x. In 
equations (43) and (44) matrices F and M are 
those described in Assumption 5; the function w 
IS specified in the following subsection. 

5.1. Structure of the nonlinear controller 

Choose any positive definite symmetric (7„ e 
let f*,, be the unique positive definite 
symmetric solution of the Riccati equation 

+ e.i - 2I\Xb/,Pu =-- (45) 
and define (he functions 

vU)^ iUP„x, (46a) 

.'(v) = (l + ||v||) 'v. (46b) 

Then, the function w has the form 

w(x) = ^/J,v-/j,.v(yv). (47) 

where, tor the sake of simplicity, we have 

dropped the dependence of on jc. In equation 
(47), /j, and fi 2 are non-negative constants, y is a 
positive constant. The control law is made up of 
two parts: (a) the linear part counteracts the 
effect of the uncertainties in the matrix Au, (b) 
the nonlinear part is a continuous approximation 
of a discontinuous law (for details see Gutman, 

1 See the Appendix for d dehmtion of ultimate 

boundedness and for a lemma on how to guarantee this 
properly. 


1979; Corless and Leitmann, 1981; Ambrosino et 
al., 1985) and counteracts the disturbance input 


If 

To choose jC, and jSi, define 


kf..- 

max ||£'a-.(^)||, 

«/eC? 

(48a) 

k, - 

(1/2) min A„,„[Z.(^) + LiqY], 

(48b) 

k = 

max ||^:,.( 9 )||, 

qtq 

(48c) 

and 

k- 

Q A /, -1 1 ^ A* 

^ ' ' 2A.„,„((3„)A, ■ 

(4ya) 


Ih^k.Jk,. 

(4yb) 

and let 


(.^Oa) 



(50b) 


The only requirement on y is that it be strictly 
positive. Nevertheless, increasing y reduces the 
size of the ball of ultimate boundedness of the 
reduced-order system, and thus it may be 
desirable to select it as large as practically 
possible. 

5.2. A stability result 

For the control law given above it is possible 
to prove the following theorem. 

Theorem 4. Consider system (1) subject to the 
nonlinear controller (43)-(5()) Then there exists 
a constant ju*>() such that, whenever 
the trajectories of the closed loop system are 
uniformly ultimately bounded. 

Remark 7. We would like to point out that the 
control design procedure presented in this paper 
is different from the usual composite control 
design procedure. In that approach (sec 
Kokotivic et al. (1986), for the general 
philosophy and, for example, Garofalo and 
Leitmann (1990) tor application to uncertain 
systems) the slow control design is carried out on 
the basis of the original reduced-order system; 
the 2 -feedback is designed to be inactive when 
= because it feeds back 2 -z „.4 

Feeding back the state z, while being simpler, 
modifies z,), as shown in equations (9) and (38) 
and, in turn, the reduced-order system. In a 
linear “certain’' system, these effects can be 

1 In the Ccise of an unccriam singularly peiiurbed sysicm, 
) turns out to be dependent on the uncertainly qi ) (see 
equation (2b)) A nominal quasi steady state trajectory 
corresponding to a nominal uncertainty q{') is then 
considered. 
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canceled out by the jc-feedback; in the present 
case, due to the presence of the uncertainties, 
these effects are partly embedded in the nominal 
reduced-order system. 


Proof of Theorem 4. As in Saberi and Khalil 
(1984) the proof is based on a Lyapunov 
function for the closed loop system which is 
obtained as a convex combination of two 
Lyapunov functions, one for the slow and one 
for the fast time scale approximation of the 
system. 

With the notation introduced in Sections 3 and 
4 (assuming Cj = 0, C 2 = /^, c = 0), the closed 
loop system can be written in the form 

x=A^lq)x - + j 82 .v(rv)] 

+ M^) + Ai 2 iq){z - H{x, q)), (Sla) 
liz = M 22 (< 7 )(z - H{x. q)). (51b) 

where 

H(x, q) = -Ai 2 \q)[A 2 ,iq)x 

+ B2iq)Mivix) + b^iq)]. (52) 

Consider now the positive definite quadratic 
function 

V(x)^x^P,yX, (53) 

and evaluate its derivative along the solutions of 
system (51). Using equations (42) and (45) we 
have 


V{x, z, t) = lx^Pft{A,(.x - fi„Ll/8,v + /l 2 .v(yv)j 
+ fio(£K-r +AV)} 

+ 2x'P,,Ai2iz - 




llyv|P 

1 + llyvll 


+ 2 ||v||^ 


+ 2/CE,. llvll lk|| + 2A:,, ||v|| 

+ P.Jkll Ik( 54 ) 

where 

2 max llPoi,2(9)11. ( 55 ) 

ry 


Using (50) we obtain 

V{x, z, t)<-p„ \\xf + pi+p^, ||j:|| ||z -/y||, 



(56) 

PXX ~~ ^m\n{Q(l) ^ y ^ , J 

2(Pi«;.- I) 

(57a) 

Pi -2 y"'^2A:/.. 

(57b) 


Consider now the 9-dependent solution 5(9) 
of the Lyapunov equation 

^22(9)^‘5(9) + S{q)A22{q) = (58) 


In view of the first part of Assumption 5 this 
solution is positive definite on Q. Moreover, 
since A 22 ( ) is bounded as a result of the 
continuity of A 22 (-) and the compactness of Q, 
S{-) is also bounded. Differentiating equation 
(58) with respect to 9 ,, for i = 1, 2, . . . , 9 , we 
obtain 


, , ,.,35(9) , dS(q) 

^ 22 ( 9 ) 2 ^ ^ 22 ( 9 ) 

oqi dq, 


^^22{q)^ 

3q, 


Siq) + S{q) 


dA22{q) 
dq, J’ 


for / = L 2 , 


(59) 


As a consequence of part 1 of Assumption 5, the 
solutions 3S{q)ldq, of equation (59) exist, and 
they are bounded since A 22 {') and dA 22 {')ldq, 
are bounded by Assumption 2, and .V( ) is 
bounded. From this and from Assumption 1 , it 
follows that finite positive constants 
£ = 1 , 2 , . . . , r;, exist such that 


BS{q{t)) 

3q, 



Sryt * 


j for any q() 

[ and any t e J. 


( 60 ) 


Let us now consider 

Wix,z,t)^[z-H{x,qm'^{<l{()) 

X [ 2 -//(x, 9(0)1; (61) 

evaluating its derivative along the solutions of 
closed loop system (51), we obtain 


aw dw 

lV(x, z, l) = -—x + —- z 
ax dz 


dW 

+ -- 

dt 


= -2(z- HyS'J,x 

+ 2 p ‘(z -H)'SA22 (z - H) 
- 2 (z - HySJ„q 


+ 


3S 


~{z-H)q,. (62) 


!- I 


In the previous eo^uation 7^ and J are the 
Jacobian matrices J.^dHldx and — 3HIdq. 

The first one can be shown to be continuous and 
bounded; as regards the second one, it is 
continuous in view of Assumption 2 , and the 
bound on its norm is affinely dependent on the 
norm of x. Thus we can compute constants 
hjgo and such that 

\\S{q)JAx.z)\\^kj,, (63a) 

||5(9)y,(j:, 9)11 + Ikll, (63b) 

for all 9 e Q and x e R". 
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Equation (62) results in 

\\z-H\\' 

+ 2kj, \\z-H\\ ||i|| 

+ 2kj,,,k^ \\z-H\\ 

+ 2kj^,k^\\z-H\\ 11^1 

+ (64) 


Since equation (5la) implies that 

+ + II 2 -//II, (65) 


for some scalars fli, a. ^0, it follows ihal 


W(x, z, 



with 

+ ■5'^. Ik II l|Z - W|| + .Vr Ik - 

h\\, (66) 


•'V.' ~ 2/c_,,a + ^ , 

f 1 

(67a) 


s,A2(kj,a, + A,„,A^), 

(67h) 


i, = 2{kj,a, 4- A,,,„A,,). 

(67c) 

Now 

we are able to define the 

following 


Lyapunov function candidate. 

L{x, Z.t) = {\ -)()V’(a) ■>rxW(x. 2 , r), (6«) 

where x ^ constant belonging to the 

interval (0, 1). 

A bound on its derivative along the solutions 
of the closed loop system (51) is easily evaluated 
from (56) and (66) as 


Ux, z,f)^-(ll^ll ll^-H||) 


(1 -x)/’.. 


(1 -x)/hz + 


0-X)P,z+X>Ikz X,, 

- -( 1 -^.A.J 

2 P 

.( Ikll \ 

(0 ^(^--X)Pi- (^i9) 


The upper bound on jU for the existence oi an 
interval (Xi, X 2 ) of values of x for which the first 
matrix of equation (69) is positive definite is 
given by Saberi and Khalil (1984) as 




P.. 

PxK^ZZ Pxz^xz 


(70) 


Thus it is easy to verify that, for any 
there exists a family of functions L for which the 
hypc»theses of the lemma reported in the 
Appendix apply. Uniform ultimate boundedness 
of the trajectories of the closed loop system (51) 
follows- 


Remark 8. Although fi* is actually computable 
via (70), it is, in general, a conservative estimate 
of the actual upper bound of admissible values of 
the perturbation parameter. Our emphasivS here 
is on the existence of such a bound rather than 
its computation. 


6 AN hXAMVUl 

Consider the system illustrated in Fig. 1. The 
two bodies of masses m, and m. arc connected 
by a massless, flexible rod characterized by a 
free length /„ and a linear stiffhi'ss coefficient 
k >0. The distance between the origins of the 
two coordinate systems, C7, and C>>, is /,); u and q 
are a control and an uncertain disturbance force, 
respectively. Moreover we assume k/(OI~^£/ 
and \q{t)\ ' k,^ for some known k^ and k^^. 

One can recognize that, for large but finite 
values ol k, this mechanical system has a 
two-time scale behavior The ''slow” behavior is 
related to the rigid body motion of the system; 
the “fast” one is related to the oscillations of m\ 
and rn^ at the ends of the slightly flexible bar. 
Since no damping is assumed in the rod, the fast 
behavior is only marginally stable, and this is 
bound to cause stability problems if one designs 
a controller on the basis of the slow dynamics 
only. Hence, out aim is to stabilize the system in 
the "ultimate boundedness” sense, by means of 
a "composite’’ action on the fast and slow time 
scales. 

The motion of the system is described by 


m,ri - k{r 2 - r,) + w, (71a) 

4* (71b) 


Choosing jU = A: ‘letting 


/' 2 \ ^ A 

V,/’ " \k^'\r2-r,)r 


(72) 


a 






Fio. 1. A simple nicchLinicai syslem 
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the system is described as follows: 


the solution of Riccati equation (45) is given by 


vjuiz 



k. (73) 


with 1/Af^, - 1/^1 -h 1 /^ 2 - System description 
(73) is now in the form of system ( 1 ). (For 
further reading on the singular perturbation 
approach to flexible mechanical systems, see 
Ficola et al. (1983), Marino and Nicosia (1984) 
and Spong (1989).) 

The quasi steady state behavior of the z state, 
obtained by setting ^ = 0 , is given by 


z,„ =-(Mp/m,)u + (74a) 

2()2 ~ 0- (74b) 


Flence, the reduced-order system is 




where A/,— mi- 1 -^ 2 . Note that equation (75) 
describes the system in Fig. 1 when the rod is 
rigid. 

It is readily seen, from equation (75) that the 
uncertain disturbance is matched with the input 
matrix. 

The boundary layer system associated with 
equation (73) is only marginally stable. In this 
case one can show that, for any slow feedback 
law u = - giJCi ~ which stabilizes the nomi¬ 
nal (i.e. with q(t) = 0) reduced-order system, the 
full order nominal system is unstable for any 

> 0 . 

The boundary layer system is asymptotically 
stabilized by any z-feedback of the form 


M =/, 2 , -h/ 2 Z 2 + a, (76) 

with 

/,>-l-m,/m 2 , /2>0. (77) 

Following Spong (1989) we feedback only Z 2 , 
i.e. we let /, = 0. In this case, as can be seen 
using equations (74) and (76) the reduced-order 
system (75) is unaffected by z-fecdback. 
Moreover, since its system matrix in our 
notation) is not uncertain, the optimization 
described in (39)-(41) is not necessary. We have 


Choosing 


G-1, = 





(78) 



It follows that 

V = B^PiyX = Mf{x^ + 2x2), 

P, = i. 

The full controller, expressed in terms of the 
original state variables of equation (71), is as 
follows: 

u = /2A:^^"(/2 - r,) - P^M,{r2 + Ir.) 

-li2siyM,(r2^2r2)), (79) 

where the function s is defined in equation (46). 
Theorem 4 assures us that, provided f2>0y 
>^, and p 2 ^ 1 ^ 2 ^ the trajectories of the 
closed loop system will be uniformly ultimately 
bounded for k sufficiently large. 

6 .1. Numerical simulation results 

Here we present some numerical simulation 
results carried out with m, = lkg, m 2 = 2 kg, 
<y(/) = 5 cos (0.5r) N, A:^ = 5N, and choosing 
/2 = 1. With /^, = 1, /f 2 - 5 . we set /j, = L5^i,, 
p 2 - Pz J^nd y = 5(K)/9. The initial conditions 
were r,(0) = ^ 2 ( 0 ) = 5 m, ri(()) ~ niO) “ 0 m sec' 
First we consider the behavior of the 
reduced-order system under controller (79); in 
Fig. 2 the time history of the displacement r^ is 
plotted. The efficacy of the nonlinear component 
of the control law can be appreciated by 
comparing Fig. 2 with Fig. 3, for which the 
simulation has been carried out on the 
reduced-order model using the linear controller 
only, i.e. setting “ 0 Figure 4 depicts the 
behavior of r 2 for the full order system subject to 
controller (79) and with A:=^l(y*Nm ‘. No 
appreciable difference can be noticed in 
comparing with Fig. 2, A more visible change in 
system behavior can be noted in Fig. 5, which 
corresponds to k = 1(K)N m“*; however, the full 
order system maintains its boundedness. Figure 
6 illustrates the loss of boundedness which 
occurs for fc = 4 N m~'. 

Finally, we illustrate the consequences of not 
using z-feedback, i.e. letting /2 = 0. With /2 = 0, 
the response of the reduced-order system 
remains unchanged. However, even for 
the full order system exhibits 
unbounded behavior; see Fig. 7. 

7 CONCLUSIONS 

In this paper we have considered the 
deterministic control of a linear singularly 
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perturbed system subject to time-varying bo¬ 
unded parameter uncertainties and unknown 
bounded disturbance inputs. We do not assume 
that the boundary layer system is stable. It has 
been shown that matching of the uncertainties 
with a nominal input matrix, assumed on the 
reduced-order system, is invariant under linear 
output feedback. On this basis a composite 
controller has been proposed, linear in the fast 
variable and nonlinear in the slow variable, 
which, for jU sufficiently small assures uniform 
ultimate boundedness of the trajectories of the 
closed loop system, independently of the 
uncertainties. The approach which has been 
followed is different from the usual composite 
control technique, although there are some 
similarities and these have been pointed out. 
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APPENDIX 

Some definitions and a lemma 

We refer to a differential equation of the form 

m=ny(0j). m 

with y teR and F(>): xR-f>R^. We say that the 

trajectories of system (80) are uniformly ultimately bounded 
within a spherical neighbourhood of radius p of y = 0 
(indicated with B(p)) iff the following properties are 
satisfied. 

Existence of solutions. Given any (y„, tf^) e R^ x R there 
exists a solution y( ):lfo. of equation (80) with 

indefinite extension of solutions. Every solution 
yCO-Kii of (80) has an extension over [t,,. °°) 

Global uniform boundedness. Given any bound reR^, 
there exists a positive d(r) < » such that if y( ) is a solution 
of equation (80) with y(tj € B(r), then y(t) e B(cl(r)) for all 

Local boundedness within B(p), There exists a spherical 


neighbourhood B(po) of y = 0 such that, if y( ) is a solution 
of equation (80) with y(fo) e fl(Po) then y(r) £ B(p) for all 

Global uniform ultimate boundedness within B(p). Given 
any bound r e there exists a Tir) e R^ such that, if y( ) 
is a solution of equation (80) with y(/„)EB(r), then 
y(/) 6 B(p) for all 12: to + T(r). 

The listed properties of system (80) can be assured with the 
aid of the following lemma; see Corless and Leitmann 
(1981). 

Lemma 1. Consider system (80) and suppose that there 
exist a positive j <>», a C* scalar function ^:R^ x R—^R^, 
class KR functions Xi and ^nd a class K function Xj such 
that 

xMy\\)^^iy>0^X2(\\y\\)> (8i) 

+ 0f(y. 0^ -^jdlyll). (82) 

for all leR and Then for any p>Xi 

trajectories of the system are uniformly ultimately bounded 
within B(p). 
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Systems. Part 1: Perfect State Measurements 
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Design of composite controllers for singularly perturbed systems under the 
H" criterion requires a much different methodology than that available for 
the singularly perturbed regulator problem 
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Abslmcl —We study the H optimal control of singularly 
perturbed linear systems under perleel st ite measuiements 
for both finite and infinite horizons Using i differential game 
theoretic approach we show that is the singular peitiiibttior 
parimeter e appro lehes zero ihe optimil disturbance 
attenuation level for the full order system under i quadratic 
performance index converges to i vtIuc that is bounded 
above by the miximuin of ihe optimal disturbance 
attenuation levels lor the slow and list subsystems under 
appropriate slow and I isi qu idi iiic cost functions 
1 urthermore we eonslrucl i composite eonlroller bised on 
the solution of the slow and last g lines which gutr.mtees i 
desired achitviblc perlorminet level loi the lull Older plant 
as c ipproaches zero A slow eonlroller however is not 
gtnerill) robust in this sense but still under some 
conditions which are delmcited in the piper the last 
dynamics can be totally ignored The piper also studies 
oplimalit) when the controller includes a feedforward term 
in the disturbance and presents soiiil numeric il ex imples to 
illustrate the Iheorelieal results 

I INIRODUCIION 

Onf of 7 HF iMPOKiANi Fcccnt dcvclopmcnts in 
control theory has been the recognition of the 
close relationship that exists between H -optimal 
control problems and a class of linear-quadratic 
differential games which has not only led to 
simpler derivations of existing results on the 
former, but also enabled us to develop 
worst-case (// -optimal) controllers under var¬ 
ious information patterns, such as (in addition to 
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perfect and imperfect state medsurements) 
delayed state and sampled state measurements 
An up-to-date coverage of this relationship and 
the derivation of /T'-optimal controllers under 
different information patterns can be found in 
the recent book by Ba^ar and Bernhard (1991), 
which also contains an extensive list of 
references on the topic 

This paper makes further use of the 
relationship between //" optimal control and 
differential games in studying the issue of 
worst-case controller design for systems with fast 
and slow dynamics—systems commonly modeled 
using the mathematical framework of "‘singular 
perturbations The mam objective here is to 
obtain "approximate’ controllers independently 
of the “small singular perturbation parameter, 
say c and to prove that the approximate 
controllers can be used ‘"reliably” on the original 
system when c is sufficiently small It is, of 
course well-known that in the absence of the 
disturbance, a composite design based on two 
separate designs for the slow and fast subsystems 
performs remarkably well in the linear-quadratic 
(singularly perturbed) regulator problem, with 
the approximation to the optimal cost being 
0{e^) (Chow and Kokotovic, 1976) Another 
appealing feature of the compc^site controller (in 
the regulator problem) is that it avoids the 
difficulty of dealing with stiff differential 
equations 

One natural question here is whether the 
counterpart of the “singularly perturbed reg¬ 
ulator theor>” uan be developed for //“-optimal 
control, or equivalently for the related class of 
linear-quadratic zero-sum differential games, by 
going through similar steps One of the messages 
of this paper is that this is not necessarily so, 
and one has to develop a separate theory by 
going through some new lines of reasoning. 
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which has indeed been accomplished in the 
paper. To make the above statement somewhat 
more precise, let denote the //’"-optimum 

performance of the full-order system under 
perfect state measurements, and and y, 
denote the //"-optimum performances of (ap¬ 
propriately defined) reduced slow and fast 
subsystems, respectively. Our first result is that it 
is not necessarily true that y*(e = 0) = y^, and an 
//*-norm bounding controller for the reduced- 
order system that guarantees a performance (for 
the reduced system) in the neighborhood of y, 
could lead to a catastrophic performance for the 
full-order system, no matter how small c is. In 
terms of the soft-constrained game associated 
with the full-order system, such a feedback 
controller (which depends only on the slow state, 
by construction) may lead to an unbounded 
value. This is quite in contrast with the limiting 
properties of feedback controllers in 
disturbance-free optimal control problems with 
singularly perturbed dynamics, but is not that 
surprising in view of some existing results on 
singularly perturbed Nash games (Gardner 
and Cruz, 1978) where it is known that the 
well-posedness (or, continuity at € — 0) depends 
on the consistency of the information structure 
between the full and reduced-order problems. 
Zero-sum differential games fall somewhere 
between optimal control problems and nonzero- 
sum differential games, and there the saddle- 
point value shows continuity whenever it exists 
for both the full and reduced-order problems, 
while the condition of existence depends on the 
consistency of the underlying information struc¬ 
tures. But this is precisely what determines the 
optimal level of performance and the cor- 
resjxjnding minimax controller in //'- 
optimization, which therefore requires a careful 
study of the dependence of conjugate points in 
zero-sum differential games on the small 
(singular perturbation) parameter c. This paper 
undertakes such a study and provides a complete 
analysis of the singularly perturbed // -optimal 
control problem under perfect state measure¬ 
ments. It is proven that y*(c = 0) max {y,, y,}, 
and that a composite controller exists under 
which such a performance bound is attained 
“approximately”. 

Design of controllers tor singularly perturbed 
systems subject to unknown disturbances has 
been studied before in the literature, notably in 
papers by Garofalo and Leitmann (1988, 1990) 
and Corless et al. (1990), where the objective 
has been to obtain composite controllers that 
guarantee stability of the overall (possibly 
nonlinear) system. The main approach of the 
authors in these papers has involved the 


construction of appropriate Liapunov functions, 
in terms of which a class of stabilizing controllers 
has been characterized. However, no optimality 
properties have been associated with these 
controllers, which is our main concern in this 
paper. Yet another paper that deals with 
uncertain (linear) systems which exhibit time- 
scale separation is the one by Luse and Ball 
(1989), which obtains a two-frequency scale 
decomposition for //’"-disk problems, but does 
not address the issue of optimal controller 
design. 

The present paper is organized as follows. In 
the next section (Section 2) we formulate the 
singularly perturbed //’"-optimal control problem 
and the associated linear-quadratic differential 
game, both under perfect state measurements. 
We also provide in that section the solution to 
the full-order problem, for both finite and 
infinite horizons. In Section 3, we identify the 
slow and fast subsystems and the associated 
differential games. There are in fact two “slow” 
games, differing only in the information 
available to the controller, both of which are 
relevant to the problem at hand. The saddle- 
point solutions to these “slow” games as well as 
to the “fast” game are also presented in Section 
3. In Section 4, where the main results are 
presented, we obtain composite controllers in 
finite and infinite horizons, prove optimality 
properties of these composite controllers in the 
context of the original //'-optimal control 
problem and obtain precise (tight) performance 
bounds attained by them. Section 3 presents 
some numerical results to illustrate the theory, 
and Section 6 provides a discussion on 
extensions to other information patterns; some 
of this work has already been completed and will 
appear elsewhere, and some is currently 
underway. The paper ends with four Appen¬ 
dices, which provide the details of some of the 
derivations given in the main body of the paper. 


2 PKOHLLM KJRMULATION 
The system under consideration, with slow 
and fast dynamics, is described in the standard 
“singularly perturbed” form by 

x, = Au{()x^ + Aa{t)x2 

+ By{t)u 4- D,(r)w; JCj(O) = 0, 

( 2 . 1 ) 

cjct ~ >421 (/)jrI 4" A22^^'^^\ 

4- B2{t)u + D2{t)w ; jr2(0) = 0, 


where jc':= (jrl, JC 2 ) is the n-dimensional state 
vector, with jc, of dimension n, and JC2 
dimension u is the control input, 

and w is the disturbance, each belonging to 
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appropriate (i?") Hilbert spaces and 

respectively, defined on the time interval [0, t,]. 
The control input u is generated by a closed-loop 
control policy pi, according to 

u(t) = fi(t, x,o,„). ( 2 . 2 ) 

where '-[O, tf] x is piecewise con¬ 

tinuous in t and Lipschitz continuous in x, 
further satisfying the given causality condition. 
Let us denote class of all these controllers by M. 
With this system, we associate the standard 
quadratic performance index: 

L(u, w) = |jr(i,)le, + I' (U(0lo(,) + l«(0P) d/ 

Al 

-|jc(rf)|y,+ ||x|hy-hl|u||^ 0.^0, 

C>()-0, (2.3) 

where Qt will show dependence on e >0, as to 
be clarified later. Let us also introduce the 
notation w) to denote L(u, h'), with u given 
by (2.2). The W-optimal control problem is the 
minimization of the quantity 

sup {7(f/. h')}''^/||h'| (2.4) 

over all permissible controllers fj, and in the case 
a minimum does not exist, the derivation of a 
controller /i that will assure a performance 
within a given neighborhood of the infimum of 
(2.4). Let us denote this infimum by y*(F). i.c. 

mf sup (/(^i, *v)}''V Itvll = y*(e), (2.5) 

flf- 4^ ►v F 

where we explicitly show the dependence of y* 
on the singular perturbation parameter ^ >0. 

For each e ^ 0. we can associate a soft- 
constrained linear-quadratic differential game 
with this worst-case design problem (see Ba§ar 
and Bernhard, 1991) winch has the cost function 

L,{u,w) = L{u,w)-f\\w\\\ (2.6) 

The performance level y*(c) in (2.5) is then the 
“smallest" value of y^^O under which the 
differential game with state equation (2.1) and 
cost function (2.6) has a bounded upper value, 
when u is chosen according to (2.2). We also 
know that for every fixed e >0, and for each 
y>y*(c), this differential game admits a 
saddle-point solution, with the saddle-point 
controller /i* being a linear feedback law (sec 
Ba§ar and Bernhard, 1991). 

Even though the problem formulated above 
has been solved completely for every e > 0, the 
computation of y*(e) and that of a correspond¬ 
ing //^-optimal or suboptimal controller for 
small values of e>0 present serious difficulties, 


due to numerical stiffness. To remedy this, we 
pose in this paper the question of whether y*(6) 
and the //^-optimal controller can be deter¬ 
mined, for small values of c^O, by solving 
well-behaved e-independent smaller-order prob 
lems, as in the case of the singularly perturbed 
linear-quadratic regulator problem (Chow and 
Kokotovic, 1976). Another question of interest 
is whether the fast subsystem dynamics can be 
neglected completely in the design of such 
controllers, and if so under what conditions. 

Before studying these questions, we first 
present below the solution to the full-order 
problem, under three basic assumptions; 

Al. Qi and Q( ) in (2.3) are partitioned as 




0m(O (2i2(0 
(? 2 l (0 C? 22 ( 0 - 


where m each case the 11-block is of dimension 
and the 22-block is of dimension 

^2 x: ^ 2 - 

A2. The matrix functions 5,(0» 

D,{() (/ = 1, 2, 7 = L 2) arc continuously 
differentiable in r 0. 


A3. The matrices A 22 U) and Q 2 AO are inver¬ 
tible for all t € [0, tf]. 

Let us further introduce the notation A{t) and 
A,(/), to denote the partitioned matrices 



^ 12(0 
>^ 22(0 - 


AAt) = 


A\]t 

( 1 /^)'^ 2 l (0 


^ 12(0 

(1 / e)i422(0 - 


(2.7) 


where we take e >0. Similarly, we introduce the 
partitioned matrices 


^( 0 - 

/->( 0 - 



( 2 . 8 ) 

(2.9) 


and define 


( 2 . 10 ) 

SAr. y) : = BAOBAO - 

y 

with the ijih block of S{t; y) denoted henceforth 
by S,At; y). ij= 1,2. 

We know from the existing theory on 
linear-quadratic differential games (see Ba§ar 
and Olsder, 1982; Ba§ar and Bernhard, 1991) 
that for each e>(), there exists a y{e)^0 such 
that for all y>y(e) the zero-sum differential 
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game described by (2.1) and ( 2 . 6 ) has a bounded 
upper value (which in this case is equal to zero, 
because the initial states are zero) as well as a 
saddle point, with a saddle-point controller being 
the feedback law 


u*(t) = p-^(r, jc( 0 ) = e)x(tl t - 0 , 

( 2 . 11 ) 

where Z(t; e) is the unique bounded nonnega¬ 
tive definite solution of the generalized matrix 
Riccati differential equation (GRDE) 

Z + + ZA, - ZS/Z -h e = 0 ; Z(t,) = Q,. 

( 2 . 12 ) 

For y < y(r), on the other hand, this GRDE 
has a conjugate point in the open interval ( 0 , r^), 
which says that it has finite escape, and 
furthermore the associated soft-constrained 
game has unbounded upper value. The level 
y( 6 ) is indeed the W”-optimal performance 
y*(c) given by (2.5). 

Now, to study the behavior of the solution of 
( 2 . 12 ) for small values of e > 0 , we first partition 
Z as follows (in a way quite analogous to the 
partitioning in the standard regulator problem, 
and consistent with the given partitioning on 

Q^Y 


Z\x fZ |2 

• ^^22- 


(2.13) 


where Z |2 = Z 2 i. Then, by substitution of this 
structure into ( 2 , 12 ), it is a simple matter to see 
that Zn, Zi 2 and Z 22 satisfy the following matrix 
differential equations: 

Zii + y4iiZ,i -h A2\Z^2~^ ^\\^[\ 2i2^21 Qw 

— Zi|lS,iZ|1 — Zx2^l'Z\ \ — Z] \ Sx 2 Zx 2 
-Zi 2 i 22 Z ;2 = 0 ; Z,,(rr) = 0tn. (2.14) 
€Zx2'^ ^A nZi2 ^21 ^22 12 212^4 22 C?12 

— eZxxSxxZ\2~~ ^2125? 1 Z 12 ~ Zii 5)2^22 
"^ 12 ‘^ 22^22 ~ fli ^\2{U)—Qn2i (2-15) 

^^22 122 i 2 ^ ^4 22^22 ^Z 12^4 |2 

4" ^ 22^22 Q 22 “ Z 12 

“ ^^ 22 ^ 1^12 ~ ^^ 12 ^ 12^22 
”-^ 22 ^^ 22-^22 ~ ^22ih) — Qf22' (2.16) 

Let Zji, Z 12 and Z 22 denote the limiting 
solutions to above equations in the open interval 
(0, ff), as 6 i Ot. Then Z,i, Z^ and Z 22 are given 
by the following differential and algebraic 


tThe existence of such limits will be shown later in this 
section (see Theorem 2) 


equations, which are obtained by setting e = 0 in 
(2.14)-(2.I6): 

Z\x A \iZ\x T i42]Zj2 4- Z 1 2^421 

Gil ” ~ ^12^21^11 ” Z[lS\2Z\2 

-Zx2S22Z[2^Q; Zii(tf) = Gni, (2.17) 

^21^224" Z,,>ti2+ Zi2^22 Gi 2 ” Zn^l2Z22 

”^ 12^^^22222 = 01 (2.18) 
-422^22 4" Z22-422 4" 022 ” ^ 22 *^ 22^22 = 0. (2. 19) 

Since as the solution Z 12 and Z 22 above 

will not, in general, satisfy the given terminal 
conditions (in (2.15) and (2.16), respectively) we 
have to introduce a boundary layer correction 
term (see Kokotovic and Yackel, 1972). Toward 
this end, introduce the fast time variable 
T = (t ~ tile), and rewrite (2.15) and (2.16) on 
this time scale, as c i 0 : 

;^^I 2 (t)= -i^,2(T)(v4 22 (/|) 
dr 

” ^^22(6)L22(t)) — (>42|(f,) — Zii(f()5i2(tf))L22('r) 

-Zn(td^niti)~Qn(tiY ^ 12 ( 0 ) = Gtn, ( 2 - 20 ) 
d 

J L22(t) 
dr 

“ ~^22(h)L22('^) “ L22(t)/422(6) 022 ( 6 ) 

4-L22(‘r)522(6)L22(r); ^ 22 ( 0 ) - 0f2?. (2.21) 

Define 

Z;H(T): = Li 2 (r)~Z, 2 (/r). ( 2 - 22 ) 

Zx^(t) \= L.iiT) - Z 22 ( 6 ) (2.23) 

Then the solution of (2.14)-(2.16) will be 
approximated by 

Zi,(/) = z,i (0 + G(O. 

z, 2 (t) = z,,(n ++ «(e). vt 61 (), /r], 

(2.24) 

Zz.(t)= z,Ai)+ 

Conditions under which this approximation is 
valid will be delineated later in Theorem 2 of 
Section 4. 

Of course, the preceding analysis is valid 
provided that y is larger than .sup {^(e), 0< e < 
e,,}, where is some prechosen small (positive) 
.scalar. 

To study the infinite horizon case (i.e. as 
t(—»«i, as well as when tf= “), we take A, B, D, 
Q to be time-invariant, and Q, = 0. Further¬ 
more, wc assume that (/f,, fl^) is controllable, 
and (Af, Q) is ob.servable for every e > 0 . Then, 
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for each f >0, there exists a y«(f) such that, for 
all the infinite-horizon soft- 

constrained game has a finite upper value, 
achieved by the time-invariant feedback 
controller 

= -B[Z^ie)x{t), rs?0, (2.25) 

where Zcc>0 is the minimal positive definite 
solution of the generalized algebraic Riccati 
equation (GARE) 

+ + 0 = 0. (2.26) 

The level y*(e) is again the W-optimal 
performance level (y!(c)) of the infinite-horizon 
disturbance attenuation problem, and for 7 < 
YrJ^e), the soft-constrained game has infinite 
upper value. 

As in the finite-horizon case, we now 
substitute the structure (2.13) into (2.26), to 
arrive at the following coupled algebraic 
equations for Z.,i|, Z ,,2 and 

2.^1 ]y4 1 , + Zj^I 2 A 2 i 

3" 011 Zacil^VliZorii — Z, 12^^21^=^ 11 

— Z:r 1 1 ^^|2Zr, 12 “ Zar 12^^22 Z.r P = 0, (2.27) 

CA t\Zr^\2 + A 21 Z r22 2-,.11 A 12 + Z, 12 A 2 ? 

+ 012 — 6^Z^lpVi|Z, 12 — 

- Z.,pSpZ.22 “ Zp2^S;2Z.22 = 0. (2.28) 

eA 12 Z ri 2 A22^^22 cZ^pA 12 22^22 

G 22 ~ ^ 12 

- fZ:,2.Vi2Z.22 “ = b- (2.29) 

Letting Z^^n, Z^p and Zr 22 denote the 
limiting solutions to the above equations as 
eiO.t we obtain (by setting c = 0 in (2.27)~ 
(2.29)) that Zxip Z.p and Z .22 satisfy the 
following coupled matrix equations: 


llZ^,U 

+ A 2 iZjc 2 

, + z 


Cn 

-z. 

cipVijZx 1 1 

-z. 



Z.2, 




— Z X 12 ^^ 22 ^^ 21 

= 0, 

(2.30) 

2\Zy.22 

+ Z r 1 1 A 1 

2 + Z 

X 12 A 22 + ^ 12 




— Zrx,| l5p 

Z X )22 ' 

- Z^p5'22Z^22 

= 0, 

(2.31) 

^12^- 

to 22 ‘^«22 

A 22 A 

022 — Z^>2^^2 

2 ^'22 

= 0. 


(2.32) 


Thus completing the analysis of the direct 
solution to the full-order problem as e i 0, we 
now turn, in the next two sections, to the 
original goal of this paper—which is the 
derivation of the approximate solution based on 


tThe existence of such limits will be verified later in 
Section 4 (sec Theorem 2). 


a time-scale decomposition. First we identify, in 
Section 3, the slow and fast subsystems 
associated with the original problem, and obtain 
the solutions of two separate //^-optimal control 
problems, one defined on the slow time scale 
and the other one on the fast time scale. 


3 A riME-SC\LF DECOMPOSITION 


3.1. The slow subsystem and the associated soft- 
constrained game 

To obtain the slow dynamics associated with 
(2.1), wc let c=-0 and solve for jc^ (to be 
denoted JC 2 ) in terms of jri=;jr,, w = :w,, w = :w^, 
and under the working Assumption A3; 

Jtn = - A 22 (A 2 iJr, + + ^ 2^0 (^- 1 ) 

Using this in the first equation of (2.1), wc 
obtain the reduced-order (slow) dynamics; 

X, = Aox, 4 B^^u, + DoW., (3.2) 


where 

A{)‘. = AII — A i2A2i^A i^, (3-3) 

B,,-B^-A,2A22B2, (3.4) 

Ai: = D,-ApA.z/)^. (3.5) 

Using (3.1) also in the cost function (2.3) leads 
to the reduced (slow) cost (with V) 




y.M+f (ksic^M+ 

■^X2QuX^ + |A.-,|-y„+ |wj“)df. 


(3.6) 


In view of our earlier discussion, the 
//'-optimal control problem with state equation 
(3.2) and cost function (3.6) is ck)sely related to 
the zero-sum differential game with the same 
state equation and with cost function 

= y^^llv (3.7) 

and the quantity of interest is the upper value of 
this game. But note that this differential game 
(henceforth called the '‘slow game”) is structu¬ 
rally somewhat different from the “full” game 
discussed in the previous section, because the 
cost function (3.6) now includes “cross terms” 
between the state x,, control u^, and disturbance 
IV,. The presence of these cross terms makes it 
necessary to distinguish between two different 
types of information structures: one where the 
control is allowed to depend only on the state 
(and not directly on the disturbance), and 
another where the control is allowed to depend 
on both (the current values of) the slate and the 
disturbance. We will refer to a controller of the 
former type as state feedback (SF) controller, 
and to one of the latter type as full information 
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(FI) controller. As we shall see shortly, these 
controllers require different existence conditions, 
and hence lead to different optimal values for 
y—of course the one associated with the SF 
controller being more restrictive than the one 
obtained under the FI controller. In the absence 
of the cross terms, however, no such distinction 
exists (see Ba^ar and Bernhard, 1991). 

We now study the solution to the slow 
differential game under these two controllers (or 
information structures) separately, since both 
will be needed in the analysis to follow in 
Section 4. Let us first consider the case of the FI 
controller, which essentially involves a distur¬ 
bance feedforward term. Compatible with the 
given information structure, we introduce the 
transformations 

■ [^s "i" (f ^ 2^22 'C? 22 A 22 *® 2 ) 

• B 2 A 22 'Q22^22(,i^2l ~-^22022'C?2l)jf»+f^2‘*'s)]. 

(3.8) 

[Wj + (y ~ ^2i.^22Q22 ^22 B 2 B 2 ,) ' 02 ) 

• [ 02 (A 22 022*^22 + B 2 B 2 ) 

' (^22Q22Q21 ~ ■^2i-^s)]< (3-9) 

where the latter one is valid under the condition 

y I ~ D2iA22Q22^22'^ B 2 B 2 ) (3.10) 

which can equivalently be written as 

S22(r)+A22Q22^22>0, (3.11) 

where 

522(y): = fl2fi2-;p£>2£>i- (3.12) 

If we use (3.8) and (3.9) in (3.2) and (3.7), we 
arrive at the following standard LQ differential 
game, which has no cross terms between state, 
control and disturbance in the cost (see 

Appendix A for details of the underlying 
manipulations); 

x, = AoX,+ B,A + D„w,- x,(0) = (), (3.13) 

= W0)ly„, + f (kl^ + |u,|" - Iw,!") dt. 

■'O 

(3.14) 

The coefficient matrices above are explicit 

functions of the parameter y, and are written as: 

”^ll “>412022 C?21 “ (‘SlZ 3" >4 12 022 >4 22) 

* (^ 2+>422022 >422) ' ■ M2I “ >422022 02 l)r 

(3.15) 

^o(y) “ -f B 2 A 22 (3.16) 


^o(y) — (^0 flofi2(>422022^>422 + ^ 2 ^ 2 ) ^^ 2 ) 

• (y^/- Z>2(>42202“2>422+ fl2fl2) ’02)”‘^, 

(3.17) 

0 (y) “ 011 “ 0 12022 021 + (>421 “ 0 12022*>422) 

■ ^22 + >422022 >422) * ’ (>421 ~ >422022^ 021 )■ 

(3.18) 

For each y^O, we associate the following 
generalized RDE with this standard LQ 
differential game: 

-AA;)Z, + 0=(); Z,(O = 0fn. (3.19) 
Let us introduce 

5o: = A,A;- A)^o, (3.20) 

which can be rewritten in terms of the original 
system matrices as follows (see Appendix A for 
details): 

^0 ~ + >4i2 022*>4i2 

“ (‘^^n + >4 12 022 >422)("^22 + >422 0 22*>4 22) ' 

‘ (^^21 + ^22022*^ Iz)- (3.21) 
In view of this, let us introduce the set: 

f,: = {y' > 0 : Vy ^ y',. 922 (y) + > 42 ? 022 *> 4.22 > 

and (3.19) has a bounded nonnegative 
definite solution over |0,rf|.} (3.22) 

and further define 

y,: = inf{yEr,}. (3.23) 

Then, it follows from the analysis of Baf^ar and 
Bernhard (1991), Chapter 8, that the trans- 
formed game with cost function Ly^ has a 
bounded upper valuet if y > y,, and only if 
y^y,. For y>ys, let Z,y be the unique 
nonnegative definite solution of (3.19). Then, 
there exist strongly time consistent (feedback) 
saddle-point policies for the transformed game, 
given by 

u*y=fi:y(t, X,(t)) = (3.24) 

< = K(f- = 0;,Z,,xM)- (3.25) 

Applying the inverse transformation of (3.9) to 
(3.25), we arrive (after some matrix manipula¬ 
tions, details of which can be found in Appendix 
A) at the following expression: 

tt'sy — '^sr(f. ■*^•.(0) ~ (^|2,y — D2iS22 

+ A22Q2M22r'i{S2^+A22Q22^n)Z,y 

- (Az, - A22Q22Q2,)))x.(n- (3.26) 

t Note lhal under condition (3.11), Q{y)^0, 
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The inverse transformation of (3.8) applied to 
(3.24) yields: 

= ■“(/ + B 2 A 22 Q 22 A 21 B 2 ) ^ 

{B^)Z^y — B 2 A 22 ^QizAzz 
' (^22022 Q 2 I “ ^ 2 l))-^s (0 

® 2 (^ 2®2 ^ 22022 ^^ 22 ) (^*^7) 

which IS a controller that achieves the finite 
upper value (which is zero) m the slow FI game 
Note that this controller involves a feedforwaid 
term in as expected. If, however, we wish to 
obtain one which depends only on the state, we 
simply substitute (3.26) into (3.27), to arrive at 
the following expression; 

*^sy ” Msy(6 ■X,,(t)) 

= (~BiZ,, + B^2{Su + ^22C722 ^ 22 )^' 

' ((^21 + ^22 022*^4 12)'^sy 

-{A^^-A^.Q.jQ.MxSty (3 28) 

By the “ordered interchangeability^’ property of 
multiple saddle point equilibria (sec Ba’ 5 ar and 
Olsder, 1982), pi^y is also in equilibrium with 
but not for all y > y^. In other words, there will 
exist some y e f\ for which under i.i*y the upper 
value of the game will be infinite, even though it 
is finite (actually zero) if /i,V/ gwen by (3 27) is 
used. We will shortly see thal the pair (/ 2 *^, v*^) 
in fad provides a saddle-point solution to the SF 
slow game, but under a more restrictive 
condition (than (3 10)) Toward this end we now 
study the upper value oi the slow game under 
the SF information pattern 
To bring Ly^ to standard form, we now 
introduce the transformations given belowf 

= (// - 'Q 2 ?Ah' 0 :.)'''[H\ - (r'l 

- D 2 A 22 ' 022-4 22 '/^ 2 ) ' 

■ ^>2^22‘0>1^22’(O>«s 
+ (/I 2 , — ^422022 02l)-*s)l’ (4-29) 

1 \ ' 

Ms “ 4 ^^ 2 ^^ 22 C? 2 r*^ 2 : “ ~2 ^^ 2 ^ 2 ^ ^ 2 ) 

■ M, + (/ + fl2(-422 0 22'>l22 “ ^ ^ 2 /^ 2 ) ^ 2 ) 

■ ^ 2 (^ 22022^22 ^2 ^ 2 ^) 2 ) 


■ (■^21 ^ 22 C ?22 C? 2 l)-^s 


(3 30) 


t Note that the dependence ol on is allowed here, 
because we are interested in the upper value of the game 


which are valid under the condition: 

~~ ^2-^22 ^C?22-^22'^2 ^ 0, 

<^ a ^^ q ^^ a ; 22 -\ D 2 D ' 2 > 0. (3 31) 

y 

These t^ansform^tions essentially “complete the 
squares” in so as to cancel out the cross 
terms in and The transformed game 

then is 

A = +fio«s+A)tVs: a:,(0) = 0, (3.32) 

i-y. = ksCOIe,,, + [ + ImJ^ Ih-J^) dr. 

(3.33) 

where the coefficient matrices are (see Appendix 
A for details of the derivation.s) 

■4o(y) = All — A 12022 021 ~ (‘^12 A12Q22A22) 

■ {S22 A22022'-4:22) ' 

(A21 — A22022*02i)' (3.34) 

fl„(y) 

= (fill + Ai^ 2 (a 22 022 'A ?2 “ ^ 1^2^-’) 

■ iJ + B'2[A22Q22'A22--2D2D'2j fi,] 

(3.3.-5) 

A,(r) = A,(y'/ - a;a'2 -'022A2-2'/:»2)'''" 

0(7) = 0n - 0,2022'02, + (A;, - 0,2022'Ai2) 

■ (‘5*22 + A 22 022 'A 22 ) ' 

■ (A 21 “ A22022'02i) (3.37) 

The GRDE associated with this differential 

game is 

t + A,',Z, + Z.A„ - tXZ, + 0 = 0; 

t(O = 0.n. (3.38) 

where 

5„: = A.fi;.-A.A;, (3.39) 

Some matrix manipulations, details of which are 
again included in Appendix A, show that 

A() = A(( 0 = 0 ‘^0 = --^(11 

where An, 0, were defined earlier by (3.15), 
(3 18) and (3.21), respectively, and hence that 
the GRDE (3.38) is identical with the GRDE 
(3.19). It can further be shown that condition 
(3.31) implies condition (3.11), and hence that 
the set 

A;={y'>0;Vy2: y', (3.31) holds and (3.38) 
has a bounded nonnegative definite 

solution over [0, ff]}, (3.40) 
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is included in f^. Now, define 

y,: = inf{yefj, (3.41) 

which is clearly not smaller than y,, defined by 
(3.23). Following the earlier reasoning, the slow 
differential game with the SF information 

pattern has a bounded upper value if y > y^, and 
only if y = f'>,. For y>f's. the game admits a 
strongly time consistent (feedback) saddle-point 
solution, given by the policies (for the 
transformed game); 

a:, = ^s(O) = (3.42) 

vVsV = v:,(r, Jc.(r)) = DnZ,,jc,(r), (3.43) 

where is the unique nonnegative definite 
solution of (3.19) (or equivalently (3.38)). Now 
applying the inverse transformation of (3.30) to 
(3.42), we obtain precisely the policy (3.28). 
This shows that for the slow game with SF 
controllers, (3.28) leads to a finite upper value, 
provided that y > y^ Iti general, y^^ys, and 
hence one natural question is whether one could 
have a correction term, depending only on the 
state, added on to (3.28), which would improve 
the performance from y, to y,. The answer to 
this lies in the analysis of the “fast game” which 
is studied next, and is provided fully in Section 
4. 

3.2. The fast subsystem and the associated soft- 
constrained game 

Let = -i 2 . Wf: = u — u,, Wf : = w — and 

(/' - tic) where we take t to be frozen, and 
T to vary on the same scale as r. We define the 
fast subsystem and the associated cost (as in the 
standard regulator problem; see Chow and 
Kokotovic, 1976) by: 

x\^ Ay2{t)x^\-^ D.{t)w[\ 

dr 

jt:(())-jc,(0, (3.44) 

/.;,= [ (|x;iy„„) + l«!l‘- kli*) di. (3.45) 
Jn 

The GARE associated with this infinite-horizon 
game, for each /, is: 

A22(0Z> + Z,A22U) + Q22(0 - Z,S22(t)Z, - 0 . 

(3.46) 

Wc now let yj denote the minimax di.sturbance 
attenuation bound for the //"'-optimal control 
problem defined by (3.44)-(3.45) under closed- 
loop information!, and y|,, denote the same 
under open-loop information, where we must 

t To ensure thal yj “f, it will be suDicieni to lake the pair 
Bj(t)) to be .stabilizable. 


have yof^y'f (see Ba$ar and Bernhard, 1991). 
For every y>y'f, let Zfy(t) be the minimal 
positive definite solution for (3.46). Then, for 
any y > yj, the feedback controller that attains 
the finite upper value of the fast game is 

M-(T) = fi^:(x'r(T)) = -Bi(t)Z,,(t)xi(rh 

^;,(f) = ^t;;(T|(o)) = -B^(oz,,(t)x^t) 

= -B^(t)Zry(t)(x2(t)-X2(t)). (3.47) 

Substitute (3.1), (3.28) and (3.26) into (3.47), to 
obtain 

^rv(6 T(/)) = -B;Z,yX2 - B^ZryQ^MnZ.y 
+ O2I - ^ 22('^2 + A22O22 
■ ((“^Zl T A22Q22Ai2)Z^y 

~ (A 21 ~ A22Q22*Q2i)))T|(0- (3.48) 

We now define 

y,:= sup y'f, (3.49) 

nrlU.frl 

y„f:= sup y'o,. (3.-50) 

Then, for every y > yr, the GARE (3.46) admits 
a positive definite solution for all t e [0, 6). Let 

y: = max {y*, y^}. (3.51) 

Wc will shortly see that this value plays an 
important role in our problem. 

3.3. The infinite-horizon case 
We now turn to the infinite horizon case. Let 
i4, B, D and Q be time invariant and Qf be zero. 
By following steps similar to those in the finite 
horizon ca.se, we first decompose the system into 
slow and fast sub.systems. After an appropriate 
transformation, the slow game with FI controller 
is described by 

is = ^iF^s jrs(O) = 0, (3.52) 

4s =f {\x^l^\uf-m^)dt, (3.53) 

-'ll 

where Ai ^^nd Q are as defined before, 

with the only difference being that they arc now 
time invariant. The associated GARE is 

- ZM + 0=0, (3.54) 

Let us define the following set as the counterpart 
of (3.22): 

f,„;= {y'>0:Vy>y', >0. 

(3.54) has a nonnegative definite solution 
Z^y, and A,',(y) - 5„(y)Z,^ is Hurwitz} 

and further define 

y,,,~inf{yef,„}. 


(3.55) 
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For every let be the minimal 

positive definite solution to (3.54). Then, for 
each Y > ys»T the FI controller that attains the 
upper value is the time-invariant version of 
(3.24), and the maximizing disturbance with 
respect to this control is the time-invariant 
version of (3.25). Then we can solve for and 
v*.,,, which arc the time-invariant versions of 
(3.26) and (3.27), respectivelyj. To obtain the 
corresponding SF controller, we substitute 
into n*,y, to arrive at an expression for /x*., 
which is the time-invariant version of (3.28). 
Note that this SF controller yields a finite upper 
value if 

y> ys<:=inf {ye TJ, 

where 

{y' - y^^ : Vy y', (3.31) holds}. 

The fast part of the system is the same as in 
the finite-horizon case, where the coefficient 
matrices are now constants. The fast game is 
described by (3.44)-(3.45), and the CJARF is 
the same as (3.46). We will u.se y,, to denote the 
minimax disturbance attenuation bound (and 
y„,, for the open-loop case), and let Z,y denote 
the minimal positive definite solution to (3.46). 
Then, the minimizing controller is given by 
(3.48). As the counterpart of (3.51), we define 

y, :^max (y,., y,,}. (3.56) 

This quanJity wiJl also play an important role m 
our analysis in the next section. 

4 MAIN RESUITS 

Before we present Ihe main results, we first 
give three useful results, listed below as Lemma 
1, Lemma 2, and Fact 1. 

Lemma 1. Consider the following GARL’ 

A’Z + ZA + Q-z(^liB’ - ~DD'yZ = (), (4.1) 

where the matrices A, Q, B and D are taken to 
be quite general, apart from the requirement 
that Q ^0 and {A, B) is controllable. Let 
0 y ,„<00 be the infimum of all ys under which 
this GARE admits a nonnegative definite 
solution. Then, for y y^, 

BB'-\dD'+AQ 'A' '0. (4.2) 

y 

t Agaiiij^ for y,, to be finite, it will he sufliicicnl to have 
the pair (A„, fl„) controllable dX y ~ which is equivalent to 
having the pair {A^^, controllable 

X Note that here is maximizing under but is not 
necessarily minimizing under sec Ua^ar and Beinhard 

(\m) 


Proof, Since (A, Q) is observable, the nonnega¬ 
tive definite solution of the GARE is actually 
positive definite (see Ba§ar and Bernhard, 1991). 
Let Z be the minimal positive definite solution of 

(4.1) . It is well known that, under the conditions 
specified, the matrix A — (BB* — (l/y^)DD*)Z is 
Hurwitz. Let S := BB' - (l/y^)DD\ Then, using 
standard matrix manipulations and the GARE 

(4.1) , we obtain: 

S + AQ~ 'A' -=S - Z \-ZA)Q"'A' 

= S- Z '(A Z T g - ZSZ)Q ^A^ 

= S-Z U' + 5Ze-Z“‘A'ZG 'A' 

= (5-Z M')(/ + Z(? ’A') 

= Z \ZS - A ')ZQ-\Q A Z)Z ' 

= Z \ZS - A’)ZQ-'Z{SZ - A)Z >0. 

Lemma 2. Under Assumptions A1 and A3, if 
y^yi (or 7 >yf^, in the infinite horizon case), 
then Q defined by (3.18) is nonnegative definite. 

Proof. Since y > yi (y>yf.) and il 

follows from Lemma 1 that ^'22 3“ A:r 2 C? 22'^22 ^ 0, 
Since nonnegative definiteness of Q implies 
C?ii " QmQijQii the result follows. 

Next we recall a well-known fact. 

Fact 1. Let A, and Q '=^0 be three square 

matrices of the same dimensions. Then if (A, Q) 
is observable, so are (A,Q-^P) and (A-h 
Q F P), the latter for all K of appropriate 
dimensions, where P[P,, - P. 

4.1. The composite controller 

We now introduce, for both finite and infinite 
horizons, the composite controller: 

x) ^ x) -f X), (4.3) 

where jti*, and pi*y were defined by (3.28) and 
(3.48), respectively, and y>y (or y,, in the 
infinite horizon case). After some manipulations, 
this composite controller can be rewritten as 


'A, 

0 ■ 

-^1 

-Z. 


^2- 


where 

Z,:=Z,yQ^2\A\,Z,y + Q2>) 


“ (/ + ZiyQ 22 A 22){^22 "I" A 22 Q 22 A 22 ) 

I 

■ ((“^'21 + A22Q22A[2)Z^y 


-{A2^-A22Q22'Q2i)) 

(4.5) 

ZfyU+V 

(4.6) 

Z,y(U,Z,y+ U2) + V^Z,y+V2, 

(4.7) 
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and 

— “■ Q22^^22(^2 ^22022^22) H ‘ S 21 

+ >l 22 02V^i2). (4.8) 

fJ2'~ Q22Q21 4 " 022^22(^^2 ^ -^22022^22) '(^21 
”-^ 22022 ^ 02 l)i (4.9) 

1^1 •“ ~ i ^22 -^ 22022 ^-^ 22 ) -^22 022 *-^ 12 )» 

(4.10) 

^2 '•“ (*^2 “*■ -^ 22022 -^ 22 ) H -^21 ” >^22022 02 l )- 

(4.11) 

Now, we are in a position to present our main 
results, first for the infinite horizon and 
subsequently for the finite horizon cases. 


lers. This shows that both Ys^ and Yf^ are finite, 
implying that y* is also finite. Now fix y>y*. 
Under Assumptions A1-A4, we know from 
Lemma 1 that ^ 22 -I-A 22022 A 22 >0. Then, by 
Lemma 2, 0^0, and also by the nonnegative 
definiteness of 0, 0ii - 012 022 02 i — 0 . Since 

Qs'—[I “A21A22 

/ 

X 

- ~ A 22 A 21 

= 011 -012022 02 , + ( 0 . 2022 * 

— A 21 A 22 *)022(022 021 ~ A22*A2i), 

by Fact 1, (Ao, 0J is observable. Also, we can 
rewrite (3.15) as follows: 



4.2. The infinite horizon case 
To guarantee that y^ is finite (although the 
theorem to be presented below is true also when 
it is infinite), we make two additional 
assumptions; 


Assumptions 

A4. (i4o, Qu - QnQzzQii) is observable. 
A5. (Ao, So) and (A 22 , ^ 2 ) are controllable. 


Theorem 1 . Consider the singularly perturbed 
system (2.l)-(2.6), with t^ = ^, 0i = and A, fl, 
D, 0 time-invariant. If Assumptions A1-A5 
hold, then, 


( 1 ) y*{^) — Y^^ asymptotically as e—^0, where 
Y^, as defined in (3.56), is finite. 

(2) Vy>y«, 3ey>0 such that Ve e [ 0 , c^), the 
GARE (2.26) admits a positive definite 
solution, and consequently, the game has a 
finite upper value. Furthermore, the minimal 
such solution can be approximated by 


Z = 


■ + 0 (e) 

.eZ, + 0{€~) 


cZi + 0{€-) 


(4,12) 


(3) Vy>y,, if we apply the composite control¬ 
ler ^i*y to the system, then 3Cy>0 such that, 
VeelO. fy), the disturbance attenuation 
level y is attained for the full-order system. 

(4) Vy>max{y,„ y„,.,} = max {)>„, y„,.,}. if we 
only apply the slow controller fi*y to the 
system, then >0 such that, Ve e [0, c"), 
the disturbance attenuation level y is 
attained for the full-order system. 


Proof. We first note that under Assumptions 
A3, A4 and A5, both slow (with either SF or FI 
controllers) and fast games have saddle-point 
solutions for sufficiently large values of y, since 
at y = 00 both problems become “regular” LQR 
problems admitting stabilizing optimal control¬ 


Afl — A(i [B() £)(>] 


^2 

r)D;. 


B'l 

X (.^22 "1“ A 22 C?22* A 22 ) *(^21 ^22^22 


By Fact 1, {A^u Q) is observable. From the 
observability of pairs (Ao, Q) and (A22. Q22)’ we 
conclude that Zsy>0, Z,y>0 (see Ba§ar and 
Bernhard, 1991). Furthermore, A 22 --^ 2 Z,y is 
Hurwitz for y > yfr, and by definition Ao - 5oZsy 
is Hurwitz, for y > y,,. From the observability of 
(A22. Q22). we can deduce that, for c sufficiently 
small, the pair (A,, Q) is aLso observable (see 
Kokotovic el al., 1986). ITicn the game 
( 2 . 1 )-( 2 . 6 ) has a bounded upper value if (2.26) 
admits a po.sitive definite solution, while the 
upper value is unbounded if there is no 
nonnegative definite solution. We show in 
Appendix C that the set of coupled matrix 
equations (2.30)-(2.32) admits a solution Z,, = 
Z,y. Z ,2 = Z' and Z 22 = Z,^. We now use the 
Implicit Function Theorem to complete the 
proofs of parts 1 and 2. Let K^, and be 
the vector forms of Z^^. Z 12 and Z 22 , 

respectively. Let us write equation (2.27)-(2.29) 
as |(f, K], K,. K^) = U. We already know that 
1(0^ K,. K,. = where AC,, K. and are 

the vector forms of Z,y, Z, and Z,^. Let 


{f,Ki.K2 h^)-W Ki 


/ = 1, 2. 3,; = 1 , 2, 3. 


Then 


*—11 “ “32 ~ 

— ^n2 ® (A 22 — ^22-^lv) + (A22 “ ® 

^22 “ (A 22 — ® Ai 1 1 

where ® refers to the Kronecker product. 
Furthermore, and E 22 are invertible. Solving 
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for Zi2 from (2.28) we obtain 

Zi2 ~ ~Z|i(i4|2 ~ ‘^12222)(.^22 ~ ‘^22222) ' 

— (C?12 /■ ■^2iZ22)(242'’ ~ ‘^ 22^22) ' + Oic )■ 

(4 13) 

and Its substitution into (2 27) leads to an 
equation in terms of 5, A',, K 2 and A,, which we 
denote as ^,(e, A,, A2. A,) = 0 Let 


we use ju*, in system (2 l)-(2.6) 


X, 

ex2 


\Au-B,B\Z,^-B,B' 2Z, An-ByB'^Z,,^ 


^ -Au-B,B\Z,^-B,B' 2Z, 
.A.,- BjB'.Z^y-BjB'^Z, 


A 22 ^2^2^fy'^ 


-1/ 


Hi 

5A, 


;-i,2.3. 


, ti Kt K, K^\ ((I A| A. A I 


then 


(4 14) 


H.2=0, 

where R= A^ - S^^Z^y (see Appendix C) I Icnce, 


... -. . 2ZfJ 

rkl^)dr (4.16) 

h 

The ARb associated with this maximi/ation 
problem is 

A',W+ WA, ^\wD,D',W+ Q = [), (4 17) 

y 


where 


— 11 —12 — 1 *^ 

—21 —2'> —21 

L^ll -12 


= 


A\\ Ai 


X,]' 


0 


—II 

* En ♦ 

L 0 ') H„J 


IS nonsingular 


where * reters to any constant matrix This 
implies that 


_(l/e)A2i fl/e)A22J 

A p = A12 ~ fl| BiZ^f^ , 
.421=^2,- B.B'.Z, -fi.B'.Z,, 

^2^ — A 2 T ByB2Zi^ , 


(4 18) 

(4 19) 
(4 20) 
(4 21) 
(4 22) 


0 = Q + 



— 11 

-12 

— 13 






-21 

—21 

21-1^ 

is 

also nonsingular 

X 

z„ 0 


^ —11 

—12 

— U _ 




Zf^ 

Then, 

by 

the 

Implicit 

function Iheorem, I 

\Qn 


2.V 

z: 

b : 

0 

Zjy - 



\B\ B 2 ] 


Be, >0 such that Vc e (0. e,), there exist Zii(£). 
Z,2(f) and Z.2(c) that solve (2 27)-(2 24) and 
Zn(c) = Z„ +0(t), /p(f) = Z; + 6;(e) and 

Z2^(f ) = Z,^ -I- 0(f) Since Z.,, '^0 Z, -0 and 
Z^y, Ziy arc the minimal positive dehmte 
solutions for (3 .54), ( 46), Be,, 0^ c, - c,, 
such that Vc e [0, c, |, the minimal positive 
definite solution! for (2 27)-(2 29) is 

1 Z„(c) cZ,2(e) 

LeZ2,(e) cZ.Xe) 

So far, we have proved parts 1 and 2 For part 3, 


C?21 C?22J 


(4 23) 
(4 24) 


By Fact 1, the observability of implies 

the observability ot (A^,Q) Then the maxi- 
mi/alion problem (4 l‘^)-(4 16) has a hnite cost 
il (4 17) admits a positive dehmte solution, and 
the cost IS unbounded if (4 17) does not have a 
nonnegative definite solution Suppose that W is 
in the form 


W = 


LeVT.i CW 22 


where ^\2 = ^ 2 \ Expanding (4 17) around 
e = 0 we obtain 


t The positive definiteness follows from 
and the following well known result let 


7 = 


Q 

P 


P 

R 


0 /p 


0 


where Q and R are positive definite mdlrites Then 7 >0 il 
and only \i Q - PR ‘P' > 0 


-h/4;,Wi,+ + ^12^21 + Qii 

+ p (lVnD,/);VV„ + VVpDzD^VVu 
+ Ws^D,D2W\2+W,2D2D’2W[2) = il (4 25) 
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eA'uW,2 + M,W22 + WuAu+WuA22 

+ Q^2 + \ i€W,,D,D[W,2 + €W,2D2D[W,2 
y 

+ W„ D, £>^1^22 + M'i202Z?ilV22) = 0. (4.26) 

&/4 121^12 "t" ^422^^2 ^^12^ 12 ^^2''^22 

+ C 22 + A (e^VV; 2 D,D; W ,2 + fW 22 D 2 D;VV ,2 

y 

+ fVV;2D,D2VV22 + W 22 D 2 D 2 W 22 ) = 0. (4.27) 

It is straightforward to verify that when e = 0, 
Wm = 2sy. ^12 = Z[ and JV 22 = ^ly satisfy (4-25)- 
(4.27). Let ^ 1 , kj and be the vector forms of 
VV,,. 1^,2 and W 22 . respectively. Viewing 

equation (4.25)-(4.27) as the vector equation 
fi(e, k,, k 2 . k^) = 0, we let K,, K 2 and be 
the vector forms of Zj,y, and Z^y, and 
introduce 

=■ = 1 ^ 

'' dk, ((,KuKi.Ki) = (f>.K,.K2.K^) 

1 = 1,2, 3 > = 1,2,3. 

Then 

S3,=0, 5,32 = 0, 

“3.1 ~ A 2 ® (-^22 “ ‘^2'Zfy) + iA22 - Z 1 YS 22 ) ® /„,, 
-22 = (-422 ~ •Zfy'^ 2 ) ® In, I 

which shows that and E 22 are nonsingular. 
Solving for VV 12 from equation (4.26) we obtain: 

vv ,2 = - vv,,(^ , 2 + :p 0 , 0 ^ 1 ^ 22 ] 

x(a22 +^ 2 ^ 202 ^ 22 ) 

-{Qi2 + M^W22)I^A22+\d2D'2W22^ + 0 (f). 

(4.28) 

Substitution of this into (4.25) leads to an 
equation in terms of e, ^2 and ku which we 
write as 7/,(e, k,, kj, k^,) = 0. Let 

- 

- 1 , = ^ - - / = 1'2.3. 

\{c.K,.Ky,K-y) =(n.A:,.A'y,A",) 


Then, we have the structure: 


Cil 2 — 0 , 

Hh= 


(4,29) 


where we can readily see, from the analysis of 
Appendix C, that R = A(,~ S(,Z^y. Then 5i, is 
invertible by the stability of R, and thus 3 is 
established by the Implicit Function Theorem. 

For part 4, we first compare (4.10) and (4.11) 
with (3.28), to obtain (i*y=-{B[Z^y + 


B 2 V)Xs(t)- Substituting nty into system (2.1)- 
(2.6), we arrive at the maximization problem: 

Lei2J [A2,-B2B\Z,y-B2B'2V A22J 




I wl^) df—» maximize. (4.31) 


The associated ARE is 


A2Y+YA,+—,YD,D[Y+0^^. (4.32) 

y* 


where 


A. 


4^11 44|2 

(l/e)A 2 , (l/e)A 22 J’ 


(4.33) 


An=Au-B,B\Z^y-B,B'2V, 

A 12 = A) 2 , 

A2,^A2,-B2B\Z^y-B2B'2V, 
A 22 = A 22 , 


z 

^sy 

V 

B, 

z,. 

0 

. 0 

0 . 

B,- 

. V 

0. 


Qu Qn 
I-O 2 , Q22^' 


(4.34) 

(4.35) 

(4.36) 
(4. .37) 

(4. .38) 
(4.39) 


By Fact 1, the observability of (A,, Q) implies 
the observability of (A,, Q). Then the optimiza¬ 
tion problem (4.30)-(4.31) has a finite co.st if 
(4.32) admits a po.sitive definite solution and the 
cost is not bounded if (4.32) docs not have a 
nonnegative definite solution. Also, y > v„i, 
implies that -(l/y)"Z? 2£>2 +A 22 & 22 'A 22-^0 by 
Lemma 1, and consequently S 22 + A 22 Qh'^ 22 ^ 
0. Thus, the equality max {y,,, y„f,} = 
max {y,,, y,„,} holds by the definitions of y^., 
and y,,,,. Starting again with the form: 

y>= ' 

.fTp, cTjoJ’ 

where K, 2 = T 21 ) we expand (4.32) around e = 0: 
•'4||T|] + A2iTi23' ^^iiAii -I- T 12 A 21 + 0i 1 

4 \iYuD,D'2Yu 4 Y,2D2D\Yn 4 yM^iT>2Ti2 

4 f,2£>2£>;f;2) = 0 (4.40) 
^•^1 |Ti2 4 A2|iz2 4" i\|A|2+ Tl2A22 4'^l2 

4-^(eynD,D;y,2 + ff, 2 £> 2 £>;Tn 4- y,,£), 02^22 

4- y,2D2£>2y22) = 0, (4.41) 
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€ A \ 2 Y \2 + -^22^22 ^^12^12 + ^22-^22 ^22 

H 2 12 ^ 1/^1 ^12 ^ ^^ 22 ^ 2 ^ 1^12 

r 

H- 61^12^1^2^22 4* 122^2^2^22)“^- (4.42) 

It is shown in Appendix C that for e = 0, 
^^11 = ^sy. Y 21 = Z,,iyU + V and ^22 = satisfy 
(4.40)~(4.42), where Z^,^y denotes the solution to 
the “open-loop” GARE of the fast subgame: 


-^22^f 4" 2^fA22 4" Zf 2 /^2^2'^f 4" 02’ d- 

y 


We now show how the Implicit Function 
Theorem can again be employed to prove 4. Let 
K], K 2 and K-^ be the vector forms ot Fn, 1^2 
and y 22 » respectively. Writing equations (4.40)- 
(4.42) as ^{e, K^, K^) = 0, we let A,, Kn and 

to be the vector forms of K'-h f/'Z^jy 
and Zofy. Introducing 


3k, 


(t K| Ks) (0 A., Ai.A ,) 


/ = l,2,3,y- L2,3. 

we arrive at 


i'll ~ 0 ; *^^2 “ 


in — 4i3 ® ^^72 4- ■ 2 Dll) 





t22=[A'22+~Z,„yD,D',)<»I„,. 

Since y > y,„,. A.? + (\Jy^)DoD 2 Z„fy is Hurwit/, 
and hence En and E,., are invertible From 
equation (4.41): 

y, 2 --yn(>iiz + Ao,D;K,) 

x(A22 + y^D2ir.Y22) -(Qx2 +A^2^%) 

x(a 22 + ^2^2D2Y2) +Oi€). (4.43) 

Sub.stitution of (4.43) into (4.40) leads to an 
equation in terms ot f. Ki, K 2 and Kt., which wc 
write as A,(e, /C,, Aj, Ai) = 0. Let 


aA| 

3k, 


( f A. I, \, 


Ai) 


(0 A., A; A.,) 


1,2,3. 


Then, we have the structure 


S„- 0 , 

En = /„,®R + A'®/, 


(4.44) 


where R = An~ S^)Z^y. (See Appendix C for the 
manipulations that lead to this expression for R.) 


Hence, 



i 2 i 
Sn J 


En 0 * 

* Z 22 * 

0 0 s 




is nonsingular, 


where * refers to any constant matrix. Part 4 of 
the theorem then follows from the implicit 
Function Theorem. □ 


Remark 1. Although the theorem above prov¬ 
ides only an upper bound for the //^-optimal 
performance level of the full-order plant, we 
have actually proved recently, in Pan and Ba§ar 
(1992), that this upper bound is also a lower 
bound for the performance level for sufficiently 
small c Thus, we have the following limit.t 

hm y*(6) = y,. (^ 45) 


4.3 The finite horizon case 
Theorem 2. For the singularly perturbed system 
(2.1)-(2 6), let Assumptions A1--A3 be satisfied, 
the pair RiiO) controllable for each 

t E [0, r,], and the following condition hold: 

G ,22 — ^iy(^i)-l-, where Z^y{t^) is the solution to 
(3.46) at / = r, with y fixed. 

Then, 


(1) y^(c)— y, asymptotically as where y, 

as defined in (3.51), is finite. 

(2) such that Vc e [0, the 
GRDE (2.12) admits a positive definite 
solution, and consequently, the game (2.1)- 
(2 6) has a finite value. Furthermore, the 
solution to GRDE (2.12) can be approxim¬ 
ated by 


Zsy(/)^G(c) 

c(Z,(t) -f Z,h(T)) + 0(6') 
e(Z :(0 + Z:,(r))FO(rO 
6 (Z ,,(0 + Z,Jt))4 0 (e^)_’ 


for all t e [0, r,], where Z^i^(t) and Zn,(T) are 
the solutions to (2.22)-(2.23) and as t—> -oc 
they converge to 0 exponentially in the t 
time scale. 

(3) Vy > y, if we apply the composite controller 
^*y to the system, then 3e;> 0 such that 


t This result IS not included here because the arguments 
are quite technical, and its counlerfiarl in the finite horizon 
case still has not been established 

i This condition is imposed in order to make the boundary 
layer term Li 2 (t) converge to /fy(t,) 
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Ve e [ 0 , e'y), the disturbance attenuation 
level y is attained for the full-order system. 
( 4 ) Vy > max {y*, y„,} = max yo,}, if we ap¬ 
ply only the slow controller n*y to the 
system, then 3 e ">0 such that Ve e [ 0 , e"), 
the disturbance attenuation level y is 
attained for the full-order system. 

Proof. By a reasoning analogous to that used in 
the proof of Theorem 1 , Assumption A 3 , 
together with the controllability of (A 22 , ^ 2 ). 
leads to finiteness of and hence to that of y. 
Now, fix y> y. under Assumptions A 1 -A 3 . 
Then, we get Zf^(r )>0 and that ^ 22 ( 0 “ 
S22(t)Zfy(t) is Hurwitz. Following a reasoning as 
in the infinite horizon case, we deduce that 
Zn = Z,y{t), Zi 2 = ZX 0 and Z22 = Zty{t) satisfy 
equations ( 2 .n)-( 2 . 19 ). We now use Theorem L 
in Kokotovic and Yackel ( 1972 ) to prove 1 and 
2 . (For convenience to the reader, we include 
this theorem in Appendix D.) 

View equations ( 2 . 14 )-( 2 . 16 ) as a nonlinear 
singularly perturbed system. Obviously, condi¬ 
tion LI of Theorem L is satisfied. Since Z „.^(0 is 
the unique solution for ( 3 . 19 ) (see Ba§ar and 
Bernhard, 1991 ) L 3 is satisfied. Following some 
of the reasoning used in the proof of Theorem 1 , 
L 4 is established. We only need to show L 2 , 
which is equivalent to showing that L 22 (t), 
Li 2 (r) exist, and L22 {t)-* Zfy{tf), L, 2 (t)-^ Z'(rf) 
as T-^- 00 , and Z,,( 0 (V^i ^ t/'iZf/r)) + ^^2 + 
U2Z^y{t), Zfy{() are the asymptotically stable 
equilibria for ( 2 . 20 )-( 2 . 21 ) uniformly in Zn(r) 
when tf is replaced by t, where L22 {t) and Li 2 (t) 
satisfy ( 2 . 20 )-( 2 . 21 ). From the proof of 
Theorem 1 , we have that 

U[Ziy(t))UjZiyit), Zfy{t) thc cq^* 

libria for ( 2 . 20 )-( 2 . 21 ). Then they are the 
asymptotically stable equilibria because the 
linearized system is asymptotically stable. 

We now show that L 22 (^)~^ ^fr(^f) If 

Qa2~ fhe result is trivial. Suppose 

therefore that Z,y(rf) > Qj 22 t Consider ( 2 . 21 ), 
which we rewrite in thc form: 

^^22(^) * ~ ^22(^f)^22('*^) ' 

“ ^ 22 (^) 

— L22(t) ^C?22(^)^22('^) ^=0 
We evaluate equation ( 3 . 46 ) at t = 6 , and rewrite 

tit is well known that !. 22 (t) converges to as 

if (sec Ba.>ar and Bernhard, lV9i) In the 

cases when Qf^i is not strictly positive definite or it is not 
strictly less than thc results arc still true, which follow 
from the fact that an increase in the terminal condition (in 
the sense of matrices) leads to an increase in the solution to 
thc RDE. 


it as: 

+ ^ 22 (ff) “ ^fy(ff) ^Q 22 (ff) 2 fy(ff) * = 0 . 

Let A : = L22it)~^ - Zfy{tfy\ Take Ihe differ¬ 
ence of the above two equations and obtain, 

£ A - A 22 (fi)A - AA 22 (ff)' - AG 22 ( 7 i)A 

— AQ22U()Zfy(tt) ‘ — Zfy{tf) — (^- 

Let A(T'):=(Zfy(ff)AZfy( 6 ))"' where r'=—r. 
Then the above equation is equivalent to 

d 

— A 4 -A^A-h aa; 

-Z,,(M-‘G 22 (ff)Zr,(fr)~'= 0 , 

where At =^ 22 ( 6 ) “ 5 ^ 2 (ff)Zfy( 6 ). Now introduce 
the following system: 

d 

— y = A,y; y( 0 ) = yn. 
dr 

Obviously, this system is stable, and 

=y'Z,y{t,) 'Q 22 Ui)Z,yit,y'y, 

SO that 

= >-('.( A( 0 ) 

+ 1^ e^''Z,yU,r‘C}22(if)Z,y(t,r'e^"di)yn. 
Then, since y^ was arbitrary, 

A(T') = p-'''^ (a(()) 

Since Zty{ti) > Qa2 - we have A(()) > 0 , 

and consequently, A(t ')> 0 . Hence A exists and 
is positive definite for every r. By the stability of 
^ 22 ( 6 ) ~ S22Zfyitf)y wc deduce that A(r')— 
as which implies that Zf^^O as 

r-^ -30 and the convergence rate is exponential 
in the T time scale. 

Now, we rewrite ( 2 . 20 ) as follows: 

d 

^^ 12 (^) ” ~^ 12 (l^)(-^ 22 (ff) ~ ^ 22 (ff)Z|y(ff) 

‘^22(Zfy(ff) — L22(r))) - {A2i(tt) 

~ Z|i(lf)5i2(fi))L22(r) ~ Z,i(lf)Ai2(fi) — C?i2(ft)- 

Since A 22 (ff) ‘^ 2 (ff)Zfy(ff) is Hurwitz, and 
522 (rf)(Zfy(ff) - L 22 (r ))-^0 as r-^-o^, in view 
of known results for stability of linear systems 
(see Coppel, 1965 , p. 70 , Theorem 9 ) L, 2 (t)-^ 
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ZcOf) exponentially. So L2 is satisfied. Then by 
Theorem L, the solution to (2.14)-f2 161 exists 
for sufficiently small e. Since eigenvalues 
S 22 {t)Zfy{t)) depend continuously on 
t, there exists a OO such that X(AyM)- 
^2(04 (/))s-c. Then, by applying Theorem 
3.1 in Chapter 1 of Kokotovid et at. (1986), 1 and 
2 follow readily. 

T sub.st,tute ,nto system 
( 2 . 1 )-( 2 . 6 ), and then follow a rea.soning similar 
to that used in the proof of Theorem 1 3 to show 
(rather easily) that LI, L3, L4 are satisfied It is 
only necessary to show L 2 to prove 3. Introduce 
the boundary layer system, 

d 

= -L,2iT)iA22(t,) ~ B2it,)B2(t,)Z,,(t,) 

+ p /) 2 (fr)D;(t,)L„(T)) - (^;,(f,) 


2sy(ff)5'|2(^) - Z'cUl)B2(tf)B'2{ti))L22(T) 

- Z,y(tl)A,2(t,) - 0,2(6) - Z,^(t,)S,2(t,)Zty(t,) 

- Zl(tt)B2ili)B2(lt)Z(yiti), L|2(0) = 0fl2, 


(4 47) 
d 

^ Lzzir) - (> 122 ( 6 ) - ^.v(6)52(6)5k6))/-22(r) 
-£>22(T)(>l22(6) -fl2(6)/>;(6)Z„(6) 
- 022 ( 6 ) - Zf, (f,)fij( 6 )fi 2 ( 6 )Z,^( 6 ) 

^ 2 ?(t) ~2 ^^ 2 ( 6 )^ 2 ( 6 )^ 22 ('r); 

1-22(0) = 0(22 (4 48) 


It IS obvious that Z,,(f)(l>;(7;Zf^(t)) -I- I/ 2 + 
^'zZfyit), Z,y{{) are the equilibrta for the above 
system with 6 substituted by t. Then they are the 
asymptotically stable equilibria because the 
linearized system is asymptotically stable. Now 
consider (4.48). Since the result is trivially true 
when 0 n 2 = Z,y(/f), we only consider the case 
Zfy(tl) 0 ( 22 - Let /4h be defined as before and 
A(T): = Z,y( 6 ) - L 22 (r) Subtract (4.48) from 
(3.46) at / = 6 , to arrive at 


d - _ . .1 

— A -I- a;A + AA,-Ap D2(tt)D2(f,)A = 0. 

Let A(t') = A(r)“' where t' = -r. Then 

1 

^^7 A + >lfA -t- AX; - - 0,(6)/?;(6) = 0. 

Define the corresponding system: 


^y=>i;y; >'(0) = y„. 


so that 

y,>^'^'A(T')e^'^y„ = W.(A( 0 ) 

h) y / 

Then, since >»(, was arbitrary, 

A(r') = e''^'^ A(!)) + 

/-T J 

+ - 3 D 2 ( 6 )D'( 6 )e^''dr)r-^'’ 

Since Z,y( 6 ) > 0 f 22 = L 22 ( 0 ), we have A(0)>0, 
and consequently, A(t')>0. Hence, A exists 
and is positive definite for every r. By the 
stability of > 122 ( 6 ) - Sj^Z^yit,), we deduce that 
A(r')—>+« as which implies that 

L 22 (r)—' Z,,,(t() as t- Then, by an 

argument similar to that used in 1 and 2 , 
L| 2 (t)—+ ZJ 6 ) as r—»— 00 . Thus, L2 is estab¬ 
lished Then 3 follows from Theorem L. 

To prove part 4, we use a similar procedure 
We first sub.stitute fi*, into the system (2 1 )- 
(2 6 ), and follow the reasoning of Theorem 1 ( 4 ), 
to show that LI, L3 and L4 arc satisfied. To 
show L2, we introduce the boundary layer 
system, 

d 

-L, 2 (t) = -L, 2 (t)(/ 1 . 2 ( 6 ) 


+ ~2 D 2 ( 6 )D)( 6 )L 22 (t)) 

- (><21 (6) - Z,,(6)5,2(6) 

- L'( 6 )« 2 ( 6 )B 2 ( 6 ))L 22 (r) 
-Z„( 6)>1,2(6)- 0,2(6), 

^,>(0) = On-, (4.49) 

d 

- ^ L 22 (t) — ~>f 22 ( 6 )L 22 (T) - L22(t)A22{Ii) 

- 022(6) +■ ^22(7) ~ /)2(6)D2(6)L22(t), 

^22(0) = 0(22- (4.50) 

Since Z„,y(rf) 2 : Zfy(/,) s 0 , 22 , we can use a 
rea.soning similar to that used in the proof of 3 to 
show that L 22 (t)->Z,„,,( 6 ) as and 

sub.scquently L^^ir)-* V + U'Z„ty(l,) as t-* 
Thus, L2 is satisfied, which then establi.shes 
4. 


Obviously, this system is stable and 5 EXample.S 

j j We present tn this section some numerical 

— (>''Ay)=y' —Z),(t,)D 2 ( 6 )>', results for the infinite horizon case. As stressed 

' earlier, the three quantities y,,,, and y,„, play 
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important roles in the compulation of an 
approximate value for y*(^)- Furthermore, even 
though we know the order relationship /of* — 
Vf,, there is no such relationship between 
and the other two quantities, and it is very much 
problem dependent as we will see shortly. In the 
examples below, we will also study the 
performances attained by the approximate 
controllers and when they are 

applied to the original system.t 


Example 1, 

Consider the following two-dimensional 
system and the performance index: 


Ly = [ {2x]Ix^Xn ^ xl-\-\u\^ — \w\") dt. 

J(j 


(5.2) 


Note that here the fast system is open-loop 
unstable. By using a particular search algorithm, 
we readily compute the three basic quantities: 

y.. = 2.3.565, 

y,,-3.(KKK), 


ym. = ^ 

Note that here the three values satisfy the 
relationship ys^ < y\. < y„ir. We can also comp¬ 
ute the minimax disturbance attenuation level 
y^(c) of the full-order system (5.1)-(5.2) for 
different values of e as shown in Table 1. Note 
that as €—►(), y*(e)—* max (y.^^, yf.). 

1 AHM I OpilMDM A/' PI RFORMANCl 1 I\H K)R IHl HH I 
OHDl R S> SthM C)I I XAMPI (- 1 


Then, we apply fi*y and ju*,^ to the full-order 
system (5.1)-(5.2) and obtain the corresponding 
disturbance attenuation bounds y*, y* and no 
respectively, which are tabulated in Table 2. 


TABCr 2 A COMPARISON OF //""-PFRFORMANCE LEVELS UNDER 
rOMPOSnh. SLOW and feedforward CONl’ROLLERS FOR 
EXAMPLE 1 


r: 

3.4818 

3.4818 

3.481K 

3.4818 

3.4B18 

r: 

□D 

00 


X 

X 

n* 

OD 

c» 

IX 

X 

X 

c 

1 

0 1 

0 01 

0 001 

0 0001 


Note that only the composite controller p*y 
achieves the desired y bound level. Even if one 
allows a disturbance feedforward design based 
on the slow subsystem only (which is the 
controller still the fast dynamics cannot be 
stabilized. 

Example 2. 

Consider the following system and the 
performance index: 


f 2 

1 


-f- 

2' 

1 

u + 

~ L~1 

-2. 

.Jf2- 


.2. 

L 3 J 


Ly=\ (2x] -I- 2xiX2 + 3 jc2 + |u|‘ - y- 1^1^) dt. 

Ji) 

(5.4) 

Here, the fast subsystem is open-loop stable. 
The three basic performance levels in this case 
arc; 


y,, = 1.4240, 
y,. = 1.29W, 
y..,, =2.59K1. 


y*(f) 2 4.132 2 7K43 2 9724 2 9972 2 9997 

r I 111 0(11 (KKII (MKKII 

Now, we choose y = 3.5 max {y,,, ’/f,}, and 
design the optimal controller for the slow and 
fast subsystems based on this value of y; 

= 0.4822, 

-0.31378X,, 

W) = 0.38576X1 - 0.60(K)0iv, 
vrr(^i)= -0.11997X,, 

Z„.= 15.3229, 

f'rAi--« 2 ) = -10.1614X, - 15.3229x1, 

X 2 ) = - 10,4752X| - 15.3229X2, 

t In these examples, the relative accuracy is l).(K32 


Note that here the three values satisfy the 
relationship yi, ^ y„ < y„,,. We can also comp¬ 
ute the minimax disturbance attenuation level 
y*(f) of the system (5.3)-(5.4) for every fixed c 
(see Table 3). Note again that as e—*0, 
y*(c)~»max {y,,, y,^}. Now, we choose y = 
1.6>max{y,,., y,,,}, and de.sign the optimal 
controller for the slow and fast subsystems based 
on this value of y: 

Z,,, = 3.3788, 

Mry(x,) = 1.6447X1, 

PsV^i, w) = -2.4091xi - 1.12501V. 

v*y(xi) = -3.6034xi, 

Z,^ = 0.69201. 

f^i*y(x,,X 2 ) = -5.88965xi - 1.3840x2, 
f^*y(x,. X 2 ) = -4.22518xi - 1.3840X,. 



Singularly perturbed //“-control 


417 


Tabi> 3 Optimum //"-performancp ievh for thi- full 

ORDER SYSTEM OF EXAMPLE 2 


y*(f) 1 3306 1 3993 1 4212 1 4237 I 4240 

e • 0 1 0 01 0 001 0 0001 


Table 4 A comparison of H ~ plrformancf levei^ 

UNDER crjMPOSITL SI OW AND ELEDIORWARD f ON7 HOLI ERS 
FOR I XAMPI r 2 WITH y = 1 6 


vt 

1 4497 

1 4497 

1 4497 

1 4497 

1 4497 

r: 

00 

00 

00 

00 

00 

Kf 

1 5439 

1 5439 

1 S439 

1 5439 

1 5439 

f 

1 

0 1 

() 01 

0 001 

0 0001 


Then, we use fi*y, n*^ and /i,% in system 
(5 l)-(5 2) and obtain corresponding distur¬ 
bance attenuation bounds y*, y* and y*,, 
respectively, which are tabulated in Table 4 It is 
clear that in this case both the composite and the 
feedforward controllers, (x*, and n*,y, can 
achieve the y bound level, but the slow 
controller leads to infinite attenuation Also, the 
level achieved by feedforward controller is above 
the level achieved by fi*y If we want to totally 
Ignore the fast dynamics m the design of the 
controller then we should choose y to be greater 
than the y„,, m the design of the controllers For 
instance, if we choose y-2 7, we obtain the 
value 


z,„ = 2 nn 

f4s*r(jti)- -1 0841x,. 

w) = -\ 5014jC|- 1 nSw 

vr,(T|) = -0 17258jr,, 

7 ,^ = 0 <54480, 

Hty{x I, jti) = 2 13‘i2jt, - 1 () 8 %Jt„ 

= -3 4I95j«:, - 1 08%jc2 

Then, using n*y, /x*, and fi*{y in the system 
3)-(5 4), we obtain the corresponding distur¬ 
bance attenuation bounds y*, y* and y*(, 
respectively, as shown in Table “i Note that m 
this case all three controllers achieve the 2 7 
bound, but obviously, the composite controller 
H*y does much better than the other two 
controllers, n*y and /x*„, not only for small 
values of e > 0 , but even for f = 1 in which case 


H*y does not yield the desired bound (but fi*fy 
does) 


Example 3 

Now consider the following system and 
performance index 


.ex^i L 2 -2. .jtjJ .1.*^ .1 


w, (5 5) 


Ey = I ( 2 x] + 2x1X2 +xl+\u\^ - y^\w\^) d( 

■'ll 


(5 6 ) 


We again compute the three basic quantities 
y,, = 0 63249, 


y,. = 0 44697, 


= 0 49941 

In this case, these satisfy the relationship 
Yi^ ^ Yof^ < Y-.y We also compute yl(£) for 
different values of e > 0 (see Table 6) Again, as 

Tabu 6 Optimum //^performance crvii ior the 
Rill ORDER SYSTEM of EXAMPLE 3 

y*(c) 0S5946 0 61434 0 63023 0 63223 0 63243 

t 1 0 1 0 01 0 001 0 0001 


f—»0, y*(c)—Emax {y,,, y,,,} Now, we choose 
y — 0 7 > max {y^,,, yf«}, and design the optimal 
controller for the slow and fast subsystems based 
on this value of y 

Z,y = 5 5839, 

/x:^(x,) = -16 370x,, 

fiuAi, iv) = -ll 968x,-0 20,000iv, 

v:,(x,) = 22 013x„ 

Ziy = 0 26880, 

fxry(^), Jfi) = 1 0272jr, - 0 26880 x 2 , 
ficV^i. X 2 ) = -15 343x, - 0 26880x2 

Using p*y, p*y and p*,y m systems (5 5)-(5 6 ), 
obtain the values shown in Table 7 for the 
corresponding disturbance attenuation bounds 


Table 5 A c omparison or //' pi reormanc i i evei^ 

LINDFR rOMI'OSITt SI OW AND FLFDhORWARD CONTROlIhRS 
FOR FXAMPl L 2 WITH y 2 7 


1 3899 

1 4152 

1 4344 

1 4533 

1 4533 

3 5891 

2 7145 

2 5315 

2 5990 

2 5990 

1 7093 

1 6838 



1 6838 

J 

1 


mSmm 

0 0001 


IaBLF 7 A COMPARISON OF H"'-PERFORMANCb LEVELS 
UNDER COMPOSITE SI OW AND FEEDFORWARD CCJNTROLl ERS 
FOR EXAMPLE 1 


0 69507 0 69507 0 69507 0 69507 0 69507 

Yt 0 69713 0 69713 0 69713 0 69713 0 69713 

y:^ 0 69309 0 69309 0 69309 0 69309 0 69309 

e 1 0 1 0 01 0 001 0 0001 
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y*, Y* and y*f, respectively. We can see that all 
three controllers achieve the 0.7 bound, but 
does a little better than the other two. This is 
mainly due to the fact that in this case y„f^ < y,.. 

We observe from the preceding analysis the 
following essential features of approximate 
designs based on time scale separation; 

(1) When the fast subsystem is open-loop 
unstable, only the composite controller fi*y 
can achieve a y > y„ bound. 

(2) Even if the fast subsystem is open-loop 
stable, the composite controller may achieve 
a better disturbance attenuation bound than 
the when y** < y„,„. 

(3) In the case of y^^ a y„,^ > y^^, all three 
controllers (i.e. fi*y, fi*fy and fi*y) can 
achieve a y > y*. The composite controller 
does not necessarily lead to a much better 
performance than the other two. 

6. CONCLUSION 

In this paper, we have provided a complete 
analysis of the singularly perturbed linear- 
quadratic A/*-optimal control problem, with 
perfect state measurements in both finite and 
infinite horizons, by relating it to a class of 
singularly perturbed differential games. One of 
the main results of the paper is the existence and 
construction of a composite controller, inde¬ 
pendent of the singular perturbation parameter, 
under which the associated differential game has 
a bounded upper value. This composite control¬ 
ler leads to a specific (pre-computable) W- 
performance bound for the full-order control 
problem for sufficiently small values of the 
singular perturbation parameter. For the infinite- 
horizon case, we have actually proved recently, 
in Pan and Ba$ar (1992), that this performance 
bound is tight;t we have not included this result 
here since the arguments are quite technical and 
lengthy, and its counterpart in the finite horizon 
case is still an open problem. 

In the present pap)er, we have also obtained 
the conditions under which a controller design 
based on the slow subsystem only can achieve a 
desired performance level. Yet another control¬ 
ler whose performance is studied is the one that 
uses slow state feedback and a feedforward term 
from the disturbance, and it has been shown that 
the performance of this controller is generally 
inferior to that of the composite controller. One 
of the important messages of the paper has been 
that the “best” composite controller is not 

t In the recent work, Pan and Ba§ar (19V2), we prove a 
weak converse of Theorem 2, which says that 
y*(f)=y provided y, > yf and the matrix G, >0 for small 
enough, but positive, e. 


simply the sum of the “best” slow and the “best” 
fast controllers (contrary to what is observed in 
the singularly perturbed LQR problem) but 
involves a more intricate construction. 

One immediate, but not trivial, extension of 
these results would be to the sampled-data 
measurement case, so as to obtain the 
counterparts of the results of Ba§ar (1991) in the 
singularly perturbed case. This work has already 
been completed and the results will be presented 
elsewhere (Pan and Ba§ar (1991b)). Another 
extension is to the imperfect measurement case 
(the four-block //"-optimal control problem) 
which has also recently been completed by Pan 
and Ba§ar (1991a). One other extension would 
be to the multiple-time scale problems, which is 
a topic currently under study. 
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appendix a simplification of subsysfem 

PARAMETER MATRICES 
First, we introduce some matrices to simplify our proof 


^’■~^22Q22'^2:’ (A1) 

P • ~ f + ^3*422 (A2) 

O D2(A22Q22^'22 + ^2^'2) '^2' (A3) 

4> =Y^I- 'CaiAij'D,, (A4) 

X *2 (A5; 


In terms of these matrices, wc can obtain the following 
expressions using some simple matrix operations and matrix 
inversion identities 



p = /4 BU 'B,, 

(A6) 

p 

‘ = /-B;(;r + fl,fl;) 'B,, 

(A7) 

P '^2^ 

' - B;(;t + BjB;) 

(A8) 


(7= yM - Di(3r 4 B|fl',) M),, 

(A9) 

a 

' = 4 *, D^(3r 4 S-,,) 

y~ y 

(A HI) 

a '/72(jr -f B.B'j) 

' = y-D;(3r 4 A 22 ) 

(All) 


<t>--y-l-D'22T 'D.. 

(A12) 

0 

4 D'(3r D,/);) 

r' r •' > ^ / 

Dr), 

(AH) 

(p 'D\jt 

' D'2f3r \nj)[] 

y y '' 

(AI4) 

X 

' l~B,(n-i S„) 'B,, 

(Al^) 

X 'fl;(;r-^2/MD2) 

1 

S2>) 

(Alt.) 


Transformatiorw of the \lf}w \iih\\\tem under the // 
information pattern 

Wc will deal with the transformations described in H) 
and (1 9) one by one Substitution of (1 S) into (1 2) and 
7) yields the lollowing system 

jr, = 24*"^ + ^,(0) - A,(0). (A17) 

f-ys ~ l’^s^^)lc^fll l (I'Ju*' 

Ml 

+ -I--t |uj^)df. (AIK) 

where 

- A,,) 

= A„ + + fl j;) '(A;2C2 ..'c,,-Aj,), 

D’*^D„-BnP 'fl> 'r>2 

-■ I)„ - 'D,, 

0 " - Qn - a:,,A'- 'CM, - (3 ,2A.M42, 

+ Aji^r 'A^i “ (C12C27 At, ~ A •iItt 

y B.p 'fl> '(A,.Q 22 'CM|-A,i) 

= Cii - Aj.AC 'cm, - C'i 2 A 2 ^'A 2 i + A^i^r 'A,I 

— (C12 C22'A 27 — A 21 

+ B^B\) '(Aj^CM^fi'z,-A,,) 

= Oil - C->,2(?22'C2I + (C>I2C^22‘A22 ' Ajl) 

A( 2 r + fl;fl.;) ' X (A 22 C 22 'C '1 - A,,). 

P" = -ei2A22'D2 + A2|,^ 't^2 

+ (C|iD 22'A22 A2i)3r 'B.p 'fl> 'D, 


- -0i 2A22'D2 + A2,;t 'D 2 

A Q 12 Q 22 A 22 — A 21 ) 
xfjt + fljfli) 'B^B'^n 'D 2 
= -(C>i3G22'Ai2-A;,)(2t4 B.B'l) 'D 2 , 

R” =--Y'n D'22t 'D.-D'.n 'B 2 P 'B'^n 'D, 

--r^/ + o> 'D 2 -d> 'flitf; 

xfjr + fljflj) 'D 2 

- — y^/+ Djl^r + flj®?) '^2 

- -o 

Now, >.ub>.iitute (3 9) into (A17) rind (AlH) to obtain 

All “ A„ + Bf^B2(2i F B^B-,) (A 22 Q22 G 21 
-A,,)-{D„- B„B'2(7t + B,B'2) 'P 2 I 
Xfj 'D;(;r + fl2B;) '(4 .C->2>'Gr, - A,,) 

= A,i+B„B2(;r + B2fli) '(A22G22'G2 i 
- A.,)B,,B 2 (n+B.B:,) '/M) 

X ^',D;(2r + .V:2) '(A,,CM.'G2 i - A,,) 

= A„ + fliifl^lJT 4 BrBr) '(^A “2 D,D^t^r + ,^ 22 ) 

^(A2032'02,-A2,) -pD,,D;(;r 

*■^ 22 ) '(A22CM2'e.|-A2,) 

- A,I + (.^,2 - A |2A,2'^22)(^ a .V,,) ' 

X(A23CM2'C>., A 2 ,) 

- Aii“A|2G2'^G'’|A(5|2 + 4|,G2'Ati) 

X(/r4i22) ' X (A22CM2'C-’'I A,|), 

fl„=B*, 

0 „=D"(r 

G-G" + (G,2G23'A22-A'.,)(2r 

+ B2fl2) '0.0 'd; 
x(3r4B,fl;) '(AirCMj'Gri - A;,). 

= G''A(G,2G22'Ai2-A'.,)(3I + fl2B2) ' \ D.P'^ 

x(3r4y,) '(A 22 GJCM,-A,,) 

-CM, GpCM-'CM, A(GrG2^'A;2-A;,)(;i4.'(.,2) ' 

X (A 22 CM"* G 2 1 ~ A I) 

Now, wc can compute as follows, 

-I- ^ 22 ) ' 2 ~ ^^ 22 ) '^2^11 

- fl„fl;(;r + B 2 B',) ' \ DjD'jiJt 4 ,S,,) 'BjB',, 

= “ By^B\(jx A S22) 'B^BI^ 

-^D„D;,-AD„D’,(;r4 .S„) ' ‘,0,0,’, 

r ' “ r ■ 

+ fl|,fli(3t + .S 22 ) ' ^'rDjDi'i + pDiiDifTI 4 S,;) 'B.B;, 

-.V,,-ApA^r'e,- iirAC'Al^F IpA^r'-^zzAC'A ’ii 

- (,S,, - A ijAij'rCjXjr + A’ 22 ) '(i,, - S 22 A 22 'A I 2 ) 

= 5|| + /4 i2C?2>*x4 12 " (‘S'* I 2 Q 22 ^ 422 ) 

y (71 + ^ 22 ) '(^^21 “f ^??C ?22 ^ 22 ) 

Below, wc now provide the details of the steps that led 
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from (3 24)-(3.25) to (3 26)-(3.2H). 

^ ~ ^ (^22022 t?!! ~ 

= [p d;,Z,, + D',(n + .S 22 ) '0,0,',Z,, - p 

X D'j{n .V 22 ) 

-pDi(2r + 5:.) '(/>j2022'(?2i -'42i)k 

X (21 + .t,) ‘(.V,, - A ,,/t,2'>S,:)Z„ 

- -'j D;( 2 r + .Vj,) - /tii'Us 

- /3 |Z^y — ^ /)2(jr + 5 ^ 2 ) 

^ ((^1 "f" '^22C?22 (^21 ^ ^22^22 C?2l))l't\’ 

U'ly^-P '[fl(,Z,,A,+ fl> ‘(D;W, 

~ (-^22(^22 Q2I ~ '^ 2 l)'*^s)] 

- -(fl;z,^j, - Bi( 2 t + H^H'.) 'H.H[Z,yX, 

+ ^ 2 ^ 2 ) * 

X ( — {A 22 Q 21 021 ^21 

-^22022'>‘i2-ZsvXj) 

= -fl;z,^*, + fl;(2r + /f2«2) '((«2fl'i 

"f" ^27 022 I2)^sy 

+ M 22 0 22 ' 02 I - '42,))^ - + filK) 


fl‘'=fl„+O „0 'D> ‘fl 2 

= fl„ + p 0 ..^i (^-^-2 02 D;) B,. 

I I ~ '^2l'^22 C?2I ~ C? 12-^22 ^21 

-l-i4;,2r '>421 

(C^ 2 C> 22 ‘^ 22 ”^ 2 l)^ '^20 ' 

^ 22 (2 22 'C?21 ~ ^ 21 ) 

-en->4;.>»L'C^2.-0.2/^22'^2, 

+ >421^ '^21 

+ (0l2022'^l2 - ' ^2 «207 

X M22 0.m' 021 -^-i) 

~ I I “ I ? C? > 2 ' C? 21 ( C? I '• C? 22 22 “ ^ 21 ) 
X (^- p Ihl^l) ('422022 021 -^7|). 


/’" = - 012'4,2'fl7-f X;,;! 'B 2 - (012022 
-/4,|)7r 'D24> '^> 2 ^ '^2 
- Q,.A22B,-^ A’2,JT '«.- ( 012022 '- 4(2 


-/f;,)(^7r - /^j/);) ^2^,/)',7r 'fl, 

- -(012022''4;2-'42i(^~ ^'2 ^^ 20 :) B 2 . 


u;^=-B,Z,,x, + fl;( 2 r + if 2 B 2 ) '((B 2 BI 

+ '422022''4;2):^sy 
+ ('427027'021-'42 i)K-B.U^ 

n;(7t f i;,) '((.Sj, 4 4f22022''4:2)2s, 

— (^21 “'422 022*021 

- -B\Z,^x, + B',(71 + B.B^} '((^Vji +/»27 0 22''4i’)Z„ 
+ ('422022*071 -'421 ))-l2+B2(^ 

+ B.B' 2 ) ' + S^n) ‘ 

^ ((‘^21 22^^22 12)^sy ~ ("^71 ~ ^ 72 27 C?71 

— + 82(71 + .^ 22 ) ^ ^?2C?72^ 

~ (-^21 ~ ^22Q22 Q 2 i))-*^> 


Transformations of the slow subsystem under the Sb 
information pattern 

Similar tn that in the FI information case, we consider the 
transformations introduced in (3 29) and (3 30) one at a 
time Substitution of (3 29) into (3 2) and (3 7) leads to the 
following system 


jc, = >4^'ar,3 D V\, jr^(O) ^ jr i( 0 ), 


ys ~ |3^s(^r)lbr, 1 



+ Zr^P'" u,-b |ujJ,u- |wj")d/. 


(A19) 


(A20) 


A^'-A„+D„4, 'D'.n '('42, -/(22022*02i) 
= '4,1 — “2 — ~2 

^ ('422022*021 — '42i)' 


/?"-/ + «> 'B. + B'^ir '!),((> 'n'.u '/{, 

-I t flwT 'fl, BU ' /+/);f7T 

y- \ 

-^.DzO'zj fl: 

= l + B'2(2z-\r>2fy,'j B, 

^ X 

Now, substitute (3 30) into (A19) and (A20) to obtain 
a„ = a„-\d„d',{ji~ tD.d;) 

y- -\ y- - ■/ 

X ('4.>027*021 -'42i) 

+ (b„^ \2y'D„D' 2i^n \iyjy) l^i)x 'H'l 

X (;t ('4-0-'0^i- '4?!) 

= /l„- \n„iy{2t- '.O,/)',) 

y- ' y / 

X ('427022*071 -'42,) 

+ [fl„ + \ D„iy(n - \ f^i^Bjln 

^-^ 22 ) (^2?(J22 Q 1\ ^ ^^2\) 

“ A(, 4 4- ?2C?27 C^7 l “ -^ 21 ) 

~ ^7 4 .*> 22 ) (^IzQ?? C?2I ^ 21 ) 

~ -^n “b (^V|2 ~ A A 22 ^ ‘^ 22 ) 

X (A22QzsQ21 ~ A 21 ) 

“A,1 “A|2C:?22 071 4“ (^^124" A|2022 A 22 ) 

X (JT 4- kS22) ^ (A220?7'02I ~ A^j), 


where 
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Q = Q^- (0,2022^22 - A'2,(jt - ^2 DzD;) ' 

X fljx '®2 ^ ~ 

X (^ 22 ^^ 22*621 ~ '^.m) 

“6 ~ (612622 ^22 ~ ^ 21 )^^ ~ ~2 

X ^ S 22 ) \A2^Q2lQ2^-A.,) 

^611 “ 612622*621 + ( 612622^22 “ ^ 21 ) 

(JT + * 1 ^ 22622*621 ~ ^ 21 ) 

Now, we can compute S as follows 

^{\X ^ ^2i) '^ 2^0 

+ ‘^'’ 2 ) '^?^n ^ 

y~ 

+ »S^)) O-tDfj ^2 ~ Z),,Z),i 

- \D,,D\(n- \lXD\] /J,D;, ', 
r \ v‘ • V y 

- fl„fl;,-B„fl;(;i + 5,-.) ‘B,«;, 

+ ^/)„D'.(;t+S„) 'fl,fl;, 

r 


y y 

- \d„d\(2i + s„) 'z),d;, ', 
y “ y 

~‘^ll'^ '^p 6 2 '^ 1 ’~(S 2^'^12 6 '’'’^ 22 ) 
X(;T-|-S^n) '(.V,, 4 y42y(j2-'>^l2) 

We now give the differenl steps ot the derivation that led 
from (1 42) to 2H) 


'[ 

4 B 2 


1 \ ^ 1 

-(Bi+B'(jr- IhlK) D.ni 
\ y I Y 




X + (/ 1 j,( 222 'G 2 i - A2,)]x^ 

= B'2(n + S '(S,, - .^,4,, ‘A\2)Z.r 

+ flj(jl -( A,,) - A.Ax, 

= fl;z,^jr,+fljiJT + i,,) '((.S,|+ /t„(?22''4;,)Z,v 

~ ('^21 “ '^ 22 ^ 22 *^ 21 ))-^^ 


“ (^^21 “ -^22622*621)))^ 1 
= ~ B2ZfyX2 “ ^ 2 ‘^lr'^ 22*(^21 
52 iZ„y 4 S 22 ( S 22 + -TT) 

^ ((‘■'^21 -^22622*“^ 12)^SV ~ ('^21 

~ ^ 22622*621 )))'*^1 

= — B2^\yX2 ~ ^2^[y^22 (^21 ~ ‘^^ 21 ^sy 
^(*^22 

^ ((‘^21 '*■ -^22622 ^ I 2 )‘^s> “ (^21 

— 2429622*621 )) 

4 (.'>21+^2’622*^12) A, 

— (y4 j - -^22622 621 ))-*^! 

= -B\Z,yx2 - B’.7,^Q22(-A\ASz^ + ' 

^ ((‘^l + ^ 22622 *^ 12 )^sy 

“ ('^71 “ ‘^22622 621)) hy 621 ))'^ I 

APPtNDIX B A USEFUL LEMMA 

The following lemma will be used throughout in the 
derivations of Appendix C 


Lemma 3 Let U^, U 2 , V^ and be delincd as in 

(4 B)-(4 II) Then the (ollowmg relationships hold 


(1) 43j(;, + .?,,V, = -A,,, 

(Bl) 

(2) A22U2 + !>22V2-A,„ 

(B2) 

(t) - 4:,^,= t».,. 

(Bl) 

(4) ^2,1/, A2,V, = A,2. 

(B4) 

(A)/i,„+v„y-/i,, S2,/,y. 

(BM 

(6) C?22 --4 22V' = C>M+ 

(B6) 


Proof 

( 1 ) A 2 :(/,-f ^^9, 

- S 21 +A2i622'>4p->^ G.VA 22 (S 22 

+ a..q,!a22) ‘(S'.,4a, 

- '^22(^22 +^22622*^22) '(’»2l + A 2 . Q 22 A \ 2 ) 

- - ’^ 22^1 

(2) A22U2 - A., 

- - 4., 4 A..Q2 '(? , ^ A..(? Ja;,(^,. 

^A^xQ.'aA ‘(A,, - 

- -^'>27(^^22 + ^22622^20 ‘(> 4 ^ 1 -^2262'6^1) 
(^) 622^^-621 

- 6^1 - 6^1 + AAS 22 + ^zQiiAA ' 

X (A;I - A22622'621) 


Finally, we give the steps that lead from 47) to (1 48) for 
the fast subsystem 

u;yU) = -BjinZfyiOi^z - ^ 2 ) 

-^ ^22HA.^x, 4 fl.u,%4 D,h:^)) 

~ A, 

4B:,fi;(>V.,4 ;r) ' 

^ ((*^21 + -^22622 '^22)'^-‘y — (^21 

A 22 622 621 )) + ^ ^ 2 ^^ 1^91 

— ^ + ‘^ 22 ) '((‘^zi + i 2 )^»r 


(4) Q2,U,-A\, 

^ A'A^'^z-A ^ 2 ^ 622 *^ 2 -) ' 

X (,^21 A 22 Q 22 A] 2 ) 

= A 22 ^ 

(S) and (6) are direct extensions of (1), (2), (3) and (4) 


APPENDIX C VERIFICATION OF SOME USEFUL 
RESULTS 


Proo] of the fact that Z,, = Z,^^, Z,:, - Z', Z ^2 = 2.,^ \atL\fy 
(2 3(J)- (2 32) 
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It IS immediate to see that (2.32) is satisfied For (2.31), 
LHS = AztZfy + Z,yA,2 + (V' ’f t/'Zfy)>422 

+ ((?„ - - (V' + U^Z,y)S^,Z,y 

(^21 “ Z^y^x2 “ V'S22)Zfy -f Z^yAy2 

“*■ ^'^22 Qi 2 ^*ZiyA22 ~ ^ 2jy522Zfy 

= + U^Z,yA22 - U^Z,yS22Z,y 

= + ZfyAjJ + C?22 ” Z^yS^lZfy) 

= 0 , 

where in going from the second to the third step Lemma 3 
has been u.sed 
For (2.30), 

LHS = A;,Z,, + A2,{Z,yV^ V) 4 Z^yA,, 

+ (V + U Zfy)A2\ Q\ \ ~ Z^yS^^Z^y 
- (K' + U'Z,y)S2,Z,y - Z^yS,2(Z,^UV) 
-{V'^U^Z,y)S22(Z,yU^V) 

— ^|iZ,y + Z^yA,, + Cil - Z^ySjjZfy 
-(A2,-Z,y\2)(Z,yU-^V) 

+ (V" + t/'Z,,)(A„ - 5„Z,, - S22Z,yU- S22y) 

“ ^IiZgy + ZjyA^I + Cil “ Z^y5, ,Z,y 
^(U'A'2+V'S22)Z,yU 
+ (A;,-Z,y5i2)V' + (V' 

+ f/%y)(A,,-.5,,Z,y)C/ 

” ^uZgy + Z^yAj, + 011 ~ Zgy^ljZjy 

•f y'(S22Zfy-hA22-S22Zfy)U 
+ (Ai,-Z,y5i2)i^+f/'(A^Zfy 
■*■ (ZfyA22 ~ ZfyS22Zfy)U 

”■^11 Zjy + Z,yA 11 + 011 - ZsysV| , Zg^ 

+ (V'A22-L/'022)^ 

+ (^21 “ Zgy5i2)V 

~ -^nZsy + ZgyA 11 + 011— ZJy^S'l I Zgy 

~ (012 + Z,yA,2)f/ 

+ (A;, - Z,yS,2)V 

“ -4(iZ,y -h ZgyA,) — Z^ySf^Zj.^ + 0 
= 0, 

where we have used Lemma 3 throughout the simplification 

Verification of the fact that c = 0, Y,i = Z^y, Y,, = Z„,yU + 
V, V 22 - Z^fy satisfy (4 40)-(4 42) 
llie verification of (4 42) is straightforward For (4 41), 
using Lemma 3 

LHS = A2iZ„iy *f Z^yA 12 + ^ A 22 + Z,^fyA2> + 012 

— Z^yfipfl^Ziify 

— V B2^2Znfy + ^ Zgy D| ZljZofy + ;p ( V 

+ ^ Z„fy)D2D2Z„ly 
= A2|Z,,fy + (ZgyA 12 + ^ 'A22 + 0 12 ) 

— Zgy5i2Zj,ly -■ V ■V22Z„,y 

+ ^ (^Zf,fyA22 ^2 Zofy^zf^zZofy j 
= (A 2 I — Zgy5i2 — y ‘S 22 ~ U A22)Z„ly 

^(ZgyA,2+L'A22 + 0,2-f/'022) 

= 0 

For (4 40), on the other hand, using again Lemma 3, 

LHS = AnZgy - z,yfiifl;z,y - V'B2B\Z,y 
+ Z,yAl,-Zgy£f,B;Zgy-Z,yBiZf2V 


^(A2,-Z,yB^B2-V’B2B2 
+ ^,AOi)(z„„t/+v) 

+ (V' + U'Z„„)[a,,-B^B\Z^^ 

-B,B',V+y,D,D1Z.^) 

+ :pZgyDiDiZgy + 0,1 + Zgy/?,fl;z,y 

+ V'fl2^lZgy 4- Z^yByB^V 
+ V'B^B^V + p (V” + U'Z„,^)D,D'2{Z„,^U + V) 

All Z,y + Z,yA 1 I Zf^■^ 3 l I Zgy + 0(1 

+ (A 2 I ~ Zgy5i2 — y S 22 

+ ^,U'Z„„D,D',)(Z„,^U+V) 

+ V (^2|— .V^iZ^y) 

+ U'Z,„^(A2, - ^,z,^ - B.B'^V) 

= -41 |Z,, + Z,,/4 II - Z,yS, IZ,^ + (2 11 

+ t/'-4;,(z,„^(y+V) 

+ ^ t/'z„,/>3D;z.,i,i/ + v ’(/»,! - .s,|Z,,) 

+ U'Z„,^{A,,U + S,,V -S,,V) 

A 1 iZ^y + Z^yA 1 1 ZgyiSi ( Z^^ + 0 | | 

+ w{A 22 Z, -.fy-^ / l,,>A 2 j 

+ p Z„„.D.D;Z„„)t; + U'A'.^V + V’{A., - A,|Z,J 

~ A \\Z^y + Z„yA I I — Z^y5i | Z„y ^ 0 , | 
f/'( 021-A;,z,y) 

+ V'(A2,-.V2,Zgy) 

A(|Z,^y 4" Z^yA|) Z,^y»S()Z,^^ 4 (4 
-0 

Verj/ica/ion of the nunsinf^ularity o] H,| 

In Theorem 1, m order to be able lo use the Implicit 
Function Tlieorcm, we needed the condition that H,, is 
nonsingular We now verify this result here First, it is easy 
to see that E,, has to be in the form (4 14), where R is given 
by 

R ~ Ayy — (A 12 ‘ ^\2Z fy )( A 22 “ Z 22 Zfy) 

XA.y- SyyZ^y-Sy.AZyyLUV) 

^(A,2-\2Ziy)(A2.-522Ziy) '.V2,Zgy 

+ (A 12 — ‘^irZfy )(A22 
-5„Z,,) 'S,,{Z„U+V) 

This tan further be simplified lo (by using Lemma 3 of 
Appendix B) 

/? = AiI ~ .S', 1 Z^y - (A 12 - .S|2Zfy)(A22 

--V22Ziy) '(A., - S.yZ,y-S22{Z,yU^y)) 

-.S,2(Ziyf7+T) 

= A I I “ 5| 1 Zgy — (A 12 — ^S, 2 Z,y )(A22 

” ^ZzZfy) (A 22 ~ A22Zfy)(J 
-5,2(Zryf/+V) 

-All-5,,Zgy-A, 2 

“ AfJ - 5 „Zgy 

Since A„-5„Zgy is Hurwitz, for y> the eigenvalues of 
S,, are in the left half plane From this it follows that H,, is 
nonsingular 
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For (4 29), on the other hand, R is given by 

~ (-^i p 

~‘^12'^ry)('^22 -‘■^22^1y) ' 

+ ^ 5 D,d;z., + ^‘,z),o;z^ 

- (p4|3 - - S 22 Z,,) ' DjD\Z,^ 

“ (^12 “ ~ pStjZ( y) ' ^ D]1)2Z^, 

which simplifies to 

= a„z,,-5,2(z„(; +V) 

(/112 ~ i [ 2 ^ fy )(^22 ~ ^ 22 ^iy) * 

— A[\ ~ 

from which it again follows that E,, is nonsingular 
Now, for (4 ^4), R can be rewritten as follows 

+ d,d:z„„)( 4,2 + D;/)'z,„j '4>, 

+ ^*,D,D;Z,P^',ZJ,D2(Z,„,(/+F) 


APPENDIX D THEOREM L OF KOKOTOVIC AND 
YACKEL (1972) 

Consider the initial value problem 

JJ = /(a.9. E) e(ln) = 5n. (Dl) 

f f) 9('n)=^9o. (D2) 

where c is a small positive parameter, and £ and rj arc n- and 
m-dimensional vectors, respectively Formally setting f-O 
in the system (D1)-(D2) yields the degenerate system 

^^ = /(t,|,fj,0), £(f„)=£,„ (D3) 

0 = g(r, ^j.O) (D4) 

Since (D4) may have several roots, suppose that a particular 
root fj = (p{t, I) IS of interest and substitute it in (03) The 
^-dimensional system 

0 (1. £)^0). g(g>£n. (DS) 

IS a reduced system of (D1)-(D2) 

Introducing a new time variable t, (D2) can be rewritten 
in Ihc form ol a boundary layer system, 


+ -^DyD,Z,,,) ' \lKI),/, 

r y 

-(p4p + ^*./J|D.Z„,,)(A., 


where tv = /o and = tlfo) ^re fixed parameters In the space 
of variables f, 7 / e, I we define a region 'H ||f - ^(011 ^ 

||r( - f/(f)||' r. I)£er f", /w/, where r|f;,i - r;(f,i)|=' 

0 


+ ^',d,d;z,.„) ' J,/>,d;(z„„(;4 v) 
-a,,-pS„/,, -s,,F+ ^',D,17 

-(Ap+ ',/),D,Z„„) 

r 

X + ^2 j |/l|i~S2iZs> 


Theorem 1 Let the lollowing conditions be satisfied 
LI / dJ/3S 31/3r} g, 3^l3k d^/Sr} arc of class fm 
(5. rj, t r)e^)i 

L2 The solution wir) ol (Db) exists on r e |0, is unique, 
and IS asymptotically stable with respeil to the root 

<Pih U (P4) 

I 3 The solution kU) Ihe reduced system (D5) exists and 
IS unique on t e fb,. 

L4 The real parts of the eigenvalues ol the Jacobian matrix 

f(( r),(l) 

3rj 


-A|] .S||Z,,, ,S|, V + Oi D_iZ|,|, (y 

-(ap + ^',d,d;z„„) 

X (Aj2 + \ ZI.D'pZ,,!,,) '(a,, + A D.DU„t^i 

-A,,-.S,Z„ -S, F-Api; 

= A,|-.i|;K A,3t;,-(,s„ 

-,y,,F,+A,.I/,)Z,, 

~ ^(1 ~ 

from which, again, the nonsmgularity of H,, follows 


arc negative on |l„. ij, for fj - 0(f, 5) 

Ihen for sufficiently small e, the system (Dl) (D2) has a 
unique solution t(t 0- vOf r e [to, b) satisfying the 

initial conditions ^(l„ f) - E„, ^/(t,,, 0 ^ Furthermore, 

hm ?(r. c)-l(l) on|l„.ifl (D7) 

f *i\ 

lim r/(f c) - f/(0 on (l„ /f], (D8) 


where the limit (D7) is uniform in t on [to- ^1 

(D8) IS uniform in t on any interval (1,, tf\. where L ^ b 
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Abstmcl —The initial setting and maintaining of the desired 
paper basis weight (in machine direction and cross direction) 
is to be ensured adaptively for .several given paper grades 

rechnical equipment includes; usual traversing gauge at 
the output of the paper machine, special traversing robot for 
shaping the headbox lip (sequential setting of screws), 
control computer. 

Adaptive multivariate LUCi conlrol with recursive 
identification gcnerali/ed to distributed parameter system has 
been designed with the following Icaturcs; integral 
(convolution type) model of the process respecting the 
continuous nature ol all signaK and kernels involved (by 
means ol spline approximation), repetitive control synthesis 
(via Riecati equation with periodical solutions). 

1. Introduction 

RiiiABii fioMOOLNiriY ot produciion in most chemical 
technologies cannot be ensured without the advanced means 
of automatic conlrol. It holds also in the paper industry, 
where the demands on quality distribution over the whole 
sheet are increasing. 

Paper machine performance is evaluated according to 
three mutually dependent variables prescribed fur a given 
paper grade; basis weight, moisture, and caliper content. 
Formerly, only the temporal mean properties of the paper 
sheet (in machine direction—MD) have been controlled 
automaliCfilly; nowadays the desired spatial distribution 
along the sheet cross-seelion (in cross direction—CD) is 
pursued, loo. This contribution deals with the CD conlrol ot 
papei basis weight; similar problems are met also in other 
sheet-making technologies. 

The authors belong to a group, where the Linear 
Quadratic Gaussian (TQG) approach to adaptive control has 
been deeply elaborated (Kirrny ct al., 1^85) together with the 
Bayesian approach to identihcation (Pelerka, P^Kl). The 
underlying problem formulation was found to tit the 
requirements of the paper CT- ontrol, too. 

The contribution begins with a survey of the technology 
(Section 2). Attention is paid to the discussion of the 
diflicultie^ met when solving the C'D control problem tor the 
basis weight (Section 3). After a short summary ot the 
present stale (Section 4), available reserves for improving the 

Received 24 July 1989; revised 1 October 1990; revised 1 
March 1991; received in final form 30 June 19^n. The original 
version of this paper was presented at the 11th IFAC World 
C'ongress on Automatic Control of the Service ol Mankind 
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Published Proceedings of this IFAC Meeting may be ordered 
from: Pergamon Press Ltd, Headington Hill Hall, Oxford, 
0X3 OBW, UK. This paper was recommended for 
publication in revised form by Associate Editor D. W. 
Clarke under the directions of Editor P. C. Parks. 

t Institute of Information Theory and Aiilomalion, 
Czechoslovak Academy of Sciences, Pod vod^renskon vCii 
4, 182 08 Prague 8, Czechoslovakia. 


quality and economy arc suggested (Section 5). The 
proposed approach is reviewed, stressing the practical 
aspects more than the theoretical ones, the reasons more 
than the tools (Section 6). The control system has been 
broadly tested by simulation; illustrative examples arc given 
m Section 7 The outline of the resulting algorithm follows 
(Section 8) The results are summarized in the Conclusions 
(Section 9). 

2 Technological background 

A very simplified scheme of a paper machine is sketched in 
Fig. ) to give' a non-specialist a rough idea of the design 
problems. 

TTie raw material, a watery pulp, is pressed out from the 
headbox through a narrow slice lip onto moving wire section; 
water flows through the net where paper web is created and 
then dried. Al the output of the machine, paper properties 
arc measured by traversing gauges. The mean machine 
direction (MD) basis weight is given mainly by the thick 
.stock flow, but small deviations can be controlled by machine 
velocity. The only way of influencing the cross direction 
(C'D) basis weight is to shape the headbox lip. 

3. Main difficulties 

The main difficulties and expenses arc caused by 
measuring instruments and actuators (continuous signals 
di.scrctely mea.surcd, control actions severely limited, special 
expen.sive gauge.s and devices needed), others are part of 
technology (multivariate couplings, transportation lag) and 
possibly of problem formulalion (MD and CD conlrol 
Inseparability). 

The output evaluation is based on measured data provided 
by a gauge traversing across the produced paper sheet. The 
sheet moves at the velocity of the order 10msec ’ and the 
measurement principle does not allow the gauge to move 
more quickly than about 10 'msec ^ these movements 
result in a zig-zag measured path sketched in Fig. 2, where 
the angle or may be about 1" only. Moreover, the gauge slops 
at one side of the sheet (for some time intervals at 
unpredictable time instants) for recalibrating it.self. From 
these extremely sparse data, the whole profile is to be 
reliably reconstructed. This is possible only due to relative 
stationarity of the profile, corrupted, however, by the 
measurement noise 

At the actuator side, the shaping of the slice lip is 
necessary, which is realized by a set of screws in distance 
about 0.15 m. For technological reasons, the absolute values 
of (vertical) screw-position changes and relative adjacent- 
screw positions arc severely limited. The precise screw 
setting is very demanding and consequently expensive in 
Itself. Position change of imc screw alflicls broad area of the 
web over several adjacent screws (the influence distribution 
is typical for a given paper grade). This coupling makes the 
problem essentially multivariate. 

The inevitable transportation lag from the headbox (input) 
to the gauge (output) is .substantial. 
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Fi(i. 1. Scheme of paper machine. 


4. Usual solution and its disadx^antages 

The introducing of CD control is explosive in developed 
countries since 1980, but after the first publications (Karlsson 
and Hagglund, 1983; Wilhelm and Fjeld, 1983; Chen et ai., 
1986) rare references appear (c.g. Elliot (1987), Natarajan et 
al. (1988); Brewster (1989)). The information published by 
the suppliers often does not allow to understand the 
underlying design principles. In our knowledge, the 
time-space distribution of the paper basis weight y(/. jt) is 
usually described (for control purposes) by a variant of the 
simple model 

Y(/)-Y,-Y,-Y,4B„U(/-d), (1) 

where t is discrete time, d is time delay. ' denotes 
transposition, Y'(0 = [Y(t, jc^,), . . . . Y(r. jt^,^)] is a vector of 
basis-weight values at the points jr e [x^i. . . , (usually 
the CD screw p>ositions), Yp, Y^ are desired and starting 
values of Y(t), respectively, LJ(f - d) is ^-vector of screw 
settings, is symmetrical band matrix (the band width is 
given by the number of screw sections influenced by one 
screw). 

The control task is to remove the deviation caused by an 
inappropriate initial slice-lip setting. Other disturbances arc 
supposed to be slow, so that the nonzero control action i.s 
rarely needed after initial tuning. Machine propcrtie.s are 
as.sumed to be time invariant; the matrix B„ can be estimated 
beforehand. 

If the technological arrangement allows to change all the 
screw positions at the same time, it suffices to estimate Y^ 
and to set U(0 = B„'(Y„ - Y J for achieving satisfactorily 
small deviation from the desired output C denotes a 
generalized inversion). The profile is c.stimaied by averaging 
and filtering the output measured inside the appropriate 
sheet section. 

This solution has several disadvantages; 

• the profile estimation adds substantial dynamics to control 

loops, 

■ the profile model is very simplified, 

• the matrix B„ has large dimensions and it is difficult to 

invert it robustly, 

• the simultaneous resetting of all screws is expensive. 

5. Available reserves for the economical and qualitative 
improvements 

'Fhe cross-profile basis weight distribution becomes 
economically important and the reconstruction of old paper 
machines is an actual task. As mea.suring in.struments and 
computational means are usually available (and used in the 
MD control) for large machines, the necessary new hardware 
is a headbox with slice screw actuators to be controlled 
directly by a computer program. The simultaneou.s setting of 
all screws is expensive; a successive setting using a special 



Fig. 2 . Measurement path (result ol sheet and traversing 
gauge movement). 


robot might be preferred. More complex software is able to 
make up for this hardware simplification. Improvements of 
measuring devices and actuators, where possible, seem to be 
effective. Stationary profile measurement (Francis and 
KJeinsmith, 1988) and recommendations for slice lip shaping 
(Kiessling, 1988) have been published recently. 

The common definition of the CD profile as a deviation 
from the MD value makes the MD and CD control loops 
inevitably interrelated. Changed definition can better reflect 
control design requirements (see next section). 

A substantial quality reserve is still fell in more 
sophisticated algorithms taking into account technology 
properties. Profile stationarity is used, but the very profile 
smoothness has not yet been exploited. 

As the control computer is at disposal for the initial 
tuning, it can be used for adaptive control with recursive 
identification as well. Nevertheless, the adaptive control still 
seems to be rare (Ma and Williams, 1988). 


6 . 77ie proposed solution 
Problem: 

• ensuring possible adherence to the prescribed basis weight 
profile, 

■ adding the CD control to the existing MD control. 
Hardware available: 

• measuring device—traversing radiation gauge, 

■ actuator—traversing robot (sequential resetting of screws). 

6.1. Control synthesis. We have found useful to define the 
profile as a curve shape only (as it is intuitively natural) and 
to express U in terms of derivatives (cf. the model (1)). 


S’(f)=[.V|,. .,J„1 


5jf., 


J’ 


at some fixed CD positions i (usually, derivatives in 

the middle of screw distance with o - p - \) 'Ilien the 
existing MD control can be preserved (as required) while 
either automatic or manual CD setting is chosen (according 
to permanent cross profile evaluation and displaying) 

The difficulty to synchronize the gauge and actuator 
movements requires separation of identification and control 
synthesi.s, which is well possible in adaptive control design 
(certainty equivalence principle) 

As only single screw-position change i.s to be realized al 
one time moment and all future changes must be 
simultaneously taken into account due to coupling of effects, 
the long-horizon dynamic programming is the right tool for 
control synthesis The optimization criterion is, therefore, 


Jf(t) 



t \ M 1 

, Z /(T) 

I ( 


(3) 


where , 1 ) is expectation (conditioned on the observed 
data up to t - 1), /(r) is partial loss, M is horizon length. 

The partial loss /(f) includes the control error; penalization 
of control actions enables the user to limit them. The 
quadratic form is advantageous as it emphasizes larger 
deviations; 


/(f) - {s{t) - s„)'(s(/) - s„) y w(t) Au^it), (4) 


where 8 (f) - s,, is control error (shape error), (o-vcctor of the 
differences between the actual s(f) and the required profile 


Au(f) is the control action designed (.single screw-position 
change) (see ( 1 )) 


lJ'(f) = U'(/- l) + |0. . - ,0. An(0.»_0] 



= ^ ( 0 , 0 , .. . 0 ), 



(.S) 


w(t) is a time-varying penalization; its restriction-dependent 
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changes (see Bbhm and Kimy, 1982) ensure th.u slice would 
not be damaged by excessive input values 
As discussed by Kdrny et ol (1988), the profile shape can 
be described by the state space equation 

s(r) = «(/-]) + r"''Au(f)4 e (0 ( 6 ) 

where e(t) is process noise. 


1(0 

The veclor I = [/,, ./j^] is a skew-symmelrical k - 2m 

vector (k^a characterizing the screw influence) 

The upper index i(0 points to the screw to be positioned 
It IS pcnodical and makes the vector periodical as well 
(by shifting f inside 

The equation ( 6 ) says that the shape vector s(/) ts a slate 
vector (unmeasurable), the new shape equals the old shape 
plus an increment following the change of the /(t)th screw 
position Au(t) The increment encompasses more than one 
section— It IS the derivative of the screw response Dynamic 
programming for this simple state equation ( 6 ) and the 
criterion (3) results in Riccati equation with periodical 
solutions (enforced by periodicity of f^'^) of the form 


Au(r) -L'*'* s(/ ~ 1 ) 


1^'(0 


_S(or"'' _ 

r"'*s(/)f‘''' + H(/) 


(7) 

(«) 


where the matrix S(l) is Ihe resuli of backwards recursion 






fori^Z + A/ 1, r with the initial condition S(f 4 Af)- 

1 


6 2 Identification The prcicquisilc for the lontrol ap 
proach given is a valid shape estimate s{t) acquired by 
identifleation The data at disposal lor this purpose arc 

• the slice-lip position U(t - d, x) at time ( and C D position 

jrE{x, X 2 le (sec (I), ('>)) [U(r- 

d,x^) U(t — d = U'(/- fV) (only the deviations 

from the initial adjustment arc available thus we can set 
U(r, )^(Jforr-0), 

• the machine speed V{f — d) 

• the noisy ''diagonal basis weight >j(V> x(i'^)) measured in 
the CD positions x(i'/) at time moments {f (samftling rale is 
higher than control rale, sampling C U positi ins arc often 
nonequidistant due to the nonunilorm gauge speed see 
full dots in Fig 2) 

The Items {a(i^)) f - 1f} arc hltered with a 
special hlter (not adding dynamit delay) (Karny ct al , 19KK) 
and extrapolated into equidistant points sav to the nearest 
CD screw position 

xm / -l' 1} >Y{t (1(1) 

(see crosses m Fig 2) The prehitered (smoothed) data are 
then used 

The basic description oi the process for the idcnlihcation 
and profile display is the convolutional integral equation 

K(/, x)= f A:(x)t;(/- d,x~ x)iXx 
Jo 

+ f J(t)V(t-d- Odf 4C(x) + /:(r X), (11) 

Ju 

where Y(t, x) is the basis weight al time t and CD position x, 
L IS width of the paper web, d is transportation lag for the 
mean speed, V(i) is machine speed at lime /, it has lo be 
incorporated into the model because it is used as input 
variable for the MD basis weight control, U(t x) is the 
vertical deviation of slice at time t and CD position x from 
the initial form, C(a) is a smooth absolute term reflecting 
slice profile for f/( , )®0 and V()-Vi,-medn machine 
speed, K( ) is smooth time-invariant even kernel, charac 


tenstic for a paper grade, J( ) is smooth time-invariant 
kernel falling sharply to zero, £(f, x) is stochastic zero-mean 
term comprising uncertainties 

By this model the continuous character of the process in 
time and space is sti^^scd 1 he references as well as practical 
experience seem to agree, that the implicit assumptions are 
fulfilled the system is (approximately) linear static, the edge 
effects are not pervasive (edges cut off) The assumptions 
about the disturbance E{t, x) arc discussed by K^rn^ et al 
(1988) 

In the evalualions, the continuous functions of a variable h 
arc approximated by properly chosen splines according to 
formula 

C(h)='Z g,gg,(h) = f\{h). (12) 

where we have introduced the vector g' —) 
of Ihe function values, g^~G{h,), i-l,2, , at the 

node points = < h. <h„ = the vectors of 

spline bases qj^(^) 

First order splines proved to be sufflcienl Not only the 
signals, but also integral kernels are approximated 
(sometimes possibly with nonequidistant spline nodes) In 
this way the distributed parameter system is approximated 
by the lumped parameter system 

Using th. approximation (12) where h stands for x or t, G 
lor f Ay U V, respectively, and g stands for newly 
introduced vectors of function values c, k, j, u, v the 
kqualion ( 11 ) can be rearranged into the form 

K(r, x) - Zr(f, x) + P'z(i, x) X F [0 L] (13) 

The parameter vector P-[i‘',k,j'] includes the initial 
profile and parameters of both convolution kernels 
(discretized according lo spline nodes) The kernel 
parameters k determine the vector f (see ( 6 )) used in control 
synthesis 

Ihe regression vector /(/ x) is a known linear function of 
U{t — d, x), V(t-d), ihc weights depend of spline choice 
only and can be precomputed 

Knowing al time t the values Y{t, a^,,)). U(i - d, x^^,j) (see 
(10)) the regressor z{i x^j,,) is known, equation (13) can be 
used for parameter estimation The regression model can be 
effeciively identified by the existing procedures (Kdrn$ et at , 
198S) including sophisticated forgetting with anlibursting 
influence (Kulhavy 1987) and initialization respecting the 
prior knowledge (KiSm^, 1984) 

If the parameters are idenlified, the basis weight and its 
dciivative, i c the profile shape, can be computed al any CD 
position (possibly except of spline nodes) By sampling the 
piohic shape at o CD positions (see (2)) at which derivatives 
cxisi, and defining admissible input changes (5), we arrive at 
the form ( 6 ) 

7 System texting 

An extensive system for debugging and testing was built 
tvery part of the algorithm was thoroughly tested without 
and with noise (much care was devoted to realistic noise 
simulation) 

For the overall system performance the identification is 
crucial because of sparse data The simultaneous identifica¬ 
tion of all parameters (13) proved to be reliable but the 
resolution of mutual influences of dim(P) o parameters 
requires (much) more than dim(P), identification periods It 
was found useful to identify the initial profile first, without 
input changes During this phase the control algorithm is 
initialized, too (by iterating the Riccati equation) 

The illustrative example shows the performance of a 
simulated machine modelled according lo equation (13) 
under the following conditions 

• numbei of slice lip screws /i - v - 

• initial absolute profile of the slice lip 

U = [0,0, 0,0 1,0 2,0 3.0 2,0 1,0, 

-0 1,-0 2,-0 3, 0 2,-0 1,0] 

• sampled screw influence kernel K(x) - [0, 0 7, 1,0 7, 0] 
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Fig. 3. Closed loop bchaviour-time evolutions of: (I) quadratic norm ol the basis weight error Y(t) — Yf,, 
(2) control action Au, (3) a typical element of the basis weight error Y(0 — Y(,, (4) the worst element of the 

basis weight error Y(/) — Yo- 



Fig. 4, Time-space distribution of basis weight (known 
parameters). The arrow (—►) marks the start of control. 


• equidistant spatial spline nodes (placed at the CD screw 
positions) 

■ constant vekxiity 

(The noise-less case is illustrated—experiments with noise 
give satisfactory results, itx), but the graphs do not display 
the control inhuence clearly enough.) 

Figures 3 and 4 show the ideal performance of the system 
for known parameters and control hori/.on M ^ 30 longer 
than one period of the control robot movement. Figures 5 
and 6 illustrate how the same .system works in case of 



Fig. 6 . Time-.space distribution of basis weight (unknown 
parameters). The (--♦) marks the start of control. 

unknown hcadbox parameters (vel(x:ily hxed). The initial 
profile is supposed to be completely unknown, the 
screw-kernel estimate models weak input influence. The 
resulting strong control inputs in the initial phase are 
automatically severely limited until the .screw kernel is well 
identified. 

8 . The final control algorithm 

I’he adaptive controller was de.scribed by source program 
parts with user-oriented commentaries F’^or all parameters 
appropriate default values were given. 



Fig. 5. Closed Imip behaviour-time evolutions of: (1) quadratic norm of the bicsis weight error Y(t) - Y,,, (2) 
control action Au, (3) a typical element of the basis weight error Y(t)-Y,„ (4) the worst element of the 

basis weight error Y(/) - Y,j. 
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Identification algorithm 
Off-line phase: 

• spline bases choice (number and placing of nodes; 
Bayesian slruclure determination (Kdrny and Kulhav^ 
1988) might be helplul); 

• precomputation of weights for data vector formiiie (Karnv 
etal.,\m)\ 

• initialization of recursive least squares (using data from 
past runs, possibly incorporating prior information (KiirnV 
1984)), 

• choice of forgetting factor (Kulhavy, 1987). 

On-line phase: 

- raw data acquisition (without usual filtering which brings 
additional dynamics); 

• filtering (see ( 10 ), smoothing and transforming the 
nonequidislaru measurements into equidistant 
representatives); 

• identification of regression model (by recursive least 
squares with restricted forgetting (Kulhavy, 1987)); 

• state computation (profile shape in preselected points 
computed from equation ( 11 ) for the current parameter 
estimates); 

• estimation of CD basis weight distribution (display for the 
staff, again from ( 11 )). 

Control algorithm 
Off-line phase; 

• choice of horizon length (the longer the better, minimally 
the width of the screw response); 

■ lower bound of penalization w(r) (3); 

• control action limits (given by the user). 

On-line phase: 

■ evaluation of allowed change of actuator action; 

■ actualization of state dependent restrictions (admissible 
s 1 i ce -1 i p d e I o r m a t i o n); 

• iteration of Ricatti equation (9) (with new parameters); 

■ evaluation of control action (7), 

• hard control signal restriction (if necessary), evaluation of 
the future penalty w (Bdhrn and Karny, 1982), 

9. Conclusion.^ 

Control system for cross-direction paper basis weight 
control has been designed for sequenliiil resetting of slice lip 
screws and tested by simulation. Described key results which 
can be used also in other applications arc; 

• spline-approximation of convolutional models (Karny ct 
ai, 199(1), 

• transformation of multi-input multi-output system into 
single-input-rnulti-output with model periodicity. 

Special features related to the paper-making implementation 
are: 

• the problem can be formulated as LQfj adaptive control 
problem, 

• adaptation to slow process changes is possible, 

• the periodical design requires the long-horizon dynamic 
programming, 

• the lack of measured data for identification can be 
compensated by using prior inlormation (continuity in 
space and lime), 


■ the experimental results show that the horizon length is 
critical: it must include at least the width of one screw 
influential zone, 

• the automatically varying penalization (Bdhrn and Kiniy, 
1982) proved to i^e helpful when starting the algorithm 
under uncertainty. 
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Extended Discrete-time LTR Synthesis of 
Delayed Control Systems*! 
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Abstract —This paper addresses compensation of dcalys 
within a multi-inpul-multi-output discrete-time feedback 
loop for application to real-time distributed control systems. 
The delay compensation algorithm, formulated, in this 
paper, is an extension of the standard loop transfer recovery 
(LTR) procedure from one-step prediction (o the general 
case of p-slep prediction (p=-<l) It is shown that the 
steady-state minimum-variance filler gam is the 
minimization solution of the relative error between the target 
sensitivity matrix and the actual sensitivity matrix for p-step 
prediction (p >■ 1). This concept forms the basis for synthesis 
of robust p-step delay compensators (p >l). The proposed 
control synthesis procedure for dcla> compensation is 
demonstrated via simulation of the flight control system of an 
advanced aircraft. 

1. Introduction 

In many real-timl distributed control systems such as 
advanced aircraft, spacecraft, and autonomous manufactur¬ 
ing plants, the sensor and control signals within a feedback 
loop may be delayed or interrupted. An example is the 
randomly varying delays induced by multiplexed data 
communication networks in Integrated C ommunication and 
Control Systems (ICCS) (Halevi and Ray, 1988; Ray and 
Halevi, 1988; Liou and Ray, 199U) where delays may be 
distributed between the sensor and the controller and 
between the controller and the actuator as illustrated in Fig. 
1. Another example is the occurrence of delays in the control 
law execution due to priority interruption at the control 
computer (Belle Isle, 1975). In general, the presence of 
randomly varying distributed delays within a multi-input- 
multi-output (MIMO) feedback system makes the task of 
controller design signihcantly more difficult than that without 
delays. To this effect Luck and Ray (1990) proposeil a delay 
compensator to alleviate the detrimental effects of randomly 
varying distributed delays by using a multi-step predictor. 
The key idea in this multi-step compcnsaloi design is to 
monitor the data when it is generated and to keep track of 
the delay associated with it. With this knowledge, the 
problem of varying distributed delays can be alleviated by 
having a lumped constant delay of multiple sampling 
intervals as seen by the controller. 

The major assumption in the formulation of the above 
multi-step compensation algorithm (Luck and Ray, 199(f) is 
that the randomly varying delays are bounded. This 

■^Received 17 September 1991; revised 24 Mar';h 1992, 
received in final form 15 June 1992. The original version of 
this paper was not presented at any IFAC meeting. This 
paper was recommended for publication in revised form by 
Associate Editor T. Ba§ar under the direction of Editor H. 
Kwakernaak. 

t This work reported in this paper was supported in part 
by Office of Naval Research under Grant No. NGCfOH-W-J- 
1513. 

! Mechanical Engineering Department, Pennsylvania Stale 
University, University Park, PA 16802, U.S.A. 


assumption is justified in view of the fact that an unbounded 
delay would render the closed loop open. LJsing a specified 
confidence interval, an upper bound can be assigned to each 
of the randomly varying distributed delays. The number (p) 
of predicted .steps in the compensator is then determined 
from these bounds. That is, at lime k, the predictor estimates 
the stale o.sing the measurements up to the (k - /?)th instant. 
Although Luck and Ray (1990) addressed some of the 
robustness issues of the delay compensator for structured 
uncertainties, the compensated system u.sed the gain matrices 
that were originally designed for the non-delayed system. 
vSince the robustness property of linear quadratic optimal 
regulators (LQR) is not retained when the stale feedback is 
replaced by stale estimate feedback (Doyle and Stein. 1979), 
this problem is likely to become worse with the insertion of a 
p-step predictor for p ^ 1 because of the additional dynamic 
errors resulting from plant modeling uncertainties and 
disturbances. 

The objective here is to deveh>p a procedure for synthesis 
of the p-step delay compensator with a trade-off between 
performance and stability robustness. To achieve this goal we 
propo.se to extend the concept of loop transfer recovery 
(LTR) (Doyle and Stein, 1981, Stem and Athans, 1987) 
which is a well-established procedure for synthesis of robust 
controllers. In continuous-timc systems, the key step in LTR 
design is to select an ob.scrvcr gain .so that the full-state 
feedback loop transfer properly can be recovered 
asymptotically For the di.screle-time LTR, Maciejowski 
(1985) has shown that although the target sensitivity matrix 
can be completely recovered with a posteriori stale 
estimation (i.c. p - 0 in the p-slep predictor) a predictive 
state estimator (i.e. p = 1) is not capable of full recovery. 
Along this line Zhang and Freudenberg (1991) have analysed 
the loop transfer recovery error for predictive state 
estimation. We have adopted an approach, following the 
multi-step prediction method of Luck and Ray (1990) to 
.synthesize the control system for delay compensation. This 
approach minimizes the loop recovery error where the gain 
of the p-step observer is tuned to a prescribed value. An 
alternative approach is to incorporate the delay in the plant 
model and then synthesize the controller and observer gains 
that must accommodate the effects of loss of robustness 
margins due to the delay and uncertainties. This approach is 
also discus.scd in this paper and it has been shown that the 
above two approaches yield identical relative .sensitivity 
errors. However, if the plant model is simply augmented to 
accommodate the induced delays, the stale-space realization 
may not be minimal as pointed out by Kinnaert and Peng 
(1990). 

The paper is organized in five sections including the 
introduction. Section 2 summarizes the pertinent properties 
of LTR for one-step prediction as reported in the existing 
literature. Main results including the structure and properties 
of the p-step delay compensator (p ^ 1) are generated in 
Section 3. A general description of the p-step compensator is 
first presented. 'Hien, the loop transfer matrix of the 
compensator is derived along with the error of the sensitivity 
matrix relative to that of the target loop. Finally, it is shown 
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Fio 1 Random distributed delays in the control system 


that the minimum variance tiller gam minimizes the error ol 
sensitivity matrices A prt^ccdure to synthcsi/c the observer 
and controller gams for the p-step compensator is presented 
in Section 4 The proposed proceduic is tested by the 
simulation of the flight control system ot an aircralt which is 
subjected to a lumped delay ol two sampling periods The 
paper is summarized and concluded m Section S 

2 Review oj the LTR concept Jor one step prediction 

The concept ol loop transfer recovery (LTR) and the 
existing results for one-siep prediction m the discrete time 
setting are presented in this section Tht plant under control 
IS represented by a discretized version of a hnitc 
dimensional linear time-invariant model in the continuous 
lime setting I he discretized model is assumed to be 
minimum-phase stabilizablc and detectable 

>1 - ( 1 ) 

Following (1) the plant translcr matrix is given as 

0(z)~ ( <t»(z)/? (2) 

where <P(z) -(z/-A) ‘ is tht resolvemcnt matrix I he 
target loop transfer matrix at the plant input is 

H{z) ~ra>{z)R ( 1 ) 

and the resulting target sensitivity matrix is 

S(z)-|/ +A/(;)| ' (4) 

The lull-stale feedback control law tor the above plant is 

In this paper wc have assumed that the uncertainties arc 
unstructured and lumped at the plant input in the lorm of an 
input multiplicative term 

(jiz) fi(z)l/+A(zJl (h) 

with given bound 

cT[A(e^‘")] IJo>) 'ico *0 (7) 

Usually, unstructured uncertainties include high frequency 
dynamics that are not modeled in the plant dynamics 1 he 
bound finally restricts the system design specilications 

for stability robustness in terms ot the dosed loop 

complementary sensitivity matrix (Do>lc and Stem 1981) 
The task of control synthesis via the standaid I UCi/l IR 
approach is focused on shaping the loop sensitivity matrices 
for required performance and stability robustness and can 
be carried out in two stages as discussed in Doyle and Stein 
(1979) 

For the filter observer (i c p-U) of a slabili/abic 
detectable and minimum ph isi plant the loop transfer and 
sensitivity matrices have been shown by Maciejowski (198*)) 
to converge pointwisc in frequency to those of the target 
system as the measurement noise approaches zero However 
this may not be valid for the one-step predictor 
(Maciejowski, 1985, Zhang and Freiidcnberg, 1991, Yen and 
Horowitz, 1989) Breaking the loop at the plant input, as 
shown in Fig 2 , the one-step delay compensator transfer 
matrix IS 

G,(z)- F(z!-A + Hl^ + IC) '/ (Ka) 


Loop bietX point 




Loop httik point 



FKi 2 Loop transfer recovery breaking the loop at plant 
input 

An alternative lorm is 

(,,(z) - l-y + <P{z)(Bf- + I o] '<r>(z)l (8b) 

Then the loop transler matrix for the one step delay 
compensated system is 

L|(z)-G|( z)(/(7 ) /•|/^ <t>(z)(fl/' + /( )| '(|>(Z)/('l>(z)W 

(9a) 

which can also be expressed, similai to the lormula proposed 
by Zhang and Freudenberg (1991) as 

I,{z) \n h ,(z)] '\H(z) l (9b) 

where /'j( 2 ) F|z/ - A I ( ] 'B is the one step error 
matrix at the plant input The resulting one step sensitivity 
matrix can be expressed as a fund ion of the error tiansfer 
matrix 

S,(2 )-[/ + L,(2)] ’ 

-\I + H{z)] ■|/ + ^(‘) on) 

It IS clear from (b) and (9) that r,(2) is essentially the 
relative error ol the sensitivity matrix S,(z) of onc-step 
delay compensator relative to the target sensitivity matrix 
S(2) 

L,(z) S(z) '|S,(z) 9(7)) (11) 

It IS known (Zhang and Freudenberg, 1991, Yc*n and 
Horowitz 1989) that complete loop recovery, i e making 
F,(z) = 0 for all z cannot be achieved in general by a 
constant observer gam L However, it is possible to identify 
an / that minimizes the one-step error transfer matrix / 1 ( 2 ) 
in the H. sense 

1 The p-step delay c ompen^ator 

Figure 1 illustrates a feedback control system where both 
sensor and control signals are delayed (ITiis might happen m 
a network based integrated control system of advanced 
aircraft where sensor, controller and actuator are not 
collocated ) If the sum of the upper bounds of distributed 
delays arc represented by a lumped delay ol p sampling 
intervals at the plant output, the sensory information 
available at the controller is at the ^th instant The 

p-step delay compensator (where the plant is completely 
controllable and observable), proposed by Luck and Ray 
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Fir, 3 Struclure of p-sicp delay compensator. 


(1990) and illustrated in Fig 3, has the following structure. 

( 12 ) 

where F is the state feehack gain matrix and the state 
estimate is based on the .sensory information up to the 
(k - p)\h instant given as: 


\k\k 


,7+2|A f, M 

^k /, M |A /j p\k -p I p p 


(13) 
. ,) 


The key idea of applying the LTR approach to the above 
/?'Stcp delay compensator is to tunc the loop transfer matrix 
such that the error transfer matrix (i e the different between 
the actual and target sensitivity matrices) is minimized in a 
certain sense. Derivation ol the loop transfer function of the 
p-step delay compensator is presented below as two 
propositions. 


Proposition 1 . The transfer matrix of the p-slep delay 
compensator (p ^ 1 ) from y^ to in the equation sei (13) is 
given as: 


C„(I) = FQ„ '(z) [z; - /v + U + '(r) 

(14) 

where 

ft^(z) = / + (/- , j<i>(z)fl/' for p - I. and f 2 ,(z) - /. 

(I-.) 


Proof of Proposition 1. The transfer matrix of the p-sU’p 
(p 1 ) compensator from to is obtained by substituting 
(13) into (12): 

“* = - M" 'i* ,,-f' X , I. 

I -N 

where the .summation on the right hand side reduces to zero 
for p =■ 1. From the above equation, it follows that 

“t = -FA'' '(A - LC)xi, ,,|t I, I 

-FA'' 'Ly^ ,,-F'£a'Bu^ , (lb) 


The Z-tran.sform of (16) is. 

U{z) = -FA'' '(>4-i.C).V(z)z "-hA’’ 'LY{z)z " 

p I 

-F'Z A'BIJ{z)z • (17) 

f-O 

and /-transform of the last equation in (13) yields: 

X{z) = (zl-Ayl.C) 'lfl(/(z) + LV(z)l, (18) 

where A’(z)=-Z|i*n ,), y(z) ~Z|>J, and (/(z)-Z|utl. 
Substituting (18) into (17) yields; 

V(z)--FA'' '{A-LC){zl-A +LC) '\BU{z)-y I.Y{z)]z <’ 

-FA’’ 'LY(z)z 

-f X ‘'^'BU{z)z ' 

r- fl 

which in turn can be simplified as: 

r '' ‘ 

(/(z)- /•■ z'' '/+ V A'Bz'’ ' -F 


+ A'’ '{zf-A + LC) 'BF 


xA" '(zl -AS LC) 'TT(z). (19) 

U.sing the relationship '^(z) = (z/-/4) ' and equating like 
powers of A, equation (15) can be expressed as: 


/ 3- ^ A'Bz ' 'F p ' 1, 


I'he proof is completed by substituting (20) into (19) and 
exercising a few algebraic operations. 

An alternative proof of Proposition 1 can be formulated 
following (Ishihara, I9S8) by expressing the transfer matrix 
of the p-step compensator from to as; 


c.( 2 )-- 


(zl-A -V LC) 'B + / + 


'U* M'B] 

Zj ,^i 
1-0 ^ -I 


x~--,-(zI-A+LC) '/.. 


Proposition 1. l^t the loop tran.sfer matrix of the p-step 
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delay compensated system at the plant input be expressed as 
4(z)-G,(z)G(z). 

where Gp(z) and G(z) arc defined in (14) and (2), 
respectively Then, 

L,(z) = (/ + £^(2)1 'l«(z)-£p(2)), (22) 

where 

£^(z) = (/ + <I>(z)LC 1 ‘^(z)£C ^(z)/f. 

( 21 ) 

IS the p-step error transfer matrix, and H(z) is the target 
loop transfer matrix as defined in (3) 

Proof of Proposition 2 By Proposition 1 the open loop 
transfer matrix of the p-step delay compensator is 

£^(z) = C,(z)C(z). 

= Fa/{z]^^zl-A + LC A BFQ/U) tJ 

X /X<I>(z)B 

= Fn/(z)[/ + '^^- ,'(/ + «l>(z)LC) '<l>(z) 

“1 > ap I 

xBFn„'(z)J — 

X {I + <t>(z)lX) '<P(z)LC<t>(z)8 
/+(/-^^, ,)0(z)flF+ 

x(/+<|)(z)£C) '<l>(z)flf] 
x(l + <t>(z)LC) ‘4>(z)/.C<t)(z)fl 
\l+F^(z)B-F^ 

■ I 

y (I + <t>{z)LC) '<P(z}l C<t>(z)B 
AP ' 

,U + ^(z)LC) '<t>(z)L( <t>(z)B 


The proof is completed by substituting (24) and (1) in the 
above equation 

Remark 1 After some algebraic manipulations E^{z) can be 
written as 


/•,(z) - Fr,.(z) 

where 

r ar ' 

W.)7'' + 

[(zI-A^LC) ^8 


(24) 



This Lshows that £^(z) can be separated in terms of the 
fulbslate feedback gam F and a function of I and p 

Remark 2 For a minimum-phase plant (ABC) and with 
det(rfl)^0, as the measurement noise convariancc matrix 
R approaches to zero and consequently L -* ABiC 8) ' 
following Shaked (i98S) the error matrix L^{z) in (23) in 
Proposition 2 simplifies to 


E^(z) = f{l~~l'j<b(z)B 


(26) 


For p = Ir the error matrix £^(jr) in (26) is identically equal 
to £,(z) in (11) 

Remark 3 It follows trom the expression of Lp{z) in 
Proposition 2 that the sensitivity matrix, \,(z), of the p-step 
delay compensator is 

5^(z)-[/ + L^(z)] (27) 


and the difference between tht sensitivity matrices of the 
p-step cximpcnsated and target systems is 

.S,,(z)-V(z) = [/ + A/(z)] ‘£„(z), (28) 

where 5 ( 2 ) is given m (4) T^is shows that the error transfer 
matrix £^(z) is indeed the error of the sensitivity matrix of 
the p-slcp delay compensator loop relative to that of the 
target loop 

Remark 4 It the plant model has an inherent delay of p, 
steps and the induced delay in the feedback loop amounts to 
P 2 steps such that the total delay is p~p\^p 2 then the 
resulting error transfer matrix satisfies equation (26) as the 
measurement noise tovarianLe is tuned to zero Fhis can be 
easily seen if the plant transfer matrix satisfies the following 
constraints 

CA‘B={) i 0 1 p, - 2 and det (C/4^'' ‘B)=^0 

(29) 


Giz)- (^{2)8 

= f >: fl 


f , <i.(z)fl 


(30) 


According to Shaked (198S) the observer gam L approaches 
A'^BiCA'' '8) ‘ as the measurement noise covariance 
approaches zero Applying this observer gain to the loop 
transfer matiix 

/,(z)-G^(z)G(z) 

and by using (14) m Proposition 1 the error matrix becomes 
identical to that in (26) uith a total delay of p - pj + p, 

I his shows that any inherent delay in the plant has the 
same effect on the error matrix as the induced delay in the 
feedback loop In other words we can either consider the 
delays in the feedback loop outside the plant or as a part of 
the plant model In the second case the original plant 
state spa(-c matrices {A 8 i ) with det (f /i) -^0 need to be 
augmented with p, steps of delay and the new plant 
stale space matrices {A 8 ( ) need to be formed which 

must satisfy the following conditions ( 1 ) C {zl ~ A ) ^8 - 
C(zl-A) '' and ( 11 ) complete controllability and 
observability Therefore wc have adopted the first approach 
of putting the lumped induced delay outside the plant model 

Remark S Dual results of Proposition 2 obtained by 
breaking the loop at the plant output instead oI plant input 
yield the Imip transfer matrix 

/^(z) (A2)G^{z) 

-lW(z)-/„(z)||/t/„(z)| 

where the target loop tianstcr matrix at the plant output (1 c 
the transfer matrix of the minimum variance filter) is 


and 


//(z)~ C^zU 


r^(z) - C <l>(z)/ - C tD(z)££‘l>(z)|/ + BF<Piz)\ 




/ 


The resulting loop sensitivity matrix of the p-step delay 
compensator at the plant output is 

S,(z)-[/ f £,(z)l ‘-iyT£,(z)l[/4//(z)] 

and the difference between the sensitivity matrices of the 
minimum variance filter and p step delay compensator is 

Sp{z)-S{z)-Lp(z)\l^H(z)] ■ 

H 2 ~mintmiz(iUon of the p-sfep error matrix 
For the onc-step predictor, it has been shown (Zhang and 
Freudenberg, 1991, Yen and Horowitz, 1989) that the 
steady-stale minimum-variance hlter gam with zero measure¬ 
ment noise IS obtained by minimizing the norm of the 
one-step error matrix F,(z) Analogous to the case ofp = 1, 
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we will show thdt the •.dine filter gam niinimires the H, norm 
of the p-error matrix £„(-) This result forms the basis tor 
synthesis of robust /)-step delay compensators (p > I) and is 
presented below as two propositions Then, the procedure 
for synthesizing the observer gam matrix is outlined 
For the purpose of tuning the minimum variance gam of 
the observer, we augment the discrete time, linear 
time-invariant plant model in (1) and (2) with (fictitious) 
plant and measurement noises ds 


= ^ ni) 

U-CX-fiV (12) 

where is a zero-mean white sequence with covariance 
matrix E{w^w[) and (nj is a zero-mean while 

sequence with covariance matrix 
combining the distributed delays within the control loop as a 
lumped delay of p sampling intervals at the sensor controller 
intertace, the stale estimate is rcdchncd as 

•«* * p-=F{H IV* ,,) ni) 

where the estimator is described by the set of equations (11) 

Fropusilion 1 Let the (zero mean) state estimation crroi be 
defined as 

^‘k\k I ~ '' ^k\k i> (^4) 

Then, 

p^k\k 

( 

^ '{A - IX YBHAA - I C y^A'f 

V II 

f 

i 1 V P-2 (■?<!) 

1 II 

V zt/’ '(/t lcybbAa-I ( y'A^' "" p-\ 

Proof of PropoMtion ^ From ihr plan! model in (11) and 
(12) and the filler equations (11) we can express the 
estimation error P;^|^ m terms ot the input sequence (wf,) 
when the system is mitiall> started at s - ^ 

k J 


A' 

'{A / ( )* w, 


k 1 

V 


(10) 


1 A / I I 


Hence the cross covariance of and defined as 

^/k(') = ^ jf'A I + i^l) can be expressed in the following 
form 


»i p \ 

p \ I 

-1- 2 -4’ ' '^BB'oim) 

f?l 1 s 

‘.A’’ '(/f - f,C)' ’’BB^ (f -p) 

= iA' 'BB^ (i - I p - 1) (37) 

fo ('-(1) 

Simildriy, the dulocovariancc of c* is obtained as 

KM 

( I ^ 

S «,„(v + m 4 p)(A - LCr'A''' 

nt (I 

P ^ 

' + S + \)A'' 

I 0 

+ * 

X RMAm+p)(A-U r'A'" 

<m-[) 

The proof is completed by setting s = 0 and then substituting 
(37) into (3H) 

Proposition 4 The norm of the sensitivity error matrix 


nr 


/;-2 (IK) 

p-\ 


dehned in (23) in Proposition 2, is minimized if the 
observer gam matrix L is identically equal to the standard 
steady-state minimum-variance filter gam matrix with no 
measurement noise 

Proof of Proposition 4 From the definition of norm 
(Francis, 1987) and F^(z) in (24) ol Remark 1, it follows 
that 

l|£^(2)lli = :^lrace (J Fr„(e'“)r,*(e'")F^ d£j) 

A” 




4 trace I 


'flj F’ do) 

A + LC ) 'fij 


ry 


A' 


,m,b] 


yUiy I) y 


\shcrc 


.|(e '“l-A + IC) 'flj F^dn) 

1 M(p), 

fin 


(39) 


M(p)- ' 


1 

2;r 


U) 


h\ y 


jUi^ I 1) 


< 

p y nr , 

X ^ jliii 11)^1 ^ ^ di2 j 


P -2 

p - 1 


Since the sum of the second integral and the third integral is 
identically equal to zero and the inlcgrals of the cross terms 
in the (ourlh term also vanish 


\\Ep(A 


1 

2;r 


trace 


M" ■ f V 


e'“/-/f+Lf) 'BB' 


X (c A‘i-a 4 f ( ) ' dt2|/<''' "^F 


7 


>shcre 


4-/V(p) 

(40) 


N{p)- 


trace I ^ j p 


lo 


p-1 


For given plant model siatc-spacc matrices A and B, if the 
feedback gain matrix F is fixed then the observer gam L is 
the only adjustable matrix which could change the Hj norm 
of the error matrix On the other hand, the covariance of the 
stale estimation error in Proposition 1 can be written, 
according to the discrcle-timc Plancherel theorem (francis, 
1987) as follows 

LM, Xl‘ ,} 'f l(e'“/-/f4/C) 'BB' 

Ju 

x(e "‘I~A + IC) ’'dQl/l"’ "'4.S(p) 


where 


S{p) = 


fs 

t.; 


A'BB'a'. p -2 


(41) 


A tompdrison of the H, norm of the error mdirix tp(z) in 
(40) with the trace of error covariance matrix 
p^k\k ‘n (41) reveals that any adjustment of L can 
only change the first term of both equations rhereforc, 
minimization of ||F (z )||2 is equivalent to that ol 
trace |f{(r^n ,,?*!* ,,|lrorVp,'0 

Next we proceed to find an optimal L that minimizes the 
trace |F{ey^ It follows from Lemma 1, given 

below, that the minimum variance filter gam (with p ~ 1) 
also minimizes the trace [£(p^I^ )| while p1 

Therelnn the optimal ('bserver gam /, that minimizes 
II^,(-^)1L same L that minimizes ||F,(z )||2 According 


«T0 |g,|.N 
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to Lemina 2, the steady-state minimum variance gain with no 
measurement noise is the optimal gain 

Lemma I for Proposition 4 For the p-step predictor 
(/7 ^ 1), if the estimation error is defined as ~ 

^k~^k\k p> the filter gain L which minimizes the 

covariance is identical to the minimum 

variance filter gam 

Proof of Lemma 1 The proof directly follows the 
derivations in Chapter 5 of Maybcck (1979) 

Lemma 2 for Proposition 4 For a fixed F, the //-, 
optimization of one-step predictor error matrix given as 

min ||£,(z)||2 - mm 11^(2/~ .4 + (42) 

Li 

IS the steady-staic minimum variance filler gam where the 
plant and measurement noise covariance matrices, Q and H, 
arc set to 

Q = BB^. W-limp/, (41) 

Proof of Lemma 2 The proof directly follows from the dual 
result of Theorem 3 1 in Yen and Horowitz (1989) 

4 Synthesis of the p-siep delay lompensator 

We propose a procedure for robust synthesis of the p-step 
delay compensator on the basis of the analytical results 
denved in Section 3 This two-stage procedure is structurally 
similar to the conventional LOR/LTR, and is described 
below 

• In the first stage, the target loop is designed assuming no 
delay (1 e p ~0) and lull-state feedback The controller 
gain IS optimized relative to a specified pcrlormancc index 
Tins IS accomplished by shaping the target loop transfer 
matrix and the sensitivity matrix 

• In the second stage, according to Proposition 4, the 
observer gain L is made identically equal to the 
steady-stale minimum variance filter gam This gam is 
calculated by solving the steady-state Riccati equation for 
a fictitious measurement noise convariance matrix R which 
IS set to p! where p is a tunable scalar parameter and I is 
the identity matrix The plant noise covariance matrix Q is 
set equal to For a given p, the loop transfer matrix 
and the sensitivity matrix of the p-step delay compensator, 
denved in Proposition 2 and Remark 4, arc computed 
Finally, p is tuned to achieve the specified performance 
and robustness requirements T*he similar procedure can 
be found in Maciejowski (1985) and Yen and Horowitz 
(1989) 

The above synthesis procedure for p~step delay compensa 
tion has been venhed by simulation of the flight control 
system of a fighter aircraft The lumped delay of p sampling 
intervals is a representation of the scnsor-to-controllcr and 
controller-to-actuator delays (that could result from a data 
communication network interconnecting specially distributed 
components of the flight control system) The (continuous 
time) plant model, linearized at the operating condition of 
7 62 k and 0 9 Mach, is given below (Safonov el ul , I9K1) 
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0 
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0 

0 

■)()]' 


0 
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0" 

Lo 

0 

0 

0 

30 
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0 0 

0 

1 0 
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where the six plant state variables are forward speed, angle 
of attack, pitch rate, attitude angle, elcvun actuator 
position, and canard actuator position, the two control inputs 
arc elcvon and canard signals, and the two output variables 
arc angle of attack and attitude angle The plant model was 
discretized at a sampling frequency of 1(KK)H/ which is 
sufficiently high relative to the desired operating frequency 
range of the closed loop system On the basis of the 
discretized plant model, and a given set ol performance and 
robustness specifications which can be lound in Safonov et al 
(1981), the control matrices for the target and delay- 
compensated systems were synthesized using a standard 
commerically available toolbox on a personal computer 

The full-state feedback regulator was designed using the 
standard LOR procedure with (7, =/,, and R^ 10 I he 
resulting optimal slate feedback gam, F is givtn below 

7 3S70 -19^95 -9 9957 
-4 2423 11685 6 6904 

-15 975 8 K425 0 7()Kn 

10 395 - 0 70716 8 2537 

For tuning the filter gain 111 the LI R design proceduic, the 
plant noise covariance was set as (J ~ and the 

(fictitious) measurement noise Lovarianct was set to R - pL 
where p was tuned in the range ot 10 10 ' Initiallv a 

senes of simulation experiments were Londmletl with no 
delays, 1 c p - i) lo verify that the sysiLin perfoimanLC is 
degraded as p is increased and the error rciovcrv becomes 
impossible 1 he ITR procedure yielded good resulls lor 
recovering the full-stale feedback robustness properties at 
p - 0 as expected Tht system pi*rlormantc improved as the 
measuiement noise (1 e p) was redueed iind the loop 
sensitivity matrix converged to the target scnsitivit) matrix as 
p was made to approach zero 

In the simulation experimenis tin sialar paiamcler was 
tuned lo adjust tht observer gain I foi both om step .ind 
two-step compensaliirs such that ths stability robustness tor 
each case (1 e p 1 and p -2) beais .1 dtsired sate margin 
relative lo the target system while the state leedbaek gam is 
retained at the optimal value h for ilic target svstem Ifit 
robustness margin m the frcquciKy range ol 10 KHM) 11/ was 
set lo 10 dB m terms of iht maximum singular saluc ol tlu 
loop transfer matrix lo oseicomc the dcinmcnial Llfecis cd 
loss of phase margin resulting from delays I Ik pLiitimclcr p 
in the design procedure of the delav Lompensatoi was 
adjusted lo satisfy this requireniLrit for robustness VUl 
respective values of p and the resulting filter gain / that were 
used for one step and two step delay compensators are given 
m Table 1 

Figure 4 shows a comparison ol the maximum and 

minimum singular values ol the loop transfer matnees foi Ihe 
one-step delay compensated (i e p 1) and two skp 
delay-compensated {i c p 2) systems (Note Ihe target 
loop with full stale feedback and no delay merely serves as a 
reference for the synthesis procedure ) I he minimurn 

singular values of the loop transfer matrices of the individual 
systems represent the lower bounds of their respective 

performance, and their maximum singular values represent 
the upper bounds ol stability robustness As slated earlier 
the observer gam matrices of both compensators are .id)uslcd 
by tuning p such that their maximum singular values are 
about 10 dB lower than that of the target system m the range 
of 10 KKKIHz The minimum singular values lor both 
eoinpcnsalors, as seen in I ig 4, are sigmheantly lowei than 
that for the target system I his is expected m view o( the 
reduction in observer gam resulting m deereused loop 

transfer gain ITowever, the two-step compensator consis 
tcnlly exhibits a lower performance (of aboul I2clB) rel.itivc 
to the one-step compensator because ol lower observer gain 
due to a higher value of the parameter p 

One major criterion in the synthesis procedure is to reduce 
the difference between the sensitivity matrices of Ihe target 
loop and the delay compensated loop (in Ihe //> sense) as 
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TaIUI 1 OBSI-RVf R CMINS AND ADTUSlINn PARAMI TFRS OFllIl p SI I P Dll AY t OMPF NS A [ OK 
Ont sltp cleld) LOinpensator p = 1 x 10 


/-0 42 X10" 0 2x10* 0 33xHr 0 (KK) -0 6252 x 10"* 0 34l x 10V 

V-0 41X10" 0(KK) 0 772xUP 0 1 x lO' 0 2701x10'' 1 217 x lOV 


/-6 6239 
1-4 1893 


Two slip delay tompcnsalnr p - S = 10 

3 4811 x 10 * 3 7994 x 10 1 2893 x10 * -13923 x10" 

1 1793 ^ 10 ' 3 032S x 10 2 2847 x 10 * 1 9327 x 10 


1 5987 X 10 / 
i 2122 X 10"/ 


much as possibk from this ptrspctlive Pig 5 shows 
comparisons of the maximum and minimum singular values 
of tht scnsitivitv matrices lor the target one step delay 
compensated fi e p ^ 1) and two step delay eompensated 
(i e p = 2) systems As shown m 1 ahk I tliL ohservLr gams 
of the delay compensators are tuned to thtir respective 
values such that the specihcation for stability robustness is 
satisfied This renders the minimum singular values of the 
seiisilivity matrices ol the two compensalors remain close to 
each other and above that of the target system except in the 
high frequency region where all ol them move inwirds 0 dB 
therefore the sensiiivii) of the delay compensators reduces 
at high frequency which is a very desirable feature from the 
point of view ol stability robustness On the other hand the 
perkirmancc of the delay compensator (in the low 
frequency range) degrades as p is mcrciscd It lollows from 
Fig 5 that o(/ |(e'' )) - 5 5 dB and o(/ )) - H 8 dB m 

the ficquency range of 10 10 H/ 


5 ( anclustorVi 

In many leal time dislrihuted control systems ‘^uch as 
advanced aircraft spacecrafi and autonomous manufacturing 
plants the sensor and contiol signals within a leedback loop 
arc subjected to delays mduecd by multiplexed data 
communication networks or due to priority interruption at 
thi control computer I rom this perspective a procedure has 
been dtvcloped for robus’ compensation of delays m 


multi input-multi output discrete time control systems This 
control synthesis procedure is an extension of the standard 
loop transfer recovery (I TR) from one step piediction to the 
general ease of p-step prediction (p ^1) and is earned out 
m two steps (1) evaluation ol the state leedback gam 
assuming full state feedback and no induced delays and (2) 
tuning ol the hlter gam by varying a scalar paiameter 
representing the (hctitious) measurement noise covariance 
matrix Ihe delay compensated s\stem albeit inferior m 
performance relative to the non delayed lull state feedback 
system can be synthesized for .i given value of p The 
synthesis procedure is demonstrated via simulation ol the 
flight control system of a fighlci aircraft 

The major conclusion derived Iroiii the analytical work 
reported m this paper is as lollows fhe concept of the 
steady state minimum variance filter gam as the H 
minimization solution of the difltrencc between the target 
sensitivity matrix and the actual sensitivity matrix lor 
one step prediction does net hold foi p step prediction 
(p 1) Ihe conclusions from the peispcclivc of control 
synthesis are (i) it is impossible to tune the observe'r gam lor 
a delayed system (i e p ' 1) to fully recover the tiigct loop 
eharactcrislics and (ii) if the delay compensated system is 
designed to satisfy a sptcilicd requirement ol stability 
robustness then its performance decreases as p increases 
Ihe results and conclusions are also applieahk il the plant 
model has mheient delays which have the s mie effects on the 
loop recovery error 
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Heuristically Enhanced Feedback Control off 
Constrained Systems: The Minimum Time 

Case* 
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Abstract —Recent advances in computer technology have 
spurred new inleresi in the use of feedback controllers based 
upon the use of on-line oplimization Still, the use of 
computers in the feedback loop has been hampered by the 
limited amount of time available for computations In this 
paper wc propose a feedback controller based upon the use 
of on-line contrained upiimization in the feedback loop. The 
optimization problem is simplified by making use of the 
special structure of time-optimal systems, resulting m a 
substantial dimensionality reduclion These results are used 
to show that the proposed eonirollct yields asympioiieally 
stable systems, provided that enough computation power is 
available to solve on-linc a con.slramtcd optimization 
problem considerably simpler than the original. 

1. JntmfJurtion 

A .simsiANTiAi NUMHi R of Control problems can be 
summarized as the problem ol designing a coni roller capable 
ol achieving aeeeptahle performance undci design con¬ 
straints. rhis statement looks deceptively simple, but even in 
the ease where the system under consideration is linear 
lime-invanani, the problem is far from solved. 

During the last decade, substantial progress has been 
achieved m the design ol linear controllers. By using a 
paramctiization of all internally stabilizing linear controllcis 
in terms of a stable transfer matrix Q, the problem of finding 
the “best" linear controller can be formulated as an 
optimization problem over the set of suitable Q (Boyd c( al., 
198H). In this formulation, additional spiecifieations can be 
imposed by further eonslrauiing the problem. However, most 
of these methods can address time-domain constraints only in 
a conservative fashion. Henc': if the consli aints arc tight this 
approach may fail to find a solution, even il the problem js 
feasible (in the sense of having a, perhaps non-lincar, 
solution). 

Classically, control engineers have dealt with time-domain 
constraints by allowing inputs to saturate, in the case of 
actuator constraints, and by .switching to a controller that 
attempts to move the sy.stern away from saturating 
constraints, in the case of stale constraints. Although Ihesc 
methods are relatively simple to use, they have several 
serious shortcomings, perhaps the most important being their 
inability to handle constraints in a general way. Hence, they 
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require ad hi)c tuning of .several parameters making extensive 
use of simulations. 

Alternahvely, the problem can be .slated a,s an 
optimization problem (f'rankena and Sivan, 1979). Then, 
mathematical programming techniques can be used to find a 
.'.olulion (sec for instance Zadch and Whalen, 1962; Fegley el 
al., 1971, and references therein). However, in most cases 
Ihc control law generated is an open-loop control that has to 
be recalculated entirely, with a considerable computational 
effort, if the system is disturbed. Conceivably, the set of 
open loop control laws could be used to generate a closed 
loop control law by computing and storing a complicated 
field of extremals (Judcl et al., 1987). However, this 
alternative requires extensive amounts of off-line computa¬ 
tion and of storage. 

Because of the difficulties with the optimal control 
approach, other design techniques, based upon using a 
Lyapunov function to de,sign a stabilizing controller, have 
been suggested (Gutman and Hagander, 1985). However, 
the.se techniques tend to be unncce.ssarily conservative. 
Moreover, several steps of the design procedure involve an 
extcn.sive trial and error process, without guarantee of 
success (see example 5.3 in Gutman and Hagander, 1985). 

Recently, several techniques that exploit the concept of 
maximally invanant sets to obtain sialic (Gutman and 
Cwikcl, 1986, Vassilaki ct al., 1988; Benzaouia and Burgat, 
1988; Bitsuns and Vassilaki, 1990; Blanchini, 1990; Sznaier, 
1990; Sznaier and Sidcris, B>91a) and dynamic (Sznaier and 
Sideris, 1991b; Sznaier, 1991) linear feedback controllers 
have been proposed. These controllers are particularly 
attractive due to their simplicity. However, it is clear that 
only a fraction of the feasible constrained problems admit a 
linear solution. Furthermore, performance considerations 
usually require the control vector to be on a constraint 
boundary and this clearly necessitates a non-linear controller 
capable of saturating. 

Finally, in the last few years, there has been a renewed 
interest in the use of feedback controllers based upon the use 
of on-line minimization. Although this idea was initially 
proposed as far back as 1964 (Dreyfus, 1964), its 
implementation has become possible only during the last few 
years, when the advances in computer technology made 
leasihlc the solution of realistically sized optimization 
problems in the limited time available. Consequently, 
theoretical results concerning the properties of the resulting 
closed-loop systems have started to emerge only recently. In 
Sznaier (1989) and Sznaier and Damborg (1990) wc 
presented a theoretical framework lo analyse the effects of 
using on-line optimization and we proposed a controller 
guaranteed lo yield a,symptotically stable systems. However, 
although these theoretical results represent a substantial 
advance over some previously used ad hoc techniques, in 
some cases they are overly conservative, requiring the 
on-line solution of a large oplimization problem. Since in 
most sampled control systems the amount of time available 
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between samples is very limited, this may preclude the use of 
the proposed controller in many applications 

In this paper we present j suboptimal feedback controller 
for the minimuin-time control of discrete time constrained 
systems Following the approach presented in Sznaier and 
Damborg (1990) this controller is based upon the solution, 
during the sampling interval, of a sequence of optimization 
problems We show that by making use of the special 
structure of time-optimal systems the proposed algorithm 
results m a significant reduction of the dimensionality of the 
optimization problem that must be solved on-line, hence 
allowing for the implementation of the controller lor 
realistically sized problems 

The paper is organized as follows in Section 2 wc 
introduce several required concepts and wc present a formal 
definition to our problem In Section ^ wc present the 
proposed feedback controller and the supporting theoretical 
results The main result of this section shosss that by 
relinquishing theoretical optimality we can find a stabilizing 
suboptimal controller that allows for a substantial reduction 
of the dimensionality of the optimization problem that must 
be solved on-line Finally in Section 4 we summarize our 
results and we indicate directions for future research 


2 Problem formulation and preliminar\ results 
2 1 Statement of the problem C onsider the linear, time 
invariant, controllable discrete lime systems modeled by the 
different equation 

T*, I -• *^=0.1 (^) 

with initial condition ti, and the constraints 

- {u |Wu|- a>} 

where y e w f y,. nj, 0 O f W e with full 
column rank indicates jc is a vector quantity and where 
the inequalities (C) should be interpreted in a component by 
component sense Furthermore assume as usual thal A ' 
exists Our objective is to find a sequence of admissible 
controls, Uf^\x^\ that minimizes the transit time to the origin 
Throughout the paper we will refer to this optimization 
problem as problem (P) and we will assume that it is well 
fK^sed in the sense of having a solution In Section ^ wc give 
a sufficient condition on ^ for (P) to be feasible 

2 2 Definitions and preliminary results In order to 
analyse the propeiscd controller wc need to introduce some* 
definitions and background theoretical results Wc begin by 
formalizing the concept of null controllable domain and by 
introducing a constraint-induced norm 

Definition 1 ITie Null Controllable domain of (S) is the set 
of all points f /f" that can be steered to the origin by 
applying a sequence of admissible controls c Q r R'^ such 
that G A. -0, 1 Ihe Null Controllable domain of 

(S) will be denoted as C ^ The Null Controllable domain in / 
or fewer steps will be denoted as c C , 

Definition 2 Fhe Minkowsky Functional (or gauge) p of a 
convex set “^containing the origin in its interior is defined by 

^(jt:) - inf j r ^ e 


A well-known result in functional analysis (see for instance 
Conway, 1990) establishes that p defines a seminorm in /?" 
Furthermore, when is balanced and compact, as in our 
case, the seminorm becomes a norm In the sequel wc will 
denote this norm as p(jp)-||r '(7^|| - ||jr|U where 
r==diag(y, Yp) 


Remark I The set ^ can be characterized as the unity ball in 
II II ^ Hence, a point x € iff ||ar|U ~ 1 

Next, we formalize the concept ot underestimate of the 
cost to go We will use this concept to determine a sequence 
of approximations that converges to the solution of (P) 

Definition 3 Let O be a convex open set containing the 
origin and such that for all the optimal liajectones starling 
out in O the constraints (C) are not effective and let Jn(x) 
be the optimal cost-to-go from the stale \ A function 
g such thal 

0- g(3)^y„(A) Va g 

g(x)^Ux)^^^o 

will be called an underestimate of the cost-tu-go relative to 
the set O 

Ihe following theorem where we show thal problem (P) 
can be exactly solved by solving a sequence ot suitable 
approximations provides the theoretical motivation tor th( 
prop(3sed controller 

Theorem 1 I ci O be the set intioduced in Dehnition 3 and 
let be the (unconstrained) optimal ira)ectory 

uiriesponding to the initial condition t O Finally let 
g(,x) R''R he an undercsiimatL iclative to <) Consider 
the (ollowing optimization problems 

mm {y( j) - N} = min ( V 11 ( 1 ) 

min{y^(r) m - 1 -I- ^>(x,„) / - mm V I 4 

V I A I ' 

m ■ /V (2) 

subjecl to (( ) where H- m, ) then an optimal 
traiectory A - 1 2 m which snlws (2) cxlLodcd by 
defining A - m 4 I N is also a solution of 

(1) provided thal x„, e () 

Proof The proof lollows b) noting that the theorem 
corresponds to a special ease of IliLorcm 1 in Sznaier and 
Damborg (19^;M)) with/j^(x^ yj I 

It lollows thal problem (P) can be exaetly solved by using 
the sampling inlerval to solve a sequence ol optimization 
problems of the form (2) with increasing rn until a number 
m„ and a trajectory such that c () .ire obtained 
However this approaeh presents the dilfieultv that the 
asymptoliL stability of the resulting elostd loop system can 
not be guaranteed when there is not enough time to rcaeh 
the region () 

In our previous work (Sznaier 19S9 S/naier and 
Damborg 19*^)) we solved this dHieully by imposing an 
additional eonstiaint (whieh does not afleel leasibility) and 
b> using an optimization procedure based upon the 
quantization of the eontrol space By quantizing the control 
space, the attainable domain from the initial eontlition can be 
represented as a tree with each node corresponding lo one ol 
the attainable stales Thus the original oplimal control 
problem is iceast as a tree problem that can be efficiently 
solved using heuristic search techniques based upon an 
underestimate of the cost to go (Winston 19H4) Wc 
successfully applied this idea to minimum lime and quadratic 
regulator problems However as we noted there in some 
cases the results based upon a worsi-case analysis, proved lo 
be overly conservative As a result the optimization problem 
quickly became unlractable I his phenomenon is illustrated 
in the following example 

2 3 A realistic problem Consider the minimum lime 
control ol an F-lIK) jet engine The system at intermediate 
power sea level static and PI A - 83“" can be represented by 
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Sznaier (19«‘»): 


0.H907 

0.0474 

0.0237 

0.9022 

A=l 0.0233 

-0.0149 

0.0 

0.0 

\^--0.0979 

0.3532 

Ao.o') 


^ 64.0 


y-| 20.0 


5.0 


d 8 .I 


0.0213 

-0,3704'^ 

0.0731 

-0.1973 

1-0.0357 

-0.5538 

0.2212 

0.0 

0.0527 

-3,906S/ 


G = /; 


0.0980 0.2616 0 .() 6 K 9 \ 
0.0202 O.lO.'i? 0 . 0.111 
0.H167 0.22.S.S O .0295 ) 
0.0 0.7788 0,0 

0-1662 0,6489 0.0295^ 


,1 .11.0; |i<,| 2(K».0). 


( 1 ) 


The MirnpJing time fur this system is 25 msec. In this case 
cqualinn (21) in S/naier and Damhorg (1990) yields !()' 
nodes for each level of the tree, which clearly precludes the 
real-time implementation of the algorithm proposed there. 


3. Proposed c(nUrol algorithm 

In this scetion we indicate how the special structure of 
time-optimal syslciTis can be used to reduce the dimen¬ 
sionality of the optimization problem that must be solved 
on-line, Specilically. we use a modification of the Discrete 
Time Minimum Principle to .show that the points that satisfy 
a necessary condition for optimality are the corners of a 
subset of £2. Hence, only these points need to be considered 
by the optimizatnm algorithm. 

3.1. The modified disereie ume minimum principle. We 
begin by extending the l ocal Discrete Miiiinuim Principle 
(Bulkovskii, 1%3) to Frechet differentiable lerminal-cosi 
functions and constraints of the form ((.'). Note that in its 
original form, the minimum principle requires the slate 
constraint set to be open, while in our framework it is 
compact. 

'Theorem 2. Consider the problem (P ) defined as 


min .S'(Cv)- 

Ok O 


admissible; and (iii) a necessary condition for optimality is: 


H{x;, ut, lyv*) = min y, i/^;). A == I, . . . , N ~ 1, 

y tr 

( 10 ) 

where 

a,(i,.k) = {u e 12; || 4 *, ,||.*--s(l - e) ||4'*||,a. (I D 

where is some neighborhood of y, e >0 is chosen such 
that is not empty and where * denotes the optimal 
trajectory. 


Proof. Feasibility follows from (9) and Theorem 3.1 in 
Gutman and Cwikel (1986) (or as a special case of Theorem 
2 in Sznaicr and Damborg (1990)), Since e \\xj\,^^ 1. 
From (9) |kv^||^< 1 and therefore e for all k. To prove 
(iii) we proceed by induction. From (6) it follows that there 
exists r >0 such that £2, is not empty. From the definition of 
£2, it follows that lor any e: £2|(x^,/c) there exists a 
neighborhood O,, c:£2| , not necessarily open, where (6) 
holds Hence, if e ~ f{x^, Uf.) e e Let X/, 

denote a non-optimal feasible trajectory obtained by 
employing the non-optimal control law i stage 
k^N-\. Consider a neighborhood c; £2 of ^ such 
that the stale constraints are satisfied for all the trajectories 
generated employing controls in For any such trajectory 
i, ij wc have: 


Hence; 


w 


s(x;j)^^s{x^). 

3S 

~ I I - W/V l) 


{J (^.v I * l)) - b- 

f I • t/s/ !• ' •' l) 






( 12 ) 


(13) 




(14) 


C’onsider now a neighborhood D,, c £2 of u* such that all the 
trajectories obtained by replacing by any other control in 
siUisfy the constraints and assume that (10) does not hold 
for some k < N - \ . Then, there exi.sts at least one trajectory 
A, y such that; 


fHxf, y*, i/v") < y;, Ip;). (15) 


rheiefore; 


subject to; 

.rj , I = /l,v, + fly* = /(a, . I/*), .t„, N 


where .S" is Frechet diflcrenliablc. Let the co .stales 
defined by the difference equation 




. ' “ y’k 

3x 






( .4 y ) 


Finally, define the Hamiltonian as: 

H{x^, y^. ipt ) -- Vi/(3 a. w/)- 

Then, if: 


max I min ||/1 a f Ru\\ fi [ * - 1. 

11.111-9-1 


0> vr(/(3;. Q,) /(a;. «;))- vr aa, , db) 


(sj) Hcncc: 


(h) 


|. “a*, |. V’a*, i) - 

HixC 

1. yA* M. 

V’a* . I i) 



= V’a", l,A-fA 1 

|. Ma' i 1 

1-v-:; 

i/(4a . !• yi. 11) 


VV he 

or; 

' 

“ V'a* 

“Li’ 

1 _ )Aiv,,<l). 

4a K 1 

(17) 

(7) 


7/(4A A1• y A11■ 

V'J.i)' 


1 1. 'Aa 1 1. Va f [)■ 

(IH) 


Using the same reasoning we 

have; 





h( 4*,2. y*‘, 2 . 

V'A*,2)- 

' mil, 


(19) 



(Aav I' y 1 ’ 

V'~ i)' 

'//(.rj, 

1 1 /V 1 . V ’ /V 1 ) ■ 



(«) 


From (19) it follows that 

0 > Vn 1 ^ AS, (20) 

against the hyptithesis that 3’(.v;i) was a minimum O 


the following resulLs hold; (i) problem (I”) i.'- feasible; (ii) for 1.2. toing Ihe modified disereie minimum principle. In this 

any initial condilion the resulting trajectory U J is section we indicate how to use the results of Theorem 2 to 
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generate a set of points that satisfy the qecessary conditions 
for optimality. In principle, we could apply the discrete 
minimum principle to problem (P') by taking 
and solving a sequence of problems, with increasing N, until 
a trajectory and a number A/,, such that = 0 are found. 
However note that Theorem 2 does not add any information 
to the problem since; 

r^' =4fl„=(). (21) 

oar 

It follows that v^^ = 0V/t, and hence the optimal trajectory 
corresponds to a '‘singular arc’\ Therefore, nothing can be 
inferred a priori about the controls. In order to be able to 
use the special structure of the problem, we would like the 
co-states, y/, to be non-zero. 

Consider now the special ca.se of problem (P') where 
5(X/v) = { |U/vll 2 (with fixed terminal time N). Let n be the 
dimension of the system (S) and assume that the initial 
condition is such that the origin cannot be reached in JV 
stages. Then, from (7); 


V'iv' I 


( 22 ) 


>0 





Flo. 1. Using the discrete minimum principle In limit the 
search. 


It follows (since A is regular) that yj*^OV/c. Furthermore, 
since (S) is controllable, C„ (null controllability region in n 
steps) has dimension n (Sznaier, 198*^). It follows that, by 
taking N large enough e C„. Hence an approximate 
solution to (P) can be found by .solving (P ) for N such that 
e C„ and by using Linear Programming to find the optimal 
trajectory from to the origin. 

Theorem 3. Fhe optimal control sequence = (wo ■ ■ wj i) 
that solves problem (P') is always in the btiundary of the .set 
Q|. Further, the control sequence can always be selected to 
be a corner point of such a set. 

Proof. Since the constraints are linear and ^0, it follows 
that the control nf that solves (10) belongs to the boundary 
of the set Q,(x/^, k). Further, except in the case of 
degeneracies, i.e. when y;* is parallel to one of the 
boundaries of ^2,, the control m* must be a corner point of 
the .set. In the case of degeneracies, all the points of the 
boundary parallel to the co-state yield the same value of the 
Hamiltonian and therefore the optimal control .still can be 
selected to be a corner of £2,. 

3-3. Algorithm H^^p. In this section we apply the results 
of Theorem 3 to obtain a sulxiptimal stabilizing feedback 
control law. From 3’heorem 3 it follows that problem (P') 
can be solved by u.sing the following algorithm. 

Algorithm H^p (Hcuristically Enhanced Control using the 
minimum principle) 

(1) Determine t for equation (11) (for instance using Linear 
Programming oft-line). Let ()=C\, and determine an 
underestimate g( ) relative to O. 

(2) Let x^ be the current stale of the system: 

(a) If Xf^E C„, null controllability region in n steps, solve 
problem (P) exactly using Linear Programming. 

(b) If Xf^ i C„ generate a tree by considering all possible 

irajcctorie.s starting at with controls that lie in the 
corners of the polytope Search the tree 

for a minimum cost trajectory to the origin, u.sing 
heuristic search algorithms and g( ) as heuristics. 

(c) If there is no more computation lime available for 
searching and the region 0 has not been reached, 
use the minimum partial co.st trajectory that ha.s 
been found. 

(3) Repeat step 2 until the region the origin is reached. 

Remark 2. Note that by solving problem (P') instead of (P) 
we are relinquishing optimality, strictly .speaking, since the 
trajectory that brings the system closer to C„ is not 
necessarily the trajectory thai will yield minimum transit time 
to the origin. However, for any “reasonable” problem, we 
would expect both trajectories to be close in the sense of 


yielding approximately equal transit times (in the next 
section we will provide an example where this expectation is 
met). 

Remark 3. Algorithm considers at each level of the tree 
only the control that lie in the corners of £ 2 ,, as illustrated m 
Fig. 1, therefore presenting a significant reduction of the 
dimensionality of the problem. However, the algorithm 
requires finding the vertices of a polytope given by a .set of 
inequalities m and this is a non-lrivial computational 
geometry problem. 

Theorem 4. The closed loop system resulting trom the 
application of algorithm to problem (P ) is asymptoti¬ 
cally stable, provided that there is enough computational 
power available to .search one level of the tree during the 
sampling interval. 

Proof. From Theorem 2 it follows that >5* mono- 

tonically decreasing in Hence the system is guaranteed to 
reach the region CV|. But, since the solution to (P) is known 
in this region, it follows that the exact co.st-io-go is a 
Lyapunov function for the system in Thus the resulting 
closed-loop system is asymptotically stable. O 

3.4, The heuristic for algorithm H^„p. In order to complete 
the de.scription of algorithm we need to provide a 

suitable underestimate g(x). In principle, an estimate of the 
number of stages necessary to reach the origin can be found 
based upon the singular value decomposition of the matrices 
A and B, using the same technique that we used in Sznaier 
and Damborg (1990). However, in many cases of practical 
interest such as the F-lOO jet engine of Section 2.3, the 
limitation in the problem is essentially given by the state 
constraints (i.e. the control authority is large). In this 
situation, this estimate yields an unrealistically low value for 
the transit time, rc.sulting in poor performance. 

The performance of the algorithm can be improved by 
considering an heuristic based upon experimental results. 
Recall that optimality depends on having, at each time 
interval, an underestimate g{x) of the cost-to-go. Consider 
now the Null Controllability regions (C^). It i.s clear that if 
they can be found and .stored, the true transit time to the 
origin is known. If the regions arc not known but a 
supraestimate C), such that C\ g is available, a suitable 
underestimate g{^), can be obtained by finding the largest k 
such that ^ 6 C'i and ^ i CJ, However, in general these 
supraestimates are difficult to find and characterize 
(Sznaier, 1989). Hence, it is desirable to u.sc a different 
heuristic, which does not require the use of these regions. 
From the convexity of Q and ^ it follows that the region.s 
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are convex. Therefore, a subestimate such that C\^ c C 
can be found by finding points in the region and taking 
Q as their convex hull. Once a subeslimaic of C\ is 
available, an estimate g(ar) of the cosl-io-go can be found by 
finding the largest k such that jp e Q and ^ ^ Note 

that this estimate is not an underestimate in the sense of 
Definition 3. Since ^ Q then y* e C* e and 
therefore is not necessarily Thus, Theorem 1 that 
guarantees that once the set O has been reached the true 
optimal trajectory has been found is no longer valid. 
However, if enough points of each region arc considered sc, 
that the subestimates are close to the true null controllability 
regions, then the control law generated by algorithm is 
also close to the true optimal control. 

3.5. Application to the realistic example. Figure 2 shows a 
comparison between the trajectories for the optimal control 
law and algorithm for Example 2.3. In this particular 
case, the optimal control law was computed off-line by 
solving a sequence of linear programming problems, while 
algorithm Hf^p was limited to computation time compatible 
with an on-line implementation. The value of c was .set to 
0,01 (using linear programming it was determined that the 
maximum value of e compatible with the constraints is 0.025) 
and each of the regions was found as the convex hull of 
32 points, using optimal trajectories generated olT-linc. Note 
that in spite of being limited to running time roughly two 
orders of magnitude smaller than the compulation lime used 
off-line to find the true optimal control solution, algorithm 
generates a solution that lakes only 25more time to 
get to the origin (25 vs 31 stages) 

Figure 3 shows the results ol applying algorithm 
when the heuristic is perfect (i.c. the exact transit time to the 
origin is known). By comparing F'igs 2 and 3 we see that most 
of the additional cost comes from the approximation made in 
Theorem 2, while the use of an estimate of the cosi-to-go 
based upon the subestimates (rather than a “true” 
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Fig. 2. Optimal control vs algorithm klf^p for the Example nl 
Section 3.4. 



Time (eec) 

Fig. 3 . Algorithm H^p with perfect information. 


underestimate a.s required by Theorem I) adds only one 
vSiagc to the total transit time. 

4. Conclusions 

Followi.ig the idea presented by Sznaier (1989) and 
Sznaier and Damborg (1990), in this paper we propose to 
address time-domain constraints by using a feedback 
controller based upon the on-line use of a dynamic- 
programming approach to solve a constrained optimization 
problem. Theoretical results are presented showing that this 
controller yields asymptotically stable systems, provided that 
the .solution to an optimization problem, considerably 
.simpler than the original, can be computed in real-time. 
Dimensionality problems common to dynamic programming 
approaches are circumvented by applying a suitably modified 
discrete lime minimum principle, which allow's for checking 
only the vertices of a polylope in control space. This 
polylope IS obtained by considering the intersection of the 
original control region with the region obtained by 
projecting the slate constraints into the control space. The 
proposed approach results in a substantial reduction of the 
dimensionality of the problem (two orders of magnitude for 
the case of the example presented m Section 2.3). Hence, the 
propo.scd algorithm presents a significant advantage over 
previous approaches that u.se the same idea, especially for 
cases, such as Example 2.3, where the time available for 
compulations is very limited. 

We believe that the algorithm presented in this paper shows 
great promise, especially for cases where the dimension of 
the .system is not small. Note however, that the algorithm 
requires the real time solution of two non-trivial computa¬ 
tional geometry problems in R" \ determining the inclusion of 
a point in a convex hull and finding all the vertices of a 
polytope. Recent work on trainable non-linear classifiers 
such as artificial neural nets and decision trees may prove 
valuable in solving the first problem. 

Perhaps the most serious limitation to the theory in its 
present torm arises from the implicit assumptions that the 
model o( the system is perfectly known. Since most realistic 
problems involve some degree of uncertainty, clearly this 
assumption limits the domain of application of the proposed 
controller. We arc currently working on a technique, 
patterned along the lines of the Norm Based Robust Control 
Iranicwork inr*'odiiLed by Sznaier (1990), to guarantee 
robustnc.ss margins for the resulting closed-loop system. A 
future paper is planned to report these results. 
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Abstract —Ti) justify the introduction of new sensor 
hardware, the potential performance improvement iimsi be 
sufficient lo warrant the additional expense and potential 
design risks that a change in system architecture entails. I his 
paper compares the response of an image-enhanced tracker 
with that obtained from a conventional sensor suite Because 
the underlying motion model is nonlinear and the tracking 
environment non-Gaussian, performance is seenario depend 
enl. For a problem of tracking an agile target in the plane, it 
is shown that a system containing an imager is sviperior lo an 
orthodox tracker without one 

1 Introduction 

It is oiTr.N oimt ui i to predict the uilimate impact of 
hardware innovatiiMis on system design When a novel device 
accomplishes a previously feasible task, iis role is apparent. 
However, when it improves on one aspect of a task already 
achievable lo some oegree using standard methods, its utility 
is not as clear. 7o justify introducing new hardware, it must 
be Lvidenl that the ensuing svsiem has capabilities 
significantly greater than those found in more lamiliar 
systems, and lo achieve this improvement new archileelures 
may be required. A situation of this type has developed as 
FLIR sensors have been introduced into tracking loops. An 
imaging sensor produces a sequence ol "pictures” of the 
target. An image processor interprets these scenes, and 
identifies target attributes that are naturally manifested in 
properties of target extent. Figure 1 shows a tracking 
architecture proposed m several rcfcrenecs as a means for 
integrating the information derived from an imager with that 
from conventional sensors (Kendrick cl ui, I he 

lower path shows the usual mapping from center-ol-rdlcction 
observations to an estimate of target location. The upper 
path is image-specific, and u.sc.s observations of target slia|)e 
lo augment the point-location data in the lower paih. Note 
that the information flow in the upper path is a complement 
to, not a duplicate of, that in the lower. 

Merging these disparate data requires considerable care if 
the potential performance in prnvement is lo he leali/ed. 
The additional complexity and expense implicit in the 
architecture shown in Fig. 1 must be jiislilicd if imagers are 
to he acccplcd into tracking systems. This is not an easy task 
because the commonly used performance indices arc dilfkatli 
to compute for realistie representations of an encounter, f he 
models required to faithlully portray the peculiarities ol 
typical motion paths are nonlinear, non-Gaussian, and quite 
scenario dependent. fO gain insight into the idiosyneraeics ol 
image-enhanced trackers, and to contrast them with those ol 
their more established brethren, this paper considers the 
response in a spccihc vignette; an agile target moving in the 
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plane with intermillenl maneuvers is trucked from a fixed 
location, I'he object of this study is the quantification of the 
utility of the upper path in Fig. I. The detailed conclusions 
are encounter-specific, but the substance of the results holds 
in more general circumstances. 

2 . Modcl-husi d trackers 

In mode* based liackers, as the name implies, an analytical 
encounter model is used as an intermediary in deriving 
algorithms lor estimation and prediction. Analysis usually 
make no pretence regarding the completeness of the model; 
judging it on the basis of the adequacy of the algorithms lo 
which It leads In its simplest form, the model relates 
spatio-temporal measurements of target location through an 
ostensible motion equation which has a sufficiently intuitive 
parametrization to permit natural modification as the 
conditions of the encounter change. Model-based slate 
estimation algorithms have a long history, and considerable 
experience exists which points lo both the strengths and the 
weaknesses of the approach (Maybeck, 1982). 3'he most 
common formulation uses a linear Gauss-Markov (LGM) 
motion model for the slate process {.v,). The observation 
process (y,) is often derived from a radar or similar device 
that provides a measurement albeit a noisy one—of the 
ccntcr-of-rellection location of the target; e g. range and 
bearing 

There is a well known solulion to the resulting csiimLilion 
problem - the (extended) Kalman filler. Denote the filtration 
generated by j y,} by {T,} Then the conditional mean of the 
encounter slate (i, ^ F.{x, | >'j) is given by the algorithm 

de, - A.*,d/T ' dv„ (2.1) 

where A is Ihc dynamic malrix m the motion model, D is the 
observation gam, is the intensity of the observation noise, 
dv^ Ihs* increment nf (he innovations process, and T,, is the 
error eovarianee matrix This algorithm has been used in 
non l.GM applications as well. Unfortunately, it is difficult to 
compare estimation algorithms in a non-Gaussian environ¬ 
ment. The error eovarianee cannot be computed strictly, and 
even if a pseudo eovarianee is computed in its stead, it is not 
always clear how well it quantifies estimator accuracy. To 
illustrate these ideas in the context of a specific example, 
consider Ihc problem of tracking the motion of an agile 
target in Ihc v-y plane. A simple motion model (or an 
eva.sive target moving at essentially constant speed is; 

-(::)■ 

( 2 - 2 ) 

where (A',, F,} are position coordinates, and (F,, } are 

associated velocities. The target is assumed lo be subject to 
two types nf acceleration; a wide band omni-directional 
acceleration described by the Brownian motion w;.) of 
intensity VV'. and a maneuver acceleration represented by the 
turn rate process }. The former fils well within the LGM 
Iramewoik, but the latter is troublesome in two respects. 
First, it enters into the target dynamics as a stale multiplier. 


0 "(l (II 0 1 
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and consequently renders the dynamic equation nonlinear in 
the augmented state space Secondly the sample paths of the 
maneuver accelerations are usually described as jump 
processes, and are clearly non Craussian (Cloutier et al 
1989) In volatile encounters this lateral component is of 
primary importance thus making the LKF formalism of 
doubtful applicability (Speyer ei al 1990) 

To complete the motion model the dynamics of the turn 
rale process must bo quantihed Suppose that the set of 
povsibie maneuvers is partitioned into A levels e 
{fl, and let (nr,) be the maneuver indicator 

process, a, = e^ if - a, where is the unit vector in the 
/Ih direction m the appropnatc space The simplest way to 
delineate the maneuver process is to suppose the successive 
maneuver modes arc represented by a Markov chain with 
transition rate matrix q y'C i J ~ i J + 

o(f) Tins IS a iractablc model the parameters of which can 
be expressed in terms of the mean lifetime of each maneuver 
mode and the transition probabilities between modes 
Before analysing a tracker with the lull sensor suite 
consider an architecture which has no image path i e the 
tracker avails itself of just the range bearing measurement as 
shown m the lower path m Fig I These measurements will 
be taken at a fixed rale (10 samples sec ‘) with independent 
errors Both the target model and the observation equation 
arc nonlinear thus precluding the explicit evaluation of the 
conditional mean ol the target state Additionally the 
maneuver accelerations have a non Gaussian jump charac 
ter Nevertheless the need to achieve high accuracy tracking 
in applications like this had led to the development and 
iinplemcntatic^n of plausible algorithms The most common 
approach is also the simplest, the maneuvers arc neglected 
The EKF paradigm (linear motion nonlinear observation) is 
now directly applicable A four dimensional filter is sufficient 
to generate the slate estimate (position and velocity) [he 
filter gam depends parametrically on the intensity of the 
motion disturbance and the family of hlters will be denoted 
by EKFh/(IV) The performance ot the estimator tends to be 
good if conditions are benign but by overlooking what is in 
fact the mam ingredient in ihc acceleration, the filter is slow 
to recognize a turn-induced path change, and large errors 
build The basic reason for this belated response to lateral 
acceleration is that the motion equation neglects maneuvers 
thus leading to a small error covariance The simplest 
way to avoid the large error after maneuver onset is to 
arbitrarily increase W in equation (2 2), i c add pscudonoisc 
(Maybeck, 1982) Unfortunately the same higher gam that 
makes the filter more responsive, and hence a better tracker 
during the maneuver is also a gain that magnifies the 
observation noise during quiescent conditions I he ex 
igcncics created by the need for high accuracy tracking 


during quiescent operation must be balanced against the 
possibility of large errors and loss o( lock during turns and 
with the single degree ol freedom found in LKFvv this is 
difficult to do 

A slight increase in model complexity gives a more realistic 
motion equation Rather than neglecting the turn rate i( can 
be replaced by a Gaussian surrogate If the initial probability 
distribution is selected correctly } can be represented by 
a wide sense stationary process with a jxiwcr spectral density 
4>^(u;) that can he computed using familiar techniques I he 
resulting acceleration dynamics ean be integrated into the 
position-velocity dynamics by increasing the size ol both the 
state vector anel the Brownian motion (Berg 19HT) If the 
shaping filter is first order the turn rale model is u two 
parameter family with r the time eonstanl of the shaping 
filter and AfO) /{(oi,^) } the intensity ol the turn rate 
process This leads naturallv to a two paramcler family of 
filters FKF,(A(()) r) with the correct choice depending on 
the volatility of the encounter I hough the motion equation 
IS nonlinear but it can be linearized using eoiiventional 
methods Furthermore the acceleration dynamics arc 
included explicitly, and the correlations in the acceleration 
direction arc mainlamed during Ihght (Ilcpner and Geenng 
1990) 

In the dual path tracking architecture shown in T ig 1 the 
sensor suite contains an imager in addition to the 
conventional range bearing sensnis Imagers are attractive 
because they gcneralc data that bear on an aspect of target 
motion different irom that usually processed in a tracking 
loop An estimation algorithm proj^Hiscd m Swordcr and 
Hutchins (1992) used the shape inlormation produced by the 
imager for a faster indication of turning motion than that 
obtainable from location based residuals The imager is a 
pattern classifier and its errors are misclassificalions 
Suppose that the imager proccssoi collects data frames at a 
rale of A frames see ' and classifies target orientation into L 
equally spaced orientation bins The output of the processor 
can be written as an L dimensional counting process (z,) the 
ith component of which is the number of times the target has 
been placed in bin i on the interval [0 t] I his sequence of 
counts (or symbols) can be interpreted by a temporal 
processor to give the relative likelihoods ol the various turn 
rate hypotheses I his development is earned out in Swordcr 
and Hutchins (1990) and can be summarized as follows Let 
the orientation indicator process be given by {p,}, p, ” 
the current target orientation is in the ah bin The quality of 
the imager is determined by the Lx L discernability matrix 
P-[P,y| where is the probability that symbol i will be 
generated by the processor if / is the target orientation at 
time of image creation, ic P,^ = T^( Az, # 0 | p - The 
fidelity of image interpretation is a function of the imager 
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qualities, t> and A, and on the volatility of the encounter 
Xhis latter can be described by a comprehensive KL KL 
dimensional transition rate matrix Q which depends on the 
generator of the acceleration chain and the generator of the 
possibly random angular motions for a given acceleration 
(Sworder and Hutchins. I9H9) Denote the hitralion 
generated by (z,) by {Z,) The conditional probabilities 
of the various turn rate hypotheses are given by 
= subject to di^), = ^)0,di + (diag((^,)- 

®l/)A'diag(A(l;,f 'dz, where A AP 

I IS the identity matrix, and 1 is a unity vector 
In iKe tracker architecture propo^d m Hutchins and 
Sworder (1992), the turn rate terms were treated as additive 
disturbances, and their Z, conditional means were added to 
the conventional nonmaneuvering LKF This proved to be 
a useful approach, but had poor performanet subsequeni to 
the maneuver This deficiency follows directly from the fact 
that a small value of W results m a slow decay of any errors 
created during the maneuver But the prcKess noise intensity 
IV can be augmented proportionally to acceleration 
uncertainty (Var )^} transformed 

into the Cartesian coordinate system The corresponding 
value of IS that with the increased W Through this simple 
artifice a maneuver adaptive irasker is created Denote this 
image enhanced adaptive filler by TKF^ 

1 A Lomparison of the filters for a speuju vij^oietu 
It IS diffieuit to compare Iht performance of the v inous 
filters described in Section 2 Thcic is an miuiiive sense that 
as the model becomes more represenldtive o( the actual 
motion the performanee of a iratkcr based upon it should 
improve Still it is not elcar that LKF, is enough better to 
justify Ihc hardware/soltware expense required to implement 
it To illustrate the contrasts in performanei consider a 
specific vignette with a target imliallv located at (\ , V^,) 
(10 b 4)KM and initial veloeit> (i\„ i.^„) _ (S 0 -114) 
m sec ^ Moving at constant velocity for 30 sec a 0 ^ g 
turn IS made for 8 sec after which it returns to constant 
velocity motion The omnidirectional atccleralions will be 
assumed to be slight fdnj - (divj -001 d/ (msec)" 
Figure 2 shows Ihc target path (truth) wilhoul the wide band 
acceleration 

fracking will be accomplished with a 10 samples sec ‘ 
range-bearing sensor located at the origin with Gaussian 
errors of standard deviation S 0 m and 0 25” respectively 
independent in time and type Ihis measurement specifita 
lion suffices to determine the basic FKF^ If an imager is 
dvaiJablc it will be eolloealed and make measurements at 
the same rale, i e A ~ 10 frames see ' placing the target in 
angular bins of width 22 5“ (number of angular bins L - 16) 
To complete the measurement model the imager errois must 


be quantified The error taxonomy includes errors of three 
types uniform errors in which the output symbol is 
uniformly distributed without regard to the true orientation, 
1 0% in toto Nearest neighbor errors in which the ostensible 
orientation is placed in the neighboring angular bin, 1 H% in 
toto Silhouette projection errors in which the image 
processor places the target orientation in the bin situated 
symmetrically with respect to a line perpendicular to the 
line of-sjght, 10% This Iasi error is particularly important 
when image processing is dependent upon edge location 
because the target silhouette is invariant to the indicated 
shift If the error eaicgones are assumed to be independent 
in time and kind the disccrnibility matrix can be deduced 
directly 

To determine FKF, and the more sophisticated image- 
based tracker EKF^ the maneuver dvnainics must be 
specified It will be supposed that {a, = 2b see * turn right, 
a^=no turn 01 - 20 ” see ' turn left) and that the chain 
(or,) IS symmetric about tht null mode i e the probabilistic 
character of a, a, and art identical The elements of the 
g matrix are determined joinlb by the mean sojourn times in 
each acceleration mode {\ i-l 2 3 v,-v,} and the 

transition probabilities from a maneuver to the nonman 
euvenng mode 1 el - y‘(a, - e. | cr^ - c, and An, ^^0) 
Then i/, measures the fraction of limes that a maneuver ends 
in a coasting motion, e g 0 implies pure jinking mohon 
and q, - 1 always interjects coasting between turns 
'ipecifieally consider encounters with the tempos shown in 
Table 1 F neounter 3 is the most Llusely m dched to the 
vignette As mentioned earlier the power spectral density 
can be computed for each eneountei and IhL shaping filter 
determined using conventional methods Note that R{{)) 

2 fi,/(— £,) can be calculated from the stationary 
distribution of the turn rale chain T able 2 gives the 
parameters of the shaping hiters for each encounter and 
from this the family of associated set of LKF^ can be 
determined 

Performance is difficult to quantify analytically The 
motion model and the observation are nonlinear Further 
more if Ihe vignette is partitioned into pre post and 
maneuver coincident segments the relative performance of 
the trackers differs as a function of (he portion of the path 
that IS being reviewed A good indication of performanee 
can however be determined by simulation Figure 2 shows 
tht position response of selected estimators in the x-y plane 
To display iratker biases a 20 tiial mean sample palh is 
shown rather than a (noisier) single sample path Ihe 
used the indicated Brownian intensity while both F'KF^ and 
FKF, arc tunsd to temiw 3 All of the hllers perform well 
in Iht premanciiver phase bill both LKF^, and f KF,, have 
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Fig 1 Velocity estimates for tliftcrcnt trackers 


difiiculty following a turn, the latter making a pirouctie at 
apogee Only LKF,. pertorms satisfactorily in the post 
maneuver initiation phase of the engagement Figure ^ shous 
mean velocity profiles in the plane and thev arc not as 

good Tlie filter tends to assign observation residuals to 
velocity to a much greater degree than one might expect 
TTiere is conspicuous misidentification ol the velocity by both 
EKFyv(0 01) and rKF,,(5 76" 6 2^) with neithei recogni/ing 
the correct sense of rotation the piiouette in the position 
path providing a velocity profile that is the mirror image ol 
the actual path 

4 Perjormanie scnstuvities 

The smoothed s.imple functions shown in the previous 
section give an indication of the relative pcriormace ol tin 
various trackers 1 he estimators were selected on the basis ot 
the actual encounter dynamics Since both FKF^ and I 
are parametric families it might be hoped (hat improved 
tracking could be obtained by tuning them to a 
pseudo-encounter, i e tuning lor response rathci than trying 
to mimic the encounter dynamics (Chang and labac/^nski 
1989) II the vignette were not multimodal (his would 
certainly be possible, but such ad luu adiustmenl ollen 


achieves improved performance in one set ol conditions at 
the expense of degraded performance clscwhcic To quantify 
performance, consider the paired tracking indices the 
average error m the premaneuver phase, and the 

average peak error on the path, The former measures 

quiescent pcrlormancc When there arc no maneuvers, it is 
obviously advantageous not to expect any. and so it would be 
expected that an FKFvi with a low gam (i e small W) would 
be attractive The latter is the maximum error on the 
traiectory From the sample curves of Section 'I, it is evident 
that the time of occurrence ol this maximum error will be 
during or soon after the maneuver To mmimi/c a high 
filter gam would be appropriate I hese antithetical demiinds 
make tracker design difhcull The performance indices are 
dilTicult to evaluate analytically since the lack of LOM 
structure precludes finding the requisite moments For ihis 
reason, the indices were determined using a simulation with 
a ^(J-tnal sample average replacing the indicated expectation 
Figure 4 shows the perlormanee of EKF^^ lor a range of 
W, along with the image-enhanced tracker t KF,. In order 
to mdke the comparison of the filters easier, an icon has been 
used to display the performance of FKF,. This (tiler is 
designed on the basis ol a single intensity (W - 0 01), but the 
icon lor IS shown to the left of the graph and that lor 
^An.x shown to the right, thus placing it neai the best 
performaiiLC attained with I as W vanes Irom 0 01 to 

UK) Increasing W reduces the maximum position error Jrom 
more than i(K)m to less than 20 m Ilowcvei quicsLCiU error 
doubles toi the same change Flu best balance between 
nominal operation and maneuver tiacking is achieved with a 
pscudonoisc around 10 \el even with this tuning LKF^ 
IS not as good as the image enh.mced filter in cithei phase of 
the scenario for example*, for IKI^IIO) (f/.JO) 
//,„,J0))-t6 n)m while lor EKF, the comparable hguies 
are (//^^(O) //„,,,,(())) (4 12) m A improvement in 

quiescent tracking is achieved while maximum Itacking eiior 
IS maintained at the same lcve*l 

Figure S shows lour dillerent pcrlormancc curves lor I Kh ^ 
with .in iconic display lor F Kl , as beiore I he* mlensitv ol 
the pseudonnise is related to the stand.ud deviation )1 llu 
turn rale and this is the abscissa ol the curve Again a 
trade oil must be made since the lime const int best suited to 
nominal operation is not so well adapted lo sudden 
maneuvers A good choiee would he (/\’(()) t) (10 2) 
With these parameters (/Y,,(()) FF,^M„f0)) (S 28) m which is 
comparable with the performanci ol IKFvvIl^O in the 
coasting phase but not nearly as good duiiiig a turn It is 
interesting to note the strong resonance in Fig 5 lur 
T -20 see Figure h shows that the mLiximiim error is not 
maneuvi r coineideni but occurs in the postm.meuve i coast 
I he lag in the shaping Idler causes the tracker lo keep 
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deg /scr 


Pifi. 5. Pertormancc of EKF^ as a function of (/^(U))"*' and t. 



Fi(i. 6. Performance of EKF^t/?(()), 20) for different values 
of h(i)). 


turning tftcr the maneuver has ceased, thereby creating a 
large overshoot. 

5. C onclusions 

New sensors require novel architectures to capture the 
potential for performance enhancement. It has been shown 
that the response of an image-enhanced tracking loop is 
superior to that attainable from conventional trackers 
without imaging capability. Because of the non-Gaussian 
nature of the problem, and the nonlinearilies in the motion 
model, it is difficult to make inclusive generalizations about 
relative performance. It is evident though that an imager 
may achieve good quie.scent performance without a severe 
turning penalty. This exchange is dilficull to make with 
conventional algorithms. The two modified LGM proecdure.s 
had similar performance, and in this application, increasing 
the order of the estimator to shape the ersatz Gau.ssian 
disturbance is not worth the additional complexity. This 
result is due primarily to the high accuracy of the location 
sensors; if the signal-to-noise ratio is high enough, even 


primitive models result in good estimators. The image-based 
architecture provides 40% better quie.scent performance with 
a somewhat less pronounced improvement during a turn, 
This superior response certainly warrants capture 
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Adaptive Frequency Response Identification 
Using the Lagrange Filter* 

YU TANGt 

Key Words—Identification, frequency response; tillering; adaptive control. 


Abstract —Ihis paper addresses the problem of identifying 
the frequency response of a stable discrete-time system at a 
set of arbitrary frequencies. The output of the identifier may 
be used directly in control synthesis based on points of 
frequency response, or to give a frequency response estimate 
together with an bounding function for robust control. 
Central to our idea is Lagrange interpolation Conditions for 
exponential convergence of the paiHmctcr estimates arc 
given m terms of energy n( input signal. Simulalions arc 
presented to illustrate the analytical results 


1. Introduction 

RhUL-NTiY, F.FIORTS to combiiic robust control synthesis with 
adaptive techniques have been made to handle the 
robustness of adaptive control systems (e g. Ortega and 
Tang, 198^, and the references cited there). A key element 
in such approaches is an cslimatoi giving inlormation on the 
frequency response ol the plant, e g a set of points of 
frequency response, a frequency response estimate, and/or a 
bounding function of the modeling error This paper deals 
with estimating frequency response at a set of arbitrary 
frequencies of interest. 

Central to our idea is l.agrange inierpolatinn. 1 his gives a 
parametric model finterpolation model) which interpolates 
the frequency response at a set of user-defined arbitraly 
frequencies. Then the interpolation model is estimated using 
a recursive identification algorithm, and the frequency 
response estimate at these points is calculated via a linear 
map These estimates may be used directly for on-line tuning 
of controllers (Astrdm and Hagglund, 1984; Tang, 1989) or, 
to be further elaborated, to give a Ircqucncy response 
estimate and/or a bounding function of the modeling error 
for robust control purposes (Kosut, 19KH; Helmicki ct a . 
1989; Ciu and Khargonekar, 1992). 

Frequency response identincatinii via interpolation ap¬ 
parently was first done m Biimead and Anderson (1981), 
where the frequency sampling filter, a special case of the 
Lagrange filler when the interpolation points are equally 
spaced around the unit circle, is used a> the interpolation 
model. .\n bound between the plant and the 

interpolation model wa.s later established in ar er an 
Bilmcad (IWa) and Helmicki el at. (IflW) As 
in Parker and Biimead (1987b) and Helmicki et a ( ), ir> 

(he prc.sence of disturbances, the error etween c 
estimated interpoliilion model and Ihc plant is no onger 
uniformly bounded in N (the number of interpellation 
poinls). To remedy this, a class of two-stage idenlih^cation 
schemes is proptised, see, e.g. Helmicki ct ai (198 ), tu an 
Khargonekar (1992) and the references cited there. 


* Received 19 June 1991; revised 25 June 1992; received in 
final form 1 July 1992. The original version ol this paper was 
not presented at any IFAC meeting. This paper was 
recommended for publication in revised lorm by Associate 
Editor R. R. Bitmead under the direction of Editor P. 
Parks. 

tDEPFl-UNAM, P.O. Box 70-256, 04510 Mexico D.F., 
Mexico. 


This paper extends the results of Biimead and Anderson 
(1981) and Parker and Bitmead (19R7d) to the case where the 
interpolation poinls are arbitrarily spaced around ihe unit 
circle. Tl is shown that if the input signal is rich in a 
frequency band, then the corresponding parameter estimates 
will converge to a neighborhood of their true values, which 
reduces to a single point (the true value) in the absence of 
the signal interference and external disturbances. T he 
remainder ol the paper is organized as follows, the Lagrange 
filter is described in Section 2. The idenliricalion scheme is 
given in Section 3 Section 4 is devoted to establish the 
convergence of the parameter estimates upon the input 
signal. Simulation results arc presented in Section 5, and in 
Section 6, concluding remarks arc given. 


2 The Laf’ran^e filter 

Let G(z) be a transfer function with real coefficients, 
analytic in l 2 r>p,„ ()<p„<:l. and its impulse 

rcsfxmsc. We want to approximate it by CV[ 2 ) by means of 
Lagrange interpolation (Geddes and Mason, 1975): 


G'( 2 )=^ V G( 2 (^)/^(z), ( 2 . 1 ) 


where {z, = 0 - w, < 2;r. is a set of N 

distinct points around the unit circle, and A(z) are the 
interpolation filters, 


N 1 

n .(i - 2 ., 

n (i- 2,„2, 


1 


(2.2 a) 


It IS easy to .see that the frequency response of ^(z) satisfies 

(I form=k, (2 2b) 

U) form^k. 

Since the transfer function G'(z) has real coefficients, the 
information of the frequency response in [0, ;trl is sufficient to 
determine the whole frequency response. We will consider 
the case where the interpolation points arc symmeBically 
.soaced around Ihc unil circle respect to the real axis with 
= 1. The ease where z,, — c'’*’' for 0 can be treated 
similarly. 

Let i2 be the set of interpolation points symmetrically 
.spaced around the unit circle respect to the real axis (N odd), 

- {z* - e'*"* I - 0. -»v/v A. 

for it-1,2.L -(N-l)/2}. (2.3) 

Define 

A, = Rie|0(z*)]. = IlM [C(zJl. for *=0,1- L, 

( 2 . 4 ) 
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n't = ^ cos w* j[c 05 (>*) - cos 

L 

X n |cos(ivj-cos(iv„)], 

m ^ I 
rn-^k 

Pk=~ \ (Y»'»)[tO'’(?‘^»)-‘-'“s(V4)l 

/ 

X Y[ [cos (»v^) - cos (»v^)l, lor A:- 1 , 2 , , L, 

m 1 

(2 Sh) 

then the inlerpolation model (2 1 ) can he expressed as 


G'U)='^ (c,+d,z 


where the coefficients are calculated from 


Remark 2 1 A special case is when the interpolation points 
are equally spaced around the unit circle, i e -2nk/N, 
which gives a frequency sampling implcmcniation ol the 
interpolation model (Bitmcad and Anderson, 19K1) In this 
implementation of the interpolation model (Bitmcad and 
Anderson, 1981). By simple calculations it can be shown that 
in (2 5), ~ I, - 0, for A - 0 , 1. , f , and the tomb 

filter (2 9) reduces to A/^lz) = (1 - z^)IN 

Remark 2 2 Since there is a cancellation of pole zero at the 
unit circle in (2 2 a), to avoid instability of the filters, in the 
implementation ol (2 S) and (2 12 ), z should be replaced by 
z/a^, ()< 1 , which corresponds to a cancellation ol 

pole-zero at the circle | 7 | - n, 

3 Identification <irherne 

3 1 Plant (LssuniptiorLs and purf)u\e\ Consider a linear 
time-invariant (I II) discrete-time plant 


i^=A^. d^=i}. forA-0, (2 7a) 

[f*! r 2 0 II o* 

[dj [- 2 cos(w*) ~ 2 sin(M'*) -/J* ftj 1 

for A; ^1,2, , L. (2 7b) 

Hf^{z) are bandpass filters with center frequency 

H *(2 ) = ' lor k =- 0, (2 Ha) 

* 1 ~ 2 cos (k'Jz ’-Hz 

for A- 1 . 2 , , (2 Kb) 

and ‘s a comb filter, 


yit)^Ciq)u(t)^d{t), (3 1) 

where v(r), u(t) and d(t) are plant oulpul, input and 
disturbances reflected at the oulpul, respectively (He/) is the 
pulse transfer function of the plant Assume that the plant is 
exponentially stable, ic the impulse response / *d. 

satisfies 

Ie(t)|-M„p;„ (ort-O (3 2) 

for s< 7 me ' 0 , and 0 " p^ ^ 1 This condition implies that 
the transfer function ff(z) is an.ilvlic in |z| pt, with no 
repeated poles at | 2 |-p,, and sup |(/(z)l' M^pUp p„), 
for some p f (/i,,. 1 ) 

The purpose is to estimate the freqiiencs responsL at f + 1 
(/ integer) arbitrary points e [ 0 , jt) with starling point 
- 0 'lo this end, define 


(1 - I ') n (1 -2 cos(m.,„)z ' t z 
//,(z)= --- (2 9) 

Therefore, given the frequency response at the interpolation 
points (2 3), the interpolation model can be obtained 
from (2 6)-(2 9) Some important properties of the 
interpiilation model are summan/ed in the lollowmg 
proposition, whose proof is given in Appendix A 

Proposition 1 1 Let C/(z) be analytic in |z| -p,,- " Po" 1^ 

and Cj‘iz) be the interpolation model given by (2 6 )-(2 9) 
Then 

(i) r/(z) IS an 1 degree polynomial m z ‘ 

(ii) (j^{z) interpolates C^fz) at z^ e^”* for A - 

0 , 1 , , 1 . i c 

C;'(zJ- C;(zJ. A - 0 . 1 2. ,N I (2 10) 
(in) The approximation error is bounded by 

||G'-C;||,^Mp,r, (2 11 ) 

where M ~ M{N, p, (j) ^ 0 grows linearly m V 
From (2 11) it follows that ||C/- f/IL-^0, as N^ For 
their later use, we introduce the normalized bandpass Idlers 
(i e their frequency response at the center frequency is one) 
as 

/y;(z)-A(z)//,(z), A- 0 . 1 , (2 I 2 a) 

with fi{z) being the normalizing lactors given by 

/*(z)-l, forA-O, (2 12b) 

fi(z) -- 2 fa'i - [or* cosficj- sin(ivj]z 

lorA-1, ,L (2 12l) 

It can be verified thai 

//;(z)- f^fz), lor A = 0 , 

- /a( 2 ) + ^{z), for A ^ 1 , 2 , , L, 

with /^(z) the interpolation filter defined m (2 2 ) 


a o. i / 

I'he identification problem is then lo estimate the parameters 
and from the input/outpul measurements The 
remaining works are, (I) lo design an idemliliLation scheme 
(i e an idenlilier structure and an estimation algorithm) ( 2 ) 
to choose an input with desired properties to achieve 
parameter eonvergenec 
3 2 Identifier 
Identifier structure 

We shall eonsider the identifier implemented by the 
Lagrange hller For this purpose, poslulale an inlcrpolalion 
model (see equations (2 fi)-(2 9)) 

It follows from (2 7) lhal the map {s ^ 

given by 

~ i 0. lor AO, (3 4a) 

1/2 OA + /f^elg(Kj /fiCSe(Kj 

[if,\ a^clglwj - ese (viA) II jJ ’ 

lor A 1,2. ,L (Mb) 

Therefore, the piobicm ol identifying the / + I points of the 
frequency res^Kinse boils down to eslimaling the paiameieis 
and of the interpolation model (3 1) 

Estimation algorithm 

We shall consider the normalized gradient algorithm lo 
estimate the parameters and d^ Let 

Ig ‘/aT. 

be the parameter vector for the band identifier A. anti 
Its estimate at time instant I, which is updated by 

«*(( + !) n , k 0 , 1 , ( 1 ^) 

where 

=•!«»(') “*('- 1 ) 1 '- ("*'’) 

u^U)^H,{q)uU). (’ 7 ) 
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Fig. 1. A two-dimensional band identifier. 


is the regressor, 


r*!') • = >«(') - V,((). 

(3.Ha) 

with 


.V*(<) - H*(r/)v(/), 

(3.Hb) 



(3,8c) 


i.s the estimation error, where H^q) are the normalized 
bandpass filters defined in ( 2 . 12 ) is a design parameter, 
0 < y^ <2. The identification scheme is depicted in Fig. 1 . 


Remark 3.1. The use of the normalized bandpass fillers in 
(3.Hb) is for obtaining the mainlobc error of the frequency 
response of //)!(</) - 1 tt» be zero at (see (B. 3 ) of 
Appendix B). 

Remark 3.2. In some cases, the impulse response may be of 
interest. Let the impulse response of 

which can be calculated, given the frequency rcsprmsc of 
Gfzj at N points w^, a.s follows: from (2.10) we have 

N \ 

V j(()e (3.9) 

Multiplying m both sides of (3.9) and summing it in k 
from 0 to - 1 , gives 

*•..•= 1 1 = V (3. 10 ) 

^ k .0 r-n 

where 

X "=-1 1+2 V cos(/n - f)H’* , 

m t = 0, 1_1. (3.11) 

[^,ml denote the matrix whose elements arc 
similarly, |^( 0 I the vectors who.se elements are 

g(t), respectively. Note that \h^,] is doubly symmetric 
(Toepliiz, and symmetric), with ~ 1, and 1, lor 

m^t, and is invertible if the iiitcipolation points are distinct. 
Therefore, |^(/)| can be obtained from (3.10) by 

VJ- 

For the case where the interpolation points are equally 
spaced around the unit circle, (gCOirin' lG(e'*^*)}J^^(,' 
form a pair of Discrete Fourier Series, and the calculation of 
(3.12) may be carried out using Fast Fourier translorm. 

4. Analysis 

The purpose of this section is to establish conditions on the 
input signal to achieve convergence of the parameter 
estimates. First, we define the richness of a .signal in terms of 
its energy in a frequency band. 

Definition 4.1. Let x(t) be a stationary signal with energy 
spectrum SJe'"*'), It is said that jr(t) is rich in the frequency 
band k, centered at w^. and bandwidth Iwj^, if for some 


positive constant 

Let the parameter error vector ^^(/) be defined as 

«,(/):-«,(t)-^*-[c,(t) d,U)y (4.1) 

rhe following theorem slates the main results, whose proof is 
in Appendix B. 

Theorem 4.1 Consider the identification .scheme (3.3)-(3.H). 
Suppose that the plant (3.1) satisfies (3.2). 

(J) If u(t) is rich in the frequency band k, then 
f:f*(/)—► 0 ^ := (61 e IR" I |f^| = 5 ^) exponentially, with - 
^a(^. Yk^ ^k) ^ positive constant, and the bound on the 
disturbances d ^(0 

(ii) If u{t) is a sine wave with frequency w = and 
^A( 0 -f^ fb^n ^^(/)—»0 exponentially. 

Remark 4.1. Notice that when the input .signal is rich in the 
frequency band A, the parameter estimate converges 


c^Cl) 


e 50 100 150 



50 100 150 


ii-(t) 

-0.5 

-1 *= 2 ^ 

0 50 100 ISO 

Fl(i. 2. Parameter estimates with u(l) given in (5.2) (a) 
r,i = 2, r, = r, = 0, (b) r, = 2, r„ = r, = 0, (c) r^ = 2, r„ = r, = 0. 
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to a neighborhood of its true value radius R ,,, whose 

magnitude depends on the signal interference and external 
disturbances (see (B.9)). In the absence of the signal 
interference and disturbances, this neighborhood reduces to 
a single point (true value). 

Remark 4.2. It is well known that when the input signal is 
not rich in a frequency band k, the parameter estimate in this 
band may diverge due to signal interferences and external 
disturbances. One solution to this is monitoring the richness 
of the input signal in each band, and tune the adaptation off 
in a frequency band when the signal to noise ratio is less then 
certain value (Tang and Fernandez, 1992). 

5. Simulation results 

The proposed identification scheme was used to identify 
the frequency response of the plant 

•^(4) = gi _ + 0.264<7 x 0.032 ’ 

at = 2jrkl5, for = 1, 2. The plant (5.1) has poles at 

{0.8,0.14±y0.14}. The purpose of the simulations is to 
illustrate the main features of the identification scheme. To 
this end, the input signal was chosen as 

“(0 - '*0 ''i sin (*Vif) + rj sin (w^r), (5.2) 

and the parameters 

N = 5, cr, = 0.99 (stability scalar, sec Remark 2.2), 

Yk - 0.5, Wi^ = 2nkl5, for k - 1.2. 

Alt initial conditions in the estimation law (3.5) were set 
zero. The true but unknown parameters are Co = 5.(K); 
c, = “ 1.06, d^ = 1.41; C 2 = —0.69, dj = —0.48. The behavior 
of the parameter estimates when u(r) is a sine wave is shown 
in Fig. 2. Note that the parameter estimates converge to 
their true values as predicted by (ii) of Theorem 4.1. The 
evolution of the parameter estimates when the input is a 
pseudorandom binary signal (PRBS) is shown in Fig. 3 Due 
to the signal interference, the parameter estimates converge 
to a neighborhood of the true values, as predicted by (i) of 
Theorem 4.1. 


c„(t) 


50 100 150 



6. Conclusions 

An identiheation for identifying a set of {L-¥ 1) arbitrary 
frequences of an LTl discrete-time system has been 
proposed. The identifier consists of L + 1 band-identifiers, 
each works in its corresponding frequency band giving an 
estimate of the frequency response at the center frequency of 
this band. It was shown that if the input signal is rich in a 
frequency band, then the parameters of the corresponding 
band will converge to a neighborhood of their true values. 
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Appendix A. Proof of Proposition 2. 1 

Proof of Proposition 2.1. Ihc first two points follow 
straight from the Lagrange interpolation formula (2,1) and 
(2 2). The proof of (iii) is similar to that of Geddes and 
Mason (1975), and is given in the following. 

Let 

{/ • C L I f is analytic in | 2 1 > p„, 0 < Po < 1) . 

(Al) 

'^/v = {polynomials of order - 1), (A2) 

and denote the Lagrange interpolation 

operation, 

G‘(z)^:J^{CI). (A3) 

According to (A3), and (3.9)-(3.12), the induced norm of 
is bounded by 

^ lie'll,. 

= sup .sup |6''(z)| 

-„S“P , E lj?-l^^'lll'>...,r 'III. (A'*) 
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where |H|, denotes the malrix one-norm. Therefore, 
||C-G'lL.s(l + |lif^,|U)dist(f;. 9^) 

+ ‘ll.lM,p,r, (A5) 

with 

W| -pii), (A6) 

and po, are given in (3.2), The luM inequalily in (A5) 
follows from 

disl(G, gi'^,):=min ||G-p|Us ^ ^ Mp.^d - p„). 

f - A/ 

The proof is completed with /W :=[l +yv ||[h„„] ‘||]A/,. 

Appendix B. Proof of Theorem 4 1 

It follows from (3.8b), (3.1), (2 12) and (2.8) that 

.Vt(<) = + r|i,(0. (Bl) 

where 

»)*(') = «r(9K(') + d*(f), </*(()-//;(</) d((), (B2) 

and H"(q) is given by 

W:(</)-A(</)|G(9)-G'(<?)1+/,(<,) ^ (d+d*(/ ') 

n-O 

n 

yH„(q) + (c^ + d,q ')[«;(<,)- 1|. (B3) 

Note that in (B3) the first term arises from the 
approximation error Ci’(^) - G'V/), the second term is due to 
sidelobe effect of the bandpass lilter H^[q), mi^k, in the 
frequency band /c, and the third term is mamlobc error 
//^( 2 )-l. The use ol the normalized bandpass filters in 
(3.8b) is now clear The frequency response of PTfiq) 
satisfies the following lemma 

Lemma B.l. For a number of interpolation N given, the 
frequency response of Hl\q) in (B3) satisfies 

(i) ||//J’|U- ^ with - M„,(N) 0 a constant. 

(ii) \fl'l^{z^)\ -0. with an interpolation point 

Proof, (i) It follows from (B3) that 

/ 

liw:il.‘-IIAll. IIG-g'ii. + iimi. V (KI +|d,|)||H„.lU 

rn-O 

m 

4-(lr,KldJ)||//;-l|U:-- MJN), 


because for an N given, all the quantities in the right-hand 
side are bounded. 

(ii) It follows that the frequency response of G{q)- G'(</), 
H„(q), for m ¥■ k, and - 1 are zero at = e'**'*. 


Proof of Theorem 4.1. From (3.5), (3.8) and (Bl) it follows 
that 


«A(r+l) = r,(/)S*(r) + v*(/). 


(B4) 


where 

r /A - = / - V V iri - V 

1 + 4>IU)M) 1 -f 0 a(O0a(O 


(B5) 


Resolving (B4) gives 


I- 1 

e*(0 = <b*(r. o)e*(o)+ 2 <*>*(', T+ i)v*(T), 


T=0 


(B6) 


wilh 4>„(i, T) the trun-silion matrix of (D4), 


<!**(», T):=n'r,(i). 


I r 


(B7) 


Since u*(/) is a narrow-band signal, it can be approximated 
by a sine wave with frequency Wf^. Hence, 0^(0 is 
persistently exciting of order 2, which insures the exponential 
stability of (B4) (Anderson et ai, 1986). So there exist 
A, >0, and |A^( < 1, such that 

t)|£A,.|A*|'-’, \fi^ I =>.(), (B8) 

which together with (B6) gives 

|A*rie*(0)i + A:, ||V*|| 2 KT 

T = (l 

sK,-|A*nM0)| + y*KrM;|3^', (B9) 

for some constant d^,) >0. This proves the first 

part of the theorem. 

(ii) If u(/) is a sine wave with frequency and d)t(0 ~ 
it follows from (B2) and Lemma B.l that r/jt(t) = 0. This 
implies that = 0 in (B9). Hcncc, the conclusion follows 
from (B9) 
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A Parametrization Approach to Optimal H2 and 
Hoo Decentralized Control Problems*! 

RANJIT A. DATE* and JOE H. CHOW§ 

Key Words Decentralized control; robust control; optimization; numerical methods. 


Ab^iitriicl —I he pupci propo.ses a two-siaj^e dcsiy;ii approach 
to optimal H. and H. decentralized conlrol problems In the 
firsi stage, an optimal centralized eonlrollei is computed. 
Then in the second stage, based on the optimization results 
lor cenlrrilized controllers, the parameter that decentralizes 
the controller i.s oplimi/ed. Phe design approach is applied 
to an observer-based decentralized eonlrollei. Optimality 
conditions are derived and examples are given to illustrate 
the design 

1. Introdiuiion 

This PAPr:H proposi s a iwo-stagc optimal //, and //, 
decentralized controllt'r design in which the first stage 
involves an optimal centralized design and the second stage 
involves the optimization of the parameters that decentralize 
the optimal centralized controller. The approach is motivated 
by the optimal //-. and //. controller parametrization results 
in Doyle et ai (1989). I'his is in contrast 10 most 
decentralized controller designs (Davison and Gesing, 1979; 
Davison and Fergus.son, 19H1; Geromel and Demusson, 
1982; Dskokovic and Medanic, 1985; Hernstein, 1987; Htar 
and Ozguner. 19H9; Veillette, 1990; Siljak. 1991) m which 
the gains of the controllers are optimized directly. In this 
paper, we select the concuncnl decentralized observer 
controller proposed in W-illellc (1990) and Dale and Chow 
(1989, 1991) to demonstrate the two-stage design approach, 
mid (o show the simplilications lor computing suboptimal 
controllers However, the design can be extended to other 
decent ra I ized pararnet nzalions 

We consider //, and //, decentralized conlrol design lor 
IwO'channcl systems. I he results here can readily be 
generalizeil to svslems with more than two ehannels. Iai the 
moLlel ol the ' generalized plant” be 


input-output form, system (1) can be represented as 

Ml Ml I I 

= I V, 1 = u, I * G(.s) ” , (2) 

LV:J Lm.J 

where the transfer function in packed matrix form is 


r 



B, 1 

c. 

0 


^^22 

c\ 

/>2U 

0 

0 

_c, 

Am 2 

0 

0 

" A 


B ^ 


C, 

0 

Dy, 


. c 


0 



The symbol denotes the stale space realization of a 

transfer function. 


Assumption 1. System (1) is assumed to be in the form of a 
“standard problem" (Doyle el al., 1989) that is 

(1) {A, B,,,) is .stabilizable and (z^, CV) is delectable, 

(2) (/4, B) is slabilizable and (v4, C ) is delectable, 

(.1) = D,|D[=/, 

(4) D[X\ = (). = 

A decentralized controller for system (1) has the structure 



0 l[vi 

0 AC,(i)Jl.v.. 




(4) 


1 his following assumption more than Fart 2 of Assumption 1 
will guarantee the existence of a decentralized controller with 
the structure (4) for system (1) (Wang and Davison, 1973). 


f - Ax -I- w 4 B^u^ f 
. - C,x f 4 /), mU,. 

y, -- 4 /)^|, vv, 

Vj ^ (.' vX 4 0,1 , B'. 

where x e is the slate variable, iv e represents the 
disturbance or other external signals, z c represent the 
error outputs, and r, f and y, c ' are the controlled 
input and measured output ol channel 1,1 — 1.7. In 

* Received 22 November 1991; revised 2.“' July 1992; 
received in final form 30 July 1992. The original version of 
this paper was not presented at any IFAC meeting. Ihis 
paper was recommended for publication in revised form by 
Associate Editor M. Ikeda under the direction ol Editor A. 
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FCS-K9|36ri7. 

Precision Automation and Robotics f’vt. l td., Pune, 
India. 

§ Electrical, Computer and System Faigineering Depart- 
rncnl, Ren.sselaer Polytechnic Institute, froy, NY 12180- 
U.S.A. 


Assumpiion 2. System (1) has no unstable decentralized 
lixed eigenvalues under the control .structure (4). 

Denote the closed-loop transfer function of the system 
using a deeeniializcd controller (4) by T^^^ {K^/{s)) = and 
the set of all decentralized stabilizing controllers by Phe 

decentralized optimal control problem is 

mill (5) 

Afy( » Jt A-,! 

and the decentralized /C suboptimal control problem is to 
hnd K,i{s) e .Kj such that 

II Mil <y. W 

where y is a pre-specilied constant. 

Since a centralized controller u -■ A,"(..)y will be designed in 
the first stage of the two-stage design, wc denote the 
centralized closed-loop transfer function by 7,,,(/C(3)) 

The paper is organized as follows. In Section 2, a 
decentralized controller parametrization is stated and the 
parametrization of decentralized controllers having the 
concurrent observer structure is derived. Section 3 contains 
the proposed AE design and Section 4 the H,, design. Design 
examples arc given in these sections. 
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2 A decentralized controller parametrization 

Several decentralized controller parametrizdtions based on 
stable factorization (Oilnde^ and Dcsocr, 1990, Mano- 
usiaouthakis, 1989, OzgUler, 1990, Date and Chow, 1991) 
are now available In Gunde§ and Desoer (1990) and Date 
and Chow (1991), the pararnetnzation uses a decentralized 
controller as the central controller in the Youla parametnza- 
tion (Francis, 1987) An optimal design would require a 
direct optimization on the central controller In Ozguler 
(1990) the pararnetnzation is based on a sequential feedback 
control of the decentralized channels The sequential design 
would naturally lead to an iterative optimization of 
controllers 

The pararnetnzation result in Manousiouthakis (1989) 
allows a centralized controller as the central controller and 
requires a parameter to decentralize the resulting controller 
Utdizing the and design results in Doyle et al (1989), 
a two stage design approach can be formulated In the hrst 
stage, an optimal (or suboptimal) centralized controller is 
obtained based on the formulas in Doyle et al (1989) In the 
second stage, the decentralizing parameter is optimized In 
optimal design, in contrast to the hnite order of optimal 
centralized controllers optimal dctcntrali/cd controllers for 
strongly connected systems (Siljak 1991) may be infinite 
order This is commonly known as the second guessing 
phenomenon (Sandell et al 1978) As a result, the 
decentralizing parameter is also infinite order In this papier, 
we will investigate a concurrent decentralized observer 
controller with a finite order decentralizing parameter 

The structure of the concurrent observer controller (Date 
and Chow, 1989) is given by 


where 




(A-BF-L.C, ^ 
-F, 0 



I 2 


(7) 

( 8 ) 


and A contains a model of A and F, and i = 1 2 have 
the interpretation of full state feedback gams and 
decentralized observer gains The controller (7) is said to be 
concurrent because the gams F, and I, r = 1, 2 are designed 
simultaneously and nut sequentially Note that Assumption 2 
IS only a necessary condition for the existence of stabilizing 
concurrent observer controller The structure of A for the IF 
and design problems will be specified later 
Consider the controller /C(5) parametrization shown in Fig 
1, where 


A-BF-LC L B 

-F 0 / (9) 

-r / 0 


Q^{s) e 3^ the set of stable proper transfer function 
matnees of appropnate dimensions and 

■ = L-(L, L,\ (10) 


For 0d(r) - 0 wc obtain the centralized central controller 






BF-LC ^ 

-F 0 


(H) 


In the two stage decentralized control design, the centralized 
gains F and Z are optimized over all stabilizing gains 



We denote by 

K^s) c 3K,}, (12) 

to be the set of all the stable parameters 
decentralizes the central controller and achieves the 
concurrent observer controller structure (7) Although the 
construction of a decentralizing parameter C^/(0 requires, in 
general, the solution to a Riccati equation of stable factor 
matrices (Manousiouthakis 1989) wc show that if the 
structure of the decentralized controller ACj(5^) is based on 
observers, then the form of the parameter (2,f(a) can be 
determined m closed form 


Theorem 1 fhc set of all concurrent decentralized controller 
(7) IS parametrized by 


where 


Af Bh 

1 

0 /I, 

L~L 

_F-F F 


C , 

0 

C - ' 

c. 



0 



0 




] 


A^-A-BF 


>1/ - 


A " B.F. 
- B,F, 


BJ 

A 


- I ( 


(H) 


(14) 


The proof of Theorem 1 is given in Appendix A Note that 
the closed loop stability of system (1) controlled by (7) is 
guaranteed \{ A = A and Q^{\)eJC that is Rc(A(A^))^ 0 
and Rc (A(/1/))-^ 0 arc sufficient to guarantee the closed 
loop stability This will be shown lor the H control problem 
in the next section In the FL control problem A=F A such 
that no longer guarantees stability unless 

IIGrf(^)IU (css than a certain bound 
A similar parameter CIjIa) can be obtained for other 
observer controller structures such as the sequential design 
in Gunde^ and Desoer (1990) 


1 A (wo stage H 2 decentralized controller design 

In this section we will investigate the dtsign problem 

min II / (I ^) 

for the system (1) using the concurrent dccenirali/cd 
observer controller K^^is) (7) 

The first stage of the decentralized controller design 
requires an optimal centralized design Using the 

standard notation (Doyle et at 1989) let the Hamiltonian 
matrix have the the form 


H- 


A 

-r"c 


-BB^ 

A^ 


(16) 


where {A B) is stabilizahle and {A, f ) is detectable Then H 
IS in the domain of Riccati Operator denoted by doin(Kic) 
and W - Ric (ZZ) -'ll solves the Riccati equation 

A^W ¥WA~WBB^V/ =0 (17) 


It follows from Assumption 1 that 




~C 

~A 


(IH) 


belong to dom (Ric) Define X 2 - Ric (FF) and K - Ric (Z^) 
For system (1), the unique optimal controller thal 
minimizes ||Z',,,||, is (Doyle et al 1989) 


where 


K^s) 



z. 

L-f 

0_ 


(19) 


A^^A-Bh-IA ( 20 ) 


Fio 1 Controller A^(j) parametnzatioQ 


Z=KC 
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Furthermore, the optimal centralized cost is 

e = min = \\T,JK^(s))\\,. (2i) 

The second stage of the design, consisting of optimizing 
over the set of decentralizing parameters is stated in the 
following theorem. 


Theorem 2 (optimality of concurrent decentralized observer 
controllers). 

(1) The set of all concurrent decentralized stabilizing 

observer controllers (7) is given by (13) where 

A=A,F and L are optimal centralized tcedback and 
observer gains (20), f],f\ are the feedback gains, and 
Li, Lj are the decentralized observer gains from (7). 

(2) If F and L arc chosen such that the resulting parameter 
0J(.s) achieves 

6?= min IIO^(.v)||j - lig,*(.s)ll 2 . (22) 

then the controller 


A -fiF -F,r, 

0 

i-, 

0 - 

0 

A - BF L.c. 

(1 


-f, 

0 

0 

0 

0 

-F 

0 

0 , 


(23) 

is the optimal decentralized observer coiUrollLi which 
minimizes (1.^) and achieves the optimal cost a'* - 


Proof. Part 1 of Theorem 2 follows from I hcorcm 1. Ihe 
stabilizing properly of the controller is shown in Appendix H 
Part 2 follows from Theorem 2 ol Doyle ct ul which 

when applied to the decentralizing parameter (2^y(.i). results 
in 

l|2'l.||,< a,. <24} 

if and only if 




(23) 

L1 


From Theorem 2, wc can interpret as the cost, ot 
decentralization, that is, the additional cost that will be 
incurred if a decentralized observer controller is used instead 
of the optimal centralized controller. 

To find the gams F and L, wc propose a state space 
optimization procedure. Dcline 



such that 


F^\F-F FI 





Then 


(26) 


(27) 


min ||(?,y(0llz ~ min (28) 

/ f 

From the properties of norms (Doyle et al.. 1^>K9) we 
obtain 


IIC,,(.T)ll^ = tr(FVf'), (29) 

where 


-^=A^P + PA + F^f=0, (-B) 

di 

If = F,i <Y,-Y,- Y]) + F,{Y,~Y,) 

dh, 

+ HUPiiY, - y,) + P,(Y, - yj) + P,( Y' - Y') 

+ f ,'(y, - y,) + pUyJ^-yj 4 fjyJ - y,)) 

= 0 , (34) 


3f-/v(y,+ i;-y,-vr) + F,(y.-y,) 


+ BlP,(Y, - y,) + p,(Y, - yj) + /\iYl- Yj) 
- Pl(Y, - y.) - pitYi - yo - p,{Yl - yj) 


- 0 , 


(35) 


- (/-r 4 Z', + P,)/., + {PlY. 4 P^Y, 4 /•,y,'^) 


X C'l - P.,L, =0, 


(36) 


ay' 


= (Pi 4 Pi 4 pjL, 4 (Pl y 4 plY. 4 p„y,.) 


X r( - P^L, -= 0 , 

where 

v, y. v,' 
y- y( y^ y . 
Y( Yl y„_ 


(37) 


p, p, p. 
Pj P, Ps 
_p( Pl p„_ 


(38) 


The Lagrangian J' is locally convex, and (32)-"(37) arc the 
necessary conditions for a local optimum. Equations 
(32)-(37) are coupled d(gebraic equations, and have no 
closed form solutions. These equations can be solved 
numerically using a steepest descent method such as the 
descent Andcrson-Moorc scheme (Makila and I'oivonen, 
1987) or a Newton’s method such as the feedback descent 
scheme (Bcsclcr et al , 1992). These techniques must be 
initiated by a stabilizing controller. In addition, the 
stabilizing property must he maintained during the iterative 
proce.ss, which can be ai^omplishcd by allowing stepsizes 
that preserve stability 

We note that although the decentralized observer 
controllers implement full slate feedback in , the optimal 
feedback gams F, and /s necessarily the same as 

those of the optimal centralized gains F\ and F,. This can be 
atiribuled to the increased observation cost due to 
decentralization which would affect the design of F, and 
However, if il is determined that setting F, - F, and F^ = F. 
docs not significantly increase the cost, the optimization (28) 
reduces to 


min 


L - I 


(39) 


which is a lower dimenjfronal optimization problem. A set of 
necessary conditions of dimensions lower than those of 
(32)-(37) can be derived 


AT F YA^ F LL' -0. (30) 

Wc cast the problem of minimizing (29) subject to (.30) as 
the minimization of the Lagrangian 

tr(FrF^)Ftr((AV+ TA" F LLA)P), (31) 

with respect to F and F, where P is the (symmetric) matrix of 
Lagrange mulipliers. Setting the partial derivatives of (31) 
with respect to T. P, Fj. F^, F,, and F; lo zero, wc obtain 

AYFYA'+ Li.' ^0. (321 


Exarr\pk. I'o illustrate the H, decentralized design, wc 
present an example of optimal decentralized control system 
in/Fig. 2 to achieve tracking ol a reference signal. Let the 
plant model be 

1 

n.v)< 0 0 (40) 

0 0 

Fo track slop input at r,, wc represent the inputs as a 
weighted disturbance, and the tracking error as an error 
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output The weighting tunctions W, and VV^ are chosen as found to be 



”-0 001 

10 

o' 


1 

0 

0 


0 

0 

0_ 


100 
0 lOOj 


(41) 


to approximate the step inputs at r,, and to penalize the 
tracking error more heavily than other error outputs 
The matrices for the generalized plant (1) are 



‘-1 


1 

0 


A - 

-1 


-2 

0 



0 


0 

-0 (Kil J 


' 1 

0 " 



r — 1 

b = 

0 

J) 



c = 

1 

0 


0 

0 

0 

o' 



0 

0 

0 

0 

, c 



u 

0 

0 



-0 

0- 




0, -= 

0 

1 

0 

u 

- 

a, 

f" 

Lo 


_0 

1_ 





0 1 


r-1 0 r 

0 -1 0 
0 0 0 
0 0 0 


0 

K 


(42) 


For this system the centralized Ih optimal cost is 4SH 19 
fhe H 2 optimal decentralized observer controller for 
system (1) with the data (42) is designed using the feedback 
descent algorithm (Besclcr cl al 1992) I he controller is 


Trorking ot thp 'itpp rommond 



Time (5) 

Fiti T Tracking response of H 2 optimal decentralized 
controller 


M' 


999 98 79)J^s + 0 9288) 

U T- 109 97)(s 4 98 712)(i -f 0 (K)27) 

0 


0 

-291 46(n 4 97 28)(^ f 1 4H) 

(\ -T Itn 86)(s 4 97 7)(s 4-0 (K)2)_ 


(41) 


and the optimal decentralized cost and the cost ot 
decentralization are, respectively 

)|/‘'^|| ^ 489 19 ^ 171 94 (44) 


Figure 1 illustrates the tracking of a step in channel I by the 
resulting control system Ihe perlormancc ol the centralized 
optimal controller is also shown lor comparison \Vc note 
that the tracking response with the decentralized ctmtrol 
closely approximates that ol the optimal centralized 
controller and there is no serious loss ol perlormancc 

It is ol interest to note that the deeentialized /, / are 
very close to the centralized /, / II wl impost f ^ /, and 
Fj - f 2 the cost and the cost ol decentralization athieved 
by the resulting controller match those of (44) to five 
significant digits 


4 A iHo-Ma^c 11, (Jci cninilized (onirollcr dc\if(n 

In this section we will investigate the deetniralized 
suboptimal control problem ol hnding a coneurrenl 
decentralized observer eontroller sueh that 

lir''jL r (4S) 


where y is a pre spccifiid eonstant As in the /T design the 
H, deeenirali/ed design also involves two stages— a 
ecnlrahzcd suboptimal design in (he lirst stage and the 
optimization of the decentralizing parameter m the 

second stage 

The suboptimal //, design in Doyle ct al (1989) will be 
used in the cenlrali/ed design From Assumption 1 there 
exists a y *0 such that 

^ Ay BB’ 

^ -e'e a' 

' / / (4h) 

A' y ( '( ~ C U 

' -b^b' a 

belong to dom(Ric) II X, - Ric (F/J - 0, K - Rie (/J -0 
and y\ where p( ) dencites the spectral radius, 

then a tontroller that achieves || / ^ ||, '^ y is 


where 






h 

0 


A. - A * y - Bh,-/..l (. 

B'X.. L.= y,C', Z,-(/-y ’KA",) 


(47) 

(4K) 
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(iivcn the centralized controller (47) achieving < y, 
the second stage of the design process is to soWrfor the 
decentralizing parameter having the property 


V- (49) 

Theorem 3 (suboptimality of concurrent decentralizer 
observer controllers). 

(!) The set of all concurrent decentralized observer 
controllers (7) is given by Q^{s) (13) where 

(50) 

F ^ and L are the suboptinial centralized 

feedback and observer gains (48), is given by (46), 
f ,,^2 fhc feedback gains, and L,,/I, are the 
decentralized observer gains from (7) 

(2) If F and L arc chosen such that the resulting parameter 
Qj(s) achieves 

IIQry('’)l|, < y, (51) 

then the ainlroller 


BF - L| ( 1 

0 


0 

ZiF - LX , 

0 

-F\ 

0 

0 

0 

-F, 

0 


(.S2 

is a suboptimal stabilizing decentralized observer 
controller satislying H * y. 


Proof. Part 1 of Theorem 3 follows from Theorem 1 Part 2 
follows from theorem 4 of Doyle el at. (1989), which also 
contains the result that the control A',y(,s) (52) is u stabilizing 
control if (51) is satisfied. □ 


consider M > 0 solutions since the uncontrollable and 
unobservable parts of Qft(s) for all selection of L can be 
eliminated. 

Let Z' y^M V T hen premulliplying (58) by Z and 
postmultiplying by M ’ and using the expression of A, from 
(14) wc obtain 

0= Z(A - llC)^ i{A- LC)Z + Y "ZF^FZ 

- ZC Y"’Z + (L - L + ZC^){L - L H ZC^)\ (59) 

In the absence of the decentralized constraint L 
diag (L,, L^), we could choose L ~ L-^ ZC^ to eliminate the 
last term of (59), and convert the resulting equation into a 
Lyapunov equation which has a direct solution. Under the 
decentralized constraint, only the diagonal blocks can be 
equated. Following an idea in Veillette (199(1) we let 

L, — L] -f- Z I, C 1, 7^2 - ^2 + Z 22 C. 2 , (bO) 

where 



Since Z is symmetric, (59) can further be written as 
0 = Z(A - Lf )' -f (zl - LC)Z 4 y ZF^FZ - ZC^CZ 


r " 

f., + z„rnr 0 

L,+z,xiy 

If., + Z„r,’ 

0 J _Fi + ^21 C’l 

0 1 


(62) 


From this derivation, we establish the following result. 

Theorem 4. Let the gains F,, L, and Z., be obtained from 
(48) for a given y and set = F^ and F 2 = Fj. If there exists a 
Z '•() satisfying (62), then the observer gains (60) guarantee 
that |lCf./(.')ILy< iiiid the decentralized observer-based 
controller (7) achieves y. 


Theorem 3 determines whether the solution to a 
decentialized control design exists toi a given y Theorem 3 
suggests an iterative two-stage procedure to compute the 
optimal decentralized problem lor the concurrent 

controller structure, that is, the minimum y admitting a 
decentralized .solution, At each stage ol the iterative process, 
the existence of a decentralized controller for u trial y is 
determined. 1’hen a bisection technique can he used to find 
the smallest y that admits a decentralized controller. Lhcn a 
cost of decentralization can be found as the difference 
between the minimum y for the centralized control and the 
decentralized control. 

We now propose a state space approach to find the gains F 
and /. to solve for (51). Defining 


A^ BF 
0 A, 


F = \F~F F\. L- 




(.M) 


such that 


(?W(' ) 


(54) 


the synthesis ol (7) subject to (51) requires the 

existence of W ^0 to satisfy the Riccati equation 

A^W -^WA + y -WLL'W ^ F'F (55) 

Instead of directly attempting to solve (55), wc propose a 
simplification by assuming F, F, and F; F where 



is the centralized suboptimal gain (48). As a result, ^^/(a) 
simplifies to 


Thus wc arc required to design L such that the Riccati 
equation 

AjM + MA, +y ^M(L- L)(L- L)'M + F^F = 0. (.‘) 8 ) 

admits a solution M. 2 O. Without loss of generality wc 


Theorem 4 provides a means to compute a decentralized 
controller for a specific y. Although (62) cannot be solved 
directly, an algorithm can be designed to iteratively solve 
for Z. To initiate the algorithm, assume Z = 0 and compute 
the last term ot (62). fhen solve (62) as a Riccati equation 
for Z. Update the last term of (62) and repeat the solution 
process until the difference between successive solutions of Z 
IS small. The solution process can proceed provided that 
Z -0. In contrast to the ZZ^ algorithm lor solving (32)--(37), 
the iterative solution ol (62) docs not require an initial 
stabilizing //. An equation similar to (62) has been derived in 
Veillette (1990) tor decentralized ZZ, control design. In 
Veillette (19'X)), the design equation was obtained based on 
the closed-liinp transfer function, and hence, contains terms 
different from those in (62). 


Example We present an ZZ, suboptimal controller design for 
attenuation of the measurement noise for the plant 


1 

I 

1 

0 

o' 

0 

-5 

0 

1 

0 

0 

0 

-10 

0 

1 

1 

1 

0 

0 

~0 

1 

0 

1 

0 

0_ 


To weigh the plant output errors more heavily than the 
control signal, we weigh the error signals by 50. 
fhe matrices lor the generalized plant (1) are 




1 


r 



^0 

0~ 


A = 

0 

-5 


0 

, B 

- 

1 

0 

. c 


0 

0 

“ 

1^_ 



_0 

l_ 







r50 

so 

0- 


0 0 

0 

o" 





0 

50 




50 


1 0 

0 

0 




0 

0 

0 






{] 1 

0 

0_ 




0 

0 

0. 








1 

0 


:]■ 


n,, 


-(I ()- 

(10 ro 0 I 0 

1 0 ' ‘'“Lo 0 0 1 

.0 I. 


(64) 
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Wc impose a bound of y=l, implying that for 
disturbances with a unit magnitude, the output will be 
guaranteed to be less than y/50 = 0.02. The controller 
designed using the proposed iterative algorithm to solve (62) 
is 


KAs} = 


_-a86J_ 

(s + 890“l2)(y 295 68)(j i 

0 


0 


{s T 890. 12)(5 t 295.68)(.y 4 443) J 

Figure 4 verifies that the design objectives have been 
satisfied, resulting in an attenuation of 10 ^ (60 dB). 

5. Conclusions 

A two-stage design approach to optimal AT and 
decentrahzed control problems has been proposed and 
applied to the design of concurrent decentralized observer 
controllers. The first stage design is a standard optimal 
centralized controller design. Tlic second stage involves the 
optimization of the decentralizing parameter and is, m 
general, non-trivial for high-dimensional systems. The 
paramctrization approach offers some simplification for the 
computation of suboptimal decentralized controllers. 

Further work in this area includes investigating the optimal 
H 2 and //* decentralized control problems using other 
decentralized controller parametrize tions. One of the 
possible parametrizations is contained in Date (1991) where 
a decentralized controller is used as the central controller in 
the parametrization. It will be of interest to sec whether 
simplified design algorithms can be developed. 

Acknowledgements —The authors wish to thank Dr Dean 
Minto, Dr R. Ravi and Dr Bob Veillcllc for helpful 
discussions. 

References 

Bernstein, D. S. (1987). Sequential design of decentralized 
dynamic compensators using optimal projection equa¬ 
tions. Int. J. Control, 46, 1569-1577 
Beseler, J., J. H. Chow and K. D- Minto (1992). A feedback 
descent method for solving constrained LOG control 
problems. Proc. 1992 ACC, pp. 1044-1048. 


Date, R A (1991) Decentralized control, stable factoriza¬ 
tion approach to parametrization and design. Ph D thesis, 
Electrical, Computer and Systems Engineering Depart¬ 
ment, Rensselaer Polytechnic Institute, Troy, New York, 

Date, R. A and .1 H Chow (1989) A reliable coordinated 
decentralized control design AVoc of 1989 CDC. pp 
1295-1300. 

Date, R A and J. H. Chow (1991) Decentralized stable 
factors and parametrization of decentralized controllers 
Proc 1991 ACC, pp. 904-909 

Davison, E. J and W Gesing (1979) Sequential stability 
and optimization of large scale decentralized control 
systems Automatica, 15, ]M)7-324 

Davison, E. J and I J. Fergusson (1981) The design of 
controllers for robust .servomechanism problem using 
parameter optimization method IEEE Trans Aut 
Control, AC-26, 93-1 U). 

Doyle, J C , K. Glover, P. P. Khargonekar and B A 
Francis (1989). wStale space solution to standard and 
control problem. IEEE Trans Aut ( ontrol, 34, 831-846 

Francis, B. (1987). A course in Tf^ control theory. Lecture 
Notes in Control and Information Sciences, 88. Springer- 
Verlag, Berlin 

Geromel, J. and J. Bernus.sou (1982). Optimal decentralized 
control of dynamic systems Automatica, 18, 545-557 

Giindes, A. N and C. A Dcsoer (1990). Algebraic theory of 
linear feedback systems with full and decentralized 
compen.sators. Lecture Notes in Control and Information 
Sciences, 142. Springer-Vcrlag, Berlin. 

Iftar, A. and 0 . Ozgiincr (1989). An optimal control 
approach to the decentralized robust servomechanism 
problem. IElEE Trans Aut. Control, AC-34, 1268-1271. 

Makilii, P. M. and H. T. Toivonen (1987). Computational 
methods for parametric LO problems—a survey. lEEEi 
Trans. Aut, Control, AC-32, 658-671. 

Manousiouthakis, V. (1989) On paramctrization of all 
decentralized stabilizing controllers. Proc. 1989 ACC, pp 
2108-2111. 

Ozgiilcr, A. B. (1990). Decentralized control: A stable 
proper fractional approach. IEEE Trans. Aut. Control, 
AC-35, 1109-1117. 

Sandell, N. R., P Varaiya, M. Athans and M. G. Safonov 
(1978). Survey of decentralized control methods for large 
scale systems. /£E£ Trans. Aut. Control, AC-23, 
108-128. 

Siljak, D. D. (1991). Decentralized Control of Complex 
Systems. Academic Press, New York. 

Wang, S. H. and E. J, Davison (1973). On stabilization of 



Brief Paper 


463 


decentralized control systems. IEEE Trans. Aut Control 
AC-18, 473-478. 

Uskokovic, Z. and J. Medanic (1985) Sequential design of 
decentralized low order dynamic regulators Proc 1985 
CDC, pp. 837-842. 

Veillette, R. J. (1990). Reliable control of decentralized 
systems: an ARE-based approach. Ph.D. thesis. 
Department of Electrical Engineering, Univ. of Illinois! 
Urbana-Champaign, IL. 

Appendix A 

Proof of Theorem 1 Putting (7,y(,s) (13) into MU) (9), we 
obtain the controller realization 


KU)^ 


where 


VA 


A^ = 


BF-LC BF 
-LC A 
-LC^L^C^ 0 

+ 0 

A-B,F,-I.,(\ 





0 

- BF 


BF 



(Al) 


B„, - 




r L, 

Ly - L| 


R.F^ 

B,F, 

A - ByFy- Lc 
1 
/z 

l-L 


-Fy Fy-Fy Fy 0 
-F. 1) 


(A2) 


(A3) 


(A4) 


Performing a similarity translormation with 




r/ -7 0 0” 

0 I 0 0 
0 -/ I 0 
0 -/ (I 7, 


(A5) 


we obtain 


TAT^^ 


VA BF 
-LC 

L,(\ 

L 7:,c, 


0 

A-BF 

0 

0 

0 

0 

BF~ LyCy 
0 




0 

7^. 

-Ly 

0 


0 

0 

0 

A -BF-L^Cy, 

0 1 
Ly 
0 

-4J 


(A6) 


(A7) 


Note that the eigenvalues of A - BF are not controllable 
and those of A - BF are not observable. The elimination of 
these eigenvalues reduces the controller (Al) to (7). 

Appendix B 

Proof of the stabilizing property of the decentralized 
observer controller in Tlieorem 2. 

With A-A, the closed-loop poles of the system (1) 
controlled by the decentralized observer controller (7) arc 
the eigenvalues of the matrix 


-ByFy 


_LyCy 0 A 

Performing a similarity transformation with 
/ 0 0 


-77,F, 

0 

BF - LC\ 


T = 


-7 7 0 
-7 0 7 


we obtain 


7;A. T, 


L 0 




(Bl) 


(B2) 


(B3) 


which has stable eigenvalues if ( 2 , 7 (^) e 
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Abstract —This note contains an investigation of the pole 
placement problem in interconnected systems (A, flj), when 
local, static state-vector feedbacks are applied It is examined 
only for systems consisting ol two interconnected subsystems, 
and it is shown that it has a solution under the very general 
assumptions of the global and the local controllability, when 
additionally system matrix A has a particular structure. It is 
also proven that, with the present approach, at least one 
eigenvalue of the closed-loop system has to be real, and 
satisfy a certain condition. 

I Introduction 

In mis NOTI the decentralized pole placement problem with 
static feedback of the local state vectors in an 
interconnected system is considered. It is examined lor an 
interconnected (global) system consisting of two intercon¬ 
nected (local) scalar subsystems, under the very general 
assumptions of the global and the local controllability Only 
the case of two interconnected systems is examined, since 
only then will the global system have no decentrali/.ed hxed 
modes (Wang and Davison, 1974, Davison and Ozguner, 
iyH3; Anderson and dements, 1979; C alogianms and Fessas, 
19K2; Fessas, 1982) It is shown that this problem admits a 
solution, up to the restriction that at least one of the 
eigenvalues of the closed-loop system is real, when, 
additionally, the system matrix is of a particular structure 
(corresponding to the rank [D j -= 1 case; the notation used is 
explained below). It is also indicated that, when the system 
matrix has no particular structure, the same result holds. A 
more general result concerning only the stabilizabililv with 
static feedbacks of the local state-vectors of a system 
consisting of two interconnected subsystems was earliei 
proven (Fessas, 1987). Ihc mam contribution ol the present 
paper is to show—additionally—that with such a Icedback 
(n - 1) eigenvalues ol the nth ordci mtcrcfinnected system 
can be arbitrarily placed. It is obvious that the present result 
includes the former one. 

2. Vrcliminarics 

2.1. Form of system (A, 13,,). We con.sidcr the 
interconnected system (A. /^j) dclined by 

a - A.X 4 H,,u, (- 0 

where x is the n-dimcnsional state of (A./fjj),w is its 
two-dimensional input vector, A is the n by n .system matrix, 
and is its n x 2 input matrix. Matrices A and W,, admit the 


following partitioning 


All A ,2 

A,, A.. 


I I I 

"‘I., J 


( 2 . 2 ) 


with n-n^ + n.. System (A, fl,,) consists of the intercon¬ 
nected n,-dimensional subsystems (A„, ^„) - i - 1, 2 - of 
local slate-vectors x^ and with x ~|x| A 2 |\ m, and 

being, respectively the scalar inputs of these subsystems, with 
u = lu, u,]'. Without lo,ss of generality (Davi.son and 
Ozguner, 1983), we a.ssume that these two inputs arc scalar. 
Wc further assume that the global .system (A, Bj), as well 
as its two sub.systcms (A„, /?„)—i — T 2—are controllable. In 
that case, subsystems (A„, h„) are supposed to be in their 
companion controllable form (Kailath, 1980): 


0 1 


(2.3) 


where u,', denotes the last row elements of A„. When 
(A„, are m the above form, the n, x submatrices A,^ 
ii^i) of A assume no particular form: for them we use the 
notation: 

where a', denoles the last row elements of A,^. and A" the 
others. It is obvious that when the various submatriccs of 
(A. are m the above form, system (A, B^j) is as: 


1 

1 




1 





0 1 


^21 


022 

1 


* Received 31 (October 1991; revised 14 June 1992, 
received in final form ^ September 1992. Ihe original 
version of this paper was not presented at any IFAC 
meeting. This paper was recommemled for publication in 
revised form by Associate Editor T Baijar under the 
direction of Editor A. P. Sage. 
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Finally, with the elements of rows n, and n, /i,(=^n) of A, 
we form matrix A,„: 



2.2. The intercontrollability matrix D(s) and iLK kernel 
U(T). The following (n - 2) x n polynomial matrix is the 
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intercontroUabihtv matrix at system (A B,i) 
s -1 




D(s) 


f -1 


(2 6 ) 


> i -I 




V -1| 


As Ihe following lemma indicafes />(r) expresses the 
conditions lor the controllability of (A B^) 


Lemma 2 1 (Cdlogiannis and Fessas 1982) System (A ,) 
IS controllable if and only if rank Z)(i)-^“2 Thus the 
matrix 0(5) of a controllable system (A Bj) is a full lank 
matrix Its kernel (/(j) is an n x 2 polynomial matrix of rank 
2 such that 0(00(5)-0 

The analytical determination of 0(0 is as tolhws P is the 
matrix representing the column permutations of matrix 0(0 
which brings it to the form of Ihe matrix pencil 0(0 with 

O(0-O(^)f^ ls/„ ^ /l(0 (2 7) 

In (2 7) (j IS an (n - 2) x 2 (constant) matrix Lonsisting ol 
columns n, and n, + n-, - Ai of Z^(0 f is an (n 2 ) x (n 2 ) 
constant matrix and /,, . is the unity matrix ol order m - 2 
Since 0 ( 5 ) is a full rank matnx the pair (f CO is 
controllable In case G has the property rank[Cf|~ 1 with 
E,gi + F 2 g 2 - ’^>mc non zero numbers fj and f ^ f must 

be a cyclic matrix with the characteristic polynomial X(s) 
The pair (F C#) is transformed into its companion 
controllable form by the similarity transformation 7 The 
polynomial vector ^(>) = [ly s"* is the structurt 
operator of (f C7) The form of f/(s) is the content ol the 
following lemma 


Lemma 2 2 L^t D{\) be the intercontrollability matrix of 
(A J9<i) as in (2 1) with rank (6) = 1 for G as m (2 7) Then 
the kernel C/(5) of D(i) is equal to 


7’S, is) n 

lJis)-P -Xis) F, 

0 t 


(2 8 ) 


where P f (\) \(s) 1 , and f arc as prcviousK 

explained 


3 Mam development 

Consider the interconnected system (A B ,) with \ and 
flj as m (2 2) and (2 3) In that case thi corresponding 
differential equation in the stale space is 

x(/)-Ax(f) f fl,,fi(f) (3 1) 


In the operator domain to this equation corresponds the 
equation 


(v/ - A)x(0 = B,fUis) 
which in Its turn reduces to the equations 
7)(^)x(^)-^) 
and 

(fZ - A„,)x(0 -u(¥) 


(3 2) 

(3 3a) 
(3 3b) 


D(r) IS as in (2 6) Z is a 2 x n (constant) matrix of the 
form t -diagfcif^} the n, dimensional vector t, being 
equal to e, - [0 (J 11 —for i - 1 2—and is the matrix 

defined m (2 S) From (3 3a) it follows that i(s) must satisfy 
the relation 


xG)^UG)^is) (3 4) 

where f7(r) is the kernel of D(s) and 5(0 is any 
two-dimensional vector It follows that 5(0 must satisfy the 
equation 


nwtri* of the inteaonnected »y4letii (A, £IJ (Fessai, 198h) 
and IS defined hv the relation 

M(f)-0E-A„ms) (3 6) 

The three systems dchned, respectively (i) in the slate-space 
by the pair of matrices (A Bj. (ii) in the operator domain 
by fij) (ill) by the polvnomial matrix description 

(PMD) 

M(P)^(l)^u(l) (3 7a) 

x(l) - U(D)m (3 7b) 

are equivalent (Wolovich 1974, Kailath 1980, Chen, 1984) 

It IS noted that in ( W) 5(0 is the pseudo-state vector of the 
system and is related to the state vector x(/) of system 
(A B^i) by the relation 

Ji(0-C(Z))5(0 (3 8) 

(in (3 7) (3 8) the symbol D denotes the differential 

operator d/dr) 

Wc now consider the local feedbacks and 

u -/c>,x^ or u- with - diag (A'l,/C^s) This 

decentralized control applied to system (A fl^,) cor 
responds to the control wfs) = A ,(/(0^(0 applied to the 
system defined by the PMD (3 7) The resulting closed loop 
system matrix is then 

M,is) = i,F -A,„-K^)Vis) (3 9) 

We assume that the leedback matrix Aj has the specific form 



(3 10) 


where o, n,,) /f, art some real numbers (the 

feedback coefficients) Without loss of generality we assume 
that A„, 0 (sec also tht remark after the end of the proof) 

I hen M^^{\) is given by 

U \ 

P) -Pn I '^-Pn 

1,(0 


ft( \) (s nr„|) V(s) F|(s - a„,) 
/i(i ) rT(\ “ Pn^) 

(0 

~Xis) 

0 

(^ li) 

where r(s)- ZS„ A') = (/)(') (Ol scalar jxilv 

nomials {r,(i) (01 being prime not monic and 

ft(0~|n', tif„i I 0 filr(0 12**) 

Pis)^\i} 0 P, ,|t(0 (^ 12b) 

Obviously nr(\) and P{\) arc ol (maximum) degree n-3 
llie determinant of the matrix m (3 11) is the closed loop 
characteristic polynomial It can be (almost) arbitrarily 
assigned by suitable choice of the feedback coefficients 
rr, Of,, and /f, /J,, Ihc result is formulated as the 

following proposition 


Proposition 3 1 C onsidcr the interconnected system 
(A Zf,,) consisting of the subsystems as m (2 2)-(2 3) and 
suppose that rank \G] - 1 with (Z (/) as in (2 7) Then its 
n J eigenvalues A,A> , can be arbitrarily assigned by 

the decenlrali/cd conirol u - A,,x while the nth one—A„ 

n \ n -> 

must satisfy the relation A„ - J] A, - J] p, wheic 

r- I ' 

M/i 2 eigenvalues of Z 


A7(s)5(i) - u(0 (3 S) ProoJ In the matrix in (T 11) we subsiracl the second row 

multiplied by the coefficient tjtf from the first and wc 
The matrix M(v) appearing in (1 “i) iv termed Lharattermin assume that n,,, = /?„,(- K) In that case the matrix as in 
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(3 11) is equal to; 

I- -Pi^) h(,-K)\- 

Without loss of generality, Ej may be taken to equal 
(Ej* -1). The determinant of Af^Cr) is a monk polynomial 
of degree n, of the form: det Mj(.y) =/i(j)(.s -/t) The 
polynomial h(s) is of degree n - 1, and is equal to 
h(s) - a(i) + e,P(s) + (y - A:)Ar(y). Let 


h(-t) (s Aj) • • ’ (y i) ~ y" ' + fi(| 


(3.13) 


be this polynomial, corresponding to the eigenvalues 
A, and let A"(y) y" ^+ ^ 

/!,) ■ • ■ (j - 2 ). (31‘1) be the characteristic polynomial of 

system (f. C), of eigenvalues q,.The rest of the 

proof is as follows: 

• choose K == the nth closed-loop eigenvalue, as 

where jc,, is given by (3.14), and h., is given by 

(3.13); 

• compute the polynomial [h(s) - (j - /C)A'(,y)l. It is noted 

that this polynomial is not monic, and of degree n - 3, by the 
previous choice of K. Finally, determine the feedback 
parameters or, ■ ■■ a^^ ,, ^ by equating the 

polynomial ar(i)-h to the polynomial 

K)X(s)\. 

Thus, the closed loop characteristic polynomial of (/t. B^) 
is del M^is) = h(i;){s-k), and has A, j, X as the 

closed loop eigenvalues. It is also noted that since 

n 2 n I 

~ S ‘ind “ }] A the nth eigenvalue 

f “> 1 / -) 

A„ = A' = = P „2 iJ» related to olhci eigenvalues by; 


n 2 n - I 

/-I - I 


O.E D 


Remark. When the prtKif follows exactly the same 

lines, with the difference that in the polynomials 0 ( 5 ) and 
PU) (as in (3.12)) appear now the various elements of 
except the elements (n,, n,), (n, ni), (n,, n) and (n, n) of 
which appear at the last two columns of matrix 
(sE-A^-KJP. 

Example. Let {A, B^^) be the four-dimensional system 


A = 


-0 

1 

i 

1- 


-0 

0“ 

0 

0 

0 

0 

. fl.,= 

1 

0 

0 

1 

: 0 

1 

Y) 

0 

_0 

0 

0 

0 _ 


_o 

1 _ 


The intcrcontrollability matrix /)(j') is then 

r.v -1 -1 - 1 ] 

« . utid 

0 — 1 5 ~ 1 J 

f)(y) = 0 (y)/’- 


D(S): 


-1 - 1 

-1 -1 


with P = 


1 0 0 ()■ 

0 0 10 

0 10 0 

.0 0 0 1 


It follows that E 


0 1 
0 0 


and 




-1 - 

-1 




The transformation matrix T is 7' ^ 


-1 -1 

0 -1 


the kernel 


f/(j) is 



" -5-1 

0“ 

l/(y) = /> 

-s 

0 

-5 

1 


^ 0 

-l_ 

With F, = 


Or(5) 


and Af(i) 


y-l 

I. 0 -.vj 


I. " j 

^(j) —[0 ^,] ^ ^^ 5 , the matrix in (3.11) is 

which is equiv- 


alcnt to: 


or ,5 ftj -I- (s 




(‘V - (X 2 ) 1 

-(^v-^,)J 

or, + (a, F pi)s + (.V - k)s^ 


{} 

L P^s 

with or, - /Iv - A. Then, h{s) = Qr(j) + ^(.v) -I- {s - k)X{s) - 
[-o'.j-or,A)j"l, and det Af^Cv)/i(,0(.r ~ A'). 
Obviously, A(.v) = 5 ^, Xj, = 0, and |u,~^t=^0 . The polyno¬ 
mial /i(i;) is chosen equal to: /i(.v) = (j -f 1)' ^ -I- 2s‘ + is + 

1. i.e. /i„ = 3--(A, + A 2 + A,), with A,--1 (i-l,2. 3). 
The corresponding feedback matrix A,, is then 


r-1 -3 0 0] 

0 0 -2 -3/ 


4. Conclusions 

In this paper the problem of the pole placement with local 
static feedback has been considered. The problem has been 
examined only for two interconnected systems, and only for 
a special structure of system matrices (corresponding to the 
rank[G|=l case). Only n-[ poles can be arbitrarily 
placed, while the nth one must be real, and satisfy a certain 
relation. Obviously, this is the price lo be paid for the 
assumption that two feedback coefficients are taken to be 
equal; an assumption which in fact transforms a nonlinear 
problem (a.s is the pole placement problem), into a linear 
one. It is believed that, in the general case (rank [G] 2), all 

n eigenvalues can be arbitrarily placed with the n feedback 
coefficients. However, the formal proof of this statement 
remains open. 
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Robust Eigenstructure Assignment via 
Dynamical Compensators*! 
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Abstract —Based on a proposed complete parumetne 
approach for eigenstructure assignment in multivariable 
linear systems via dynamical compensators, insightful 
parametrizations of the closed loop eigenvalues sensitivities 
to the perturbed elements in the open loop system matrices 
arc obtained, and an effective algorithm for eigenvalue 
assignment with minimum sensitivity in multivariable linear 
systems via dynamical compensators is then proposed. The 
algorithm does not contain ‘going back’ procedures, and 
allows the closed loop eigenvalues to be conveniently 
optimized within desired regions A ruimcncai example 
demonstrates its effect, simplicity and numerical property. 

1. Introduction 

EicihNVALUh ASSiCiNMtN'i with minimum sensitivity m 
multivariable linear systems is an important problem iii the 
field robust control, and has now attracted much 
attention Goursharankar and Ramai ('avm and 

Bhuttacharyya (1983), Kaulsky a ai (19K5), Sun (19K7), Chu 
et al (19H4), Chen (19KK), Byers (19H9) and Duan (l992a) 
have considered the problem of minimi/ing eigenvalue 
sensitivity to model parameter variations in all the elements 
of the closed loop state matrix, while Gourshaiankar and 
Ramar (1976), Maiming Qiu and Ciourishankar (19S4), 
Song-Jiao Shi and Yuc-Yun Wang (1985), Mickle ct al. 
(1985) and Owens and O'Reilly (1989) have considered the 
problem of minimizing eigenvalue sensitivity to model 
parameter variations in some, but not all, of the e'cmenls of 
the open loop system matrices. Due to the fact that most ol 
the practical systems often pOvSse.ss .special structures and, the 
parameter perturbations arc usually sfx'cially structured, the 
problem of minimizing eigenvalue sensitivity to nuidcl 
parameter variations in part of the cfcmcnis of the open 
system matrices is more prclcrablc than that ot minimizing 
eigenvalue sensitivity to model parameter variation in all the 
elements of the closed loop stale matrix in certain sense 
Existing solutions to the problem of eigenvalue assignment 
with minimum sensitivity in multivariable linear systems al 
pre.seni are mostly restricted to the ease o( state feedback 
(Goursharankar and Ramar, 1976; Cavin and Bhattacharyya, 
1983; Kautsky et al, 1985; Sun, 1987; Maiming Qiu and 
Oourishankar, 1984; Song-Jiao Shi and Yue-Yun Wang, 
1985; Miekle et al, 1985; Owens and O'Reilly, 1989) and 
output feedback (Chu et al, 19H4; Chen, 1988; Duan, 1992a) 
and are subject to the following limitations. 

(1) Most of the existing solutions, except Owens and 
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O'Reilly (1989) and Duan (1992a) are based on such 
eigenstructure assignment results with which only implicit 
relationship between the system matrices, the closed loop 
eigenspcctrum and the design freedom can be established, 
and thus the robustness indexes as well as the finally 
converted optimization problems can not be arranged in 
forms explicitly expressed by (he design free parameters to 
be optimized. I'his con.sequently results in algorithms that 
contain “going back” procedures. Such procedur.:s not only 
add more computational burden to complement of the 
algorithm, but also may lead solutions to the problem far 
from optimal. 

(2) Most of the existing solutions except Owens and 
O'Reilly (1989) and Duan (1992a) arc, not al least in theory, 
proven to have utilized all the freedom existed in the design 
proce.s.s. Therefore, no conclusion can be drawn about their 
optimality. 

,3) Most of the existing solutions (except Duan 1992a) do 
not include the closed loop eigenvalues as optimizing 
parameters. While as a matter of fact, specific pcifuimance 
of a .system allow's the closed loop poles to be located in a 
certain region, but not necessarily at a lew specified points. 
Since closed loop eigenvalues ollen appear in the robustness 
indexes with higher nonlinearity, proper small changes in the 
closed loop eigenvalues may signilicanlly improve the 
robustness of the designed system (sec our numerical 
example). 

In this paper, we examine the problem of eigenvalue 
assignment with minimum sensitivity in multivariable linear 
systems via dynamical compensators. Since the closed loop 
system state matrix of a dynamical compensator controlled 
.system is strongly structured, we restrict our attention to 
minimize eigenvalue sensitivity to model parameter varia¬ 
tions in part of the elements ot the open loop .system 
matrices. We solve the problem based on a presented 
parametric eigenstructure assignment approach. I'or eigen¬ 
structure as.signmcnt in multivariable linear systems using 
dynamical compensators, Sambandan and C handrasekharan 
(1981) and Han (1989) have developed non-parametric 
approaches, and Hippe and O'Reilly (1987) has presented a 
parametric approach for the case that the closed loop 
eigenvalues are distinct and are dilTeienl from the open loop 
ones, and .still the last one must be real, In this paper, 
another parametiic approach to eigenstructure asignment via 
dynamical compensators is proposed based on an output 
feedback eigenstructure a.ssignment result in Duan (1992a) 
(also refer to Duan et al, 1991) which overcomes the few 
drawbacks in Hippe and O'Reilly (1987) (sec Remark 3.2) 
and establishes the complete parametric iLprc.sentalions for 
the closed loop eigenvectors and the compensator coefficient 
matrices with respect to the closed loop poles and four 
groups of partially free parameter vectors. Utilizing this 
paramelric eigenstructure assignment result, and with the 
help of the Heilman-Feynman I'henrem (Stephen, 1988) 
insightful parametrizations of the closed loop eigenvalue 
sensitivities to the perturbed elements in the open loop 
sysicm matrices are obtained, and an effective algorithm for 
iilsensitive eigenvalue a.ssignment in multivariable linear 
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systems via dynamical compensators is then proposed Due 
to the simplicity, explicity and the completeness of the 
proposed eigenstructure assignment approach, the obtained 
algorithm possesses several properties ( 1 ) It is simple 
numencally stable and requires less computational work ( 2 ) 
It does not contain ''going back” procedures since the 
optimization problem contained in the algorithm is in explicit 
forms with respect to the optimized parameters ( 1 ) Its 
optimality is completely determined by the optimality of the 
solution to the optimization problem (4) It gives systems 
with better robustness since closed loop eigenvalues can be 
conveniently included in the optimizing parameters and 
optimized within some desired specific regions A numerical 
example demonstrates the above advantages 

2 Problem formulation 
Consider the following linear system 

x = [A + ^A\x + \B-^^B]u v = [C + AC|i (la) 

where x e /?" V e (r m n) are respectively the 

state vector, the input vector and output vector ABC are 
known real matrices of appropriate dimensions and B ( arc 
of full rank, AA Ai? and AC are system parameter 
perturbations in the following form 

/ / / 

^A='^A.et ^R='ZB,€, 

t\ »= 1 I 1 

here A, B, and C, i — 1 2 I arc known real matrices of 
proper dimensions, t, t = I 2 /are small perturbation 
vanables 

When the following dynamical compensator 


Theorem I Let [A B] be controllable [A C] be observable, 
A/(j) /)(j) H{s) and / (i) be matrix polynomials satisfying 
the right factorizations (S) and ( 6 ), and s, i ~ i, 2 n + 
p be a group of self conjugate complex numbers (not 
necessarily distinct) 

( 1 ) There exist real matrices K, . i y ™ 1 2 of proper 
dimensions and matrices T V c C (,uch that 


A,-V\r‘ T'V-l 

(7) 

A-didg|V|t 

(H) 


if and only if there exist parameter vectors e C ^ ^ C"' 

/,, e *~1 2 np satisfying the lollowing 

constraints 

(dl) 4 s„ = g., if 1 - r, y = 1 2 

((72) glH^(s,)N{sX, + ^ - y - I 2 n+p 

where is the Kroncckcr function 

(2) When constraints (C I) and (C2) art met all the matrices 
T V satisfying (7)-(8) are given by 


V 

F, J 1 G| J 

C^) 

Ki-(v,„ v„ 


(lOa) 

7(1 = ((()! (||’ 

bi(fn Uh~ 

(10b) 

F, l/„ 

/. 

(11a) 


1^12 ^ 1 (^ ) 1 

(11b) 


While the corresponding real matrices i / 
either given by 


z - K 22 ^u = K^^y-^K^z ( 2 ) 

IS applied to system ( 1 ) the closed loop system is obtained in 
the following form 


with 

^ \A+BKuC 


('!) 

(4a) 


A/f, 


AA + BK,, AC -K ^BKuC ^ ^BK^ 
K., AC 


AC 


AflX, 1 

U J 

(4b) 


and zeR^ X,, /,/ = !, 2 arc real matrices of proper 

dimensions 

Noting the fact that nondefective matrices possess better 
robustness than defective ones (see Kautsky et al wc 

describe the robust control problem for system ( 1 ) as follows 


Problem RC Given system (1) and a proper region Q in the 
left complex half plane find real matrices K,^ / y = 1 2 of 
proper dimensions such that the following conditions arc 
met 

(1) Matrix A^ is nondefective, and has self conjugate 
eigenvalues all located in region 12 

(2) The eigenvalues of matrix A, + AA at f, - 0 i = 

1 2 , I are as insensitive as possible to small variations in 

,l 



_ [ ^ 1) 
K., 

K 

(12) 

with 

V- F 

Uf,fI) ' d> i(('v„r) ' 

(Ha) 



I - (/-M'/,)(t\ )' 

(Hb) 



w" IIV' A/,'| 

(He) 


vv;,- 1 k,„ 

>V,I ,,| >V|| />(»,)/„, 

iHcl) 

or by 


Kp d* 7 

K.. H'(/ B' 

(14) 

with 

(f' 

,o[) ' 0 , <J> (rr>) 'r 

(ISa) 



0 ,'d') 

(lib) 



/"-(/; AGfl 

(lit) 


4|- Uri] 

7 )1 '■It/ ~ 7(y,)yf(i, 

(Hd) 


Remark 1 1 The above Theorem 1 is a generalization of the 
output feedback eigenstructure assignment result in Duan 
(1992a) (Lemma I) It can be proven by observing the fact 
that the design of a dynamical compensator of order p lor 
system (la) can be converted into the design of an output 
feedback controller for an extended order system with thi 
following coefficient matrices (refer to Fletcher 1980) 


Remark 2 1 Fhe region Q in Problem RC represents the 
requirement on the stability and performance of the closed 
loop system (3)-(4) 

3 Preliminary result 

Let [A be controllable, and [A C] be observable, then 
there hold the following right coprime factonzations 

(sI~A) = ‘(j) (S) 

Ul-A"^) 'C^ = H^^^L '(^) (6) 

where )V(i)efi"'‘^ D(0e/^'’‘^ H{s)eR''’‘'” and L{s)f 
^11 po|yu 0 in,^| matrices and N(s) and D(^), H(s) 
and L( 9 ) arc both right coprimc 


A ^Diag|A (Jj B -Diag[£l l^,] ( -DiagfC /, 1 

(lb) 

and applying the Lemma 1 in Duan (1992a) and using tlu 
well known Matrix Inverse Lemma It is notable that this 
result can be further generalized using the results in Duan a 
al (1991) into the case where the closed loop matrix A 
possesses a general Jordan form 

Remark 3 2 This result clearly reveals all the degree of 
freedom in the dynamical compensator controlled nondefee 
tivc systems and gives the closed loop eigenvectors and iht 
dynamical compensator matiiccs in clear neat forms whicli 
can be conveniently realized by computers It differs with 
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that of Hippe and O'Reilly (1987) in several aspects, (a) It 
eliminates the strict conditions on the closed loop 
eigenvalues required in Hippe and O'Reilly (1987). (b) The 
approach of Hippe and O'Reilly (1987) requires the 
computation of inverses of {n +p) number matrices of order 
(n +p) and a matrix of order {m + p). While our approach 
requires only some simple matrix elementary transformations 
(see Remark 3.3) and computation of inverses of two 
matrices of orders p and m (or r), respectively, (c) The 
approach of Hippe and O'Reilly (1987) requires some try 
and test’ procedures while ours does not, when p is large 
enough, (say p>n-m-r^ \ (refer to Remark 3.4)). 

Remark 3.3. A key step concerning the implementation of 
the above Theorem 1 is to obtain the polynomial matrices 
N(s) and D{s) satisfying (5). One of the simple and efleclivc 
ways to achieve this is to find a pair of unimodular matrices 
P(s) and (?(,v) satisfying the following equation: 

y^(.0[A-.v/ /], (17) 

and then choose N{s) and y)(.s) as follows; 

= CH.V) - ((3,(.t) (IH) 

Subject to the controllability of [A, B\ equation (17) can be 
always realized simply by applying a series of elementary 
matrix transformations to the matrix [A sf B\ (refer to 
Duane/u/., 1991; Duan, 1992a, b,c) 

Remark 3.4, The is.sue ol minimal order compensators 
design for eigenvalue assignment in linear .systems, though 
studied by many authors (Kimura, 1975, Koscnhrock and 
Hayton, 1978), has never been completely solved Observing 
the fact that a dynamical compensator with ineicased order, 
but not the minimal, provides additional useful degrees of 
design Irecdom, we may choose the dynamical order p 
according to any conservative result in the literature. It is 
notable that conslrainis (C l) and (C'2) are met at least for 
the case of distinct closed loop eigenvalues when the 
compensator order p is large enough (say /> ' n m - / -t- I) 
to allow arbitrary a.ssignment of the closed loop eigenvalues 
ol the compensator controlled system 


4. Solution to Problem RC 

Observing that the matrix possessing eigcnstructurc 
(7)-(H) is nondcfectivc. Then a mam task left for solution to 
Problem RO is to establish the closed loop eigenvalue 
sensitivities to variables r,, i - \. 2, , / in the open loop 

system matrices. 1o achieve this purpo.se, we first pieseni the 
following lemmas 

Lemma I (Hellman-Feynman Iheorem (Stephen, 
1988)). Let M ^ {m,^) e be a nondefective matrix with 
eigenvalues A,, f = I. 2, . . , n, and y, and x, be, respectively 
the left and the right eigenvectors of matrix M associated 
with eigenvalue s,, r = 1, 2, . . . , n Then 


dl, 

Sm.. 


( , SM 


(1*^) 


Lemma 2. Let \A be controllable, \A ('| be observable, 
and constraints (Cl) and (C2) be met Then there hold 


AT'V^-W, T'^B’K^Z\ (20) 

where matrices B\ C arc given by (lb), while matrices V, 
2. K, W, Z arc given as in Theorem 1 


Proof. Noting that similar relations hold for eigcnstructurc 
iistsignment via output feedback (Duan et al., 1991; Kwon 
and Youn, 1987) then the conclusion follows recalling the 
fact mentioned in Remark 3.1. 

With Ixmma 1 and Lemma 2, we are now able to prove 
the following theorem. 

Theorem 2. Subject to the assumptions and condition I of 
Theorem 3.1, the eigenvalue sensitivities of system (3)-(4) to 


variations e,, / 1, 2, . . . , / arc given as follows; 

•'.; = |^=‘^.y + e.r (21) 

where 

<ii, = gl[H^{s.)A^N{s,) + 



(22) 


(23) 

i ^ 1, 2, . . . , n -t- p, j - \ , 2, . . 

., /. 


Proof. Define 



AA' = Diag[AA 0^], v4; = Diag[A, 

0 . 1 . 

(24a) 

AB'-Diag|A/i (),J, «>Diag[ff, 

0 , 1 . 

(24b) 

AC'-DiHg|AC 0„1, r;=Diag[C’, 

0,1. 

(24c) 

then there holds 



AA' = 2 AB' = t B',f„ AC - ^ 

A - 1 A 1 A - 1 

and 

(25) 


AA^ ^ AA’ + H'K AC + AB'KC + AB'K AC 

/ 

= S (^i + B'KCl + BlKC)f^ 

A = 1 

I I 

+ H(B;KQ + BiKC;)e,c,. (26) 

k 1 1 

Therefore by applying Lemma 1 and using (26), wc have 


with 


rdAA, 

de, 


■■ ii.. ^ c. 


d., = i!'(a; + B'Kc;+ b;kc)v„ 
b;/cc’;)v„ 

A. I 

/ - L 2, .. . , n 4 p. y = 1, 2, . . . , /. 


In view ol Lemma 2, d,j can be further converted into the 
following form: 

^ 'M,''', + i',B]w, + z^c;v,. 

Finally using equation (24) and applying Theorem 1, wc can 
convert and into the form of (22) and (23), respectively. 

It follows from Theorem 1 and Theorem 2 that the key 
step in solving Problem RC is to perform the following 
opliinizalion problem: 

Min fi,„4,,/ = 0, 1, i - 1,2, . . . , n+/>) (27) 

s.t. F Q, I - 1, 2, . . . , n + p. 

Constraint (C,) and (C 2 ) 

where 

n ^ p I 

g,,’ = 0 , 1 , i= 1 , 2 -- n + p)= X X 

,.i /=.[ 

(28) 

with a., > 0, I = 1.2, . . . , n + p; y = 1. 2, . . . , / being proper 
weighting factors. Here wc have only considered the part of 
d since it can be easily reasoned that d,j is the dominant part 
in when f,. i ^ I, 2, . . , f are small. 

Denote the real eigenvalue , 1 , of A^ by a,, and the 
corresponding parameters f and g, by and rj, respectively; 
denote a pair of complex eigenvalues .y, and s, of A, by 
~ o, 4 O/i, and the corresponding parameters by 
4-* V' gi. = lC’ + /■ = <•. where a,. 

and r},,, / ^ 0, L arc real. Then constraint (Cl) automatically 
hold.s and index (28) can be turned into the following form; 

= V,i- <’„i = 0. 1 , 1 ^ 1,2 - n+p). 
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Further by i;pecially taking the convex field Q as the sum of a 
series of square regions in the complex plane, the 
optimization problem (27) can be arranged into the following 
form: 


Min 7/^,, a,, y = 0, 1, f ^ 1, 2, . . . , n+/?), (29) 
s.t. constraint (C?2) 

fl, ^ (7, ^ 6,, i = 1, 2, . . . , n 4* p. 

Based on the above deduction and analysis, an algorithm 
for solution to Problem RC can be given as follows: 

Algorithm RC 

(1) Solve the right coprime factorizations (5) and (6). 

(2) Solve the parametric expressions for matrices 7;,, V|,. W„ 
and Zq according to (lU), (13d) and (15d). 

(3) Give the values of a, and ft,, i - 1, 2. . . , n f p 
according to the stability and dynamical response 
characteristic requirements of the closed loop system. 

(4) Establish the explicit expression of index F with respect 
to the optimizing parameters. 

(5) Solve the optimization problem (29) by applying proper 
optimization algorithm. 

(6) Calculate matrices T. and K according to (9)-(Il) and 
(12)-(13) or (I4)-(15) based on the parameters obtained 
in Step (5). 

Remark 4.1. The above algorithm has the following 
properties. 

(a) It gives closed loop systems with belter robustness (see 
the numerical results in the next section) since closed loop 
eigenvaluCvS can be conveniently included into the optimizing 
parameters and optimized within desired regions 

(b) It possesses good numencal property since a well- 
conditioned solution to Step (1) can always be obtained due 
the lack of uniqueness of the elementary transformations 
which conduct matrix [A-s/ R] into [0 /] (refer to the 
remarks in Section 4 of Duan (1992b)). 

(c) It is simple since it does not contain going back’ 
procedures and can be easily earned out in alphabetical 
order; it requires less computational work since its 
implementation involves only some matrix elementary 
calculations, the inverses of two matrices of reduced orders 
and the .solution of an independent optimization problem 

(d) Its optimality is completely determined by the optimality 
of solution to the optimization problem in Step (4) due to the 
completeness of the cigcnstructure assignment result. 

Remark 4.2. The key step in our algorithm is the solution of 
the optimization problem (29) which directly determines the 
optimality of the solution to Problem RC. Unfortunately, it 
is hard to derive general dchnite conclusions about the 
optimality of the solution to problem (29) even when the 
closed loop eigenvalues are previously a.ssigned to some 
specified points, since the problem is a general nonlinear 
programming with both the index and the constraints 
multilinear in variables and generally possessing no 
convexity. In practical applications, this optimization 
problem may he solved using any standard optimization 
algorithm. But before implementing this optimization it is 
important to simplily this optimization problem through the 
following two ways. (1) Properly fix an element in each / or 
ft, since there exists lack of uniqueness in eigenvectors of 
matrices (also refer to O'Reilly and Fahmy, 19K5). (2) Solve 
as many as possible variables from constraint (CJ. These 
procedures will significantly reduce the number of optimizing 
parameters and the number of constraints in the optimization 
problem, and hence make the optimization much easier and 
provides solutions with higher precision. 

5. Example 

Consider a controllable and observable linear system with 
the following parameters. 


" 0 1 o' 

r 

0 O] 

1 1 0 


) 0 1J 

„-i 0 



When extended to four dimensions as in (16), this system 
becomes (A\ B', C') which has been considered by Fletcher 
(1980), Chu et al. 0984), Chen (1988) and Duan (T992a). By 
the method given in Remark 3.3, we can easily obtain 


N{s) - 



L 1 J 

r.v -1 o' 

H(.0 = | 1 . 

L 0 1 . 


DU) = —j ’ + .f' + s, 


LU) = 


I r 

.1'. 


In the following, we consider the design of the first order 
dynamical compensator for this system. Restrict the closed 
loop pKiles s,, i — 1-4 to be real, and let 

/n, = 1. /i. A’i, = v,,, 1 = 1-4. 

then the right closed loop eigenvectors arc given as 

-s; I x,|, 1 = 1-4, (.10) 


and constraint (C2) can be written in the following form: 


Define 


n.v,, _V,. “ft,, 

V ■■ ~ n. 


(31) 


then the folli 

1992a). 


= V„„ -t,, = 1-4. 

)wing insightful fact is valid (refer to Duan 


Fact. Let i- 1-4 be dislinel and negative real, .r,. v„, 

I = 1-4, ; - 1-3 be parameters satislving the equation in 
(31). Then 

(1) \,^0, hold for 1-4, 

i¥= l¥^j. 

(2) Parameters x,, ^ I - 4. y 1-3 can be explicitly given 

through the following lorrnulae. 


x^- - -- 


.1 1 4 .S.nl, 


' i.>-t n i: 


^ V.n 


V41 - 


•'4 <^4.! “ ' 4 .’-^ 14 ) 


A 4 J - 1 


y,z " - I )y, 1 + X, V, 1. 


1 4. 


where x, and x,^ are two nonzero iree real pararnelcrs, 

It follows from the above fact that the optimization 
problem m Step (5) ol our Algorithm RC' lor this example 
.system has only six complelely free parameters (including the 
closed l(K)p poles) to he optimized. 

Eleven solutions to eigenvalue assignment in this example 
system via the first order dynamical compensators arc listed 
in I’ablc 1 (where represents the element of Ihc rth row 
and /th column in the constructed matrix K). Solulions 
and are existing ones, while solutions Kj 
and Ki, A',„ arc obtained by the presented approach under 
perturbations in the form nf (lb) with 


r 0 0 

0“ 


'() 

0 

0“ 

11 (I 

(1 

■ 

0 

0 

0 

J) 0 



J 

0 

0 




0 0 0 
0 0 1 


(32a) 


>4, - - 0; fl, -- -= - 0; C, - C , - 


C,-(). (32b) 
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Table 1. Dynamical roMPENSATOR matrices 


Solutions 


^12 


^21 

k-ri 

*2, 

K„ (Fletcher (1980)) 

-47 

34 

10 

49 

35 

1 1 

Kf (Chu et al. (1984)) 

-46.65 

41.39 

13.48 

36.32 

-31 69 

-10 92 

Kj (Chu et al. (1984)) 

-47.00 

41.70 

13.63 

36.52 

-31 90 

-11 00 

(Chen (1988)) 

-46.9999 

27.1833 

-17 2069 

-28.0806 

-15.9828 

-11 OtXX) 

(C hen (1988)) 

-46.99% 

-10.4883 

-24.3246 

-18.3150 

-5.72964 

-10.9999 

K*, (Duan (1992a)) 

-47.00537 

12.39453 

23.88391 

19.61450 

-4.703613 

-ll.(XX)49 


—46.99937 

H. 847533 

23.16630 

20.06524 

-3.165118 

-10.99989 

^7 

-47.00006 

8.797701 

5.108595 

90.98519 

-14.24602 

-ll.(XX)00 

(Duan (1992a)) 

-64..34406 

22 44226 

.34.43155 

23.33.524 

-7.737335 

-13.51834 

Kti 

-58.75030 

17.56029 

22.24512 

31.70565 

-8.947386 

-13.00002 

^ 111 

-58.74984 

17.57862 

13.49337 

52.27109 

-14.76805 

-12.99995 


Tabu 2. Closed loop eigenvalues and figenvectors 


Solutions 

•^1 

■^2 

a:,, 

-1 

-2 

Ai 

-1.051842 

-2.164131 

Ki 

-0.995089 

-1.981681 

K. 

-0.999940 

-2.001396 

K, 

-l.tXX)5l7 

-1.997385 

K. 

-l.(XX)740 

-1.994811 

K,. 

-0.9998724 

-1.999867 

A, 

-MXXWSH 

-1.99304 

K. 

-l.(XXX)79 

-1.729420 

K, 

-0.9999192 

-1.499925 

a:,o 

-0.9981721 

-1.504096 



.V 4 


-3 

-4 

1.4 

-2.766958 

-3.937068 

0.6599U2 

-3.0534.35 

“3.969795 

0.443847 

-2.997297 

-4.001368 

-1.209604 

~ 3.(X)39<^3 

-3.998fX)5 

-2.405559 

3.010174 

-3.994765 

1.492578 

-3.(XX)609 

-3.999352 

1.689753 

- 3 011227 

-3.994784 

7.683374 

-2.288860 

-6.499980 

1.246160 

-3.0(K)284 

-6.499H93 

1.896369 

-2.997073 

-6.5(X)61 

3.116829 


Xj jr4 


7 

14 

23 

5.357691 

8.439294 

15.93948 

4.487640 

10.018350 

16.08444 

-4.469316 

-8.52(KX)9 

-13.77098 

-4.699992 

-7.597358 

-11.275% 

3-822434 

6.8(X)663 

10.51307 

4.106956 

7.130991 

11.01099 

18.55133 

32.51201 

49.85145 

2.766928 

6.746092 

20.50924 

3.35737 

8.61836 

30.32894 

5.554411 

14.18514 

50.(X)907 


Solulion K^^ was obiaiiicd without consideration of 
robustness by a pole assignment approach, while solutions 
K^ ' K, and were obtained by minimising the condition 
numbers of the closed loop eigenvector matrices. Solutions 
Kit - were all obtained on the condition that the closed 
loop eigenvalues are previously assigned to - - 1 , i, ^ ” 2 , 
-3, .V 4 - -4, while solutions were obtained by 

optimizing closed loop eigenvalues within the lollowing 
region. 

-2.5. “1-5. 

~ 5 < I - 1 . 

The practical closed loop eigenvalues and eigenvectors 
corresponding to these solutions are given in Kablt 2 (it 
follows from (30) that parameters .s, and x, completely 
determine the closed loop eigenvector v,). Some robustness 
measures associated with these solutions arc given in Table 3 

(where A:,(V')=||V||,||V '||,,, c =|e,,f,.r, - 

IIMI ll.vjl, 1 - 1, 2, . . . , n T p). Table 4 shows the shifted 
closed loop eigenvavalucs of the system under structural 
perturbation (lb) and (32) with f, = fT-0.05. 

O.Ul. 


From lable 3 and Table 4, we can see the following 
points. 

(a) Solutions with smaller ||c|l 2 and values usually 

have smaller F values, but solutions with smaller F values do 
not necessarily possess small |lr ||2 and F' 2 (V) values. 

(b) The imaginary parts of the shifted closed loop 
eigenvalues generally decreases and the real parts generally 
get closer to nominal values as the index F gets smaller. 

(c) Inclusion of the closed loop eigenvalues into the 
optimizing parameters significantly improves the robustness 
of the closed Im^p system. 

Wc have found through numerical computation that index 
A' 2 (V) is more sensitive to the closed loop eigenvalues and 
eigenvectors than the other indexes appeared in Table 3, 
especially when its true value is large. Very small changes in 
the dosed loop eigenvalues and/or eigenvectors generally 
give relatively small error in indexes F, ||c ‘||2 and ||Ar||/,, but 
may lead index F' 2 (^) away from its nominal value. 
Probably because of this reason, Chu et al. (1985) and Chen 
(1988) have all derived, for their solution, incorrect values of 
the measure Kj{V) while giving the correct ones of measures 


Tadli 3. Robustness comparison of fxlsting .soluuons 


Solutions 

F 

Iklb 

A2(V) 

mir 

An 

I79.6.S4I 

4 . 54 . 21.350 

1016.8220 

84.92349 

AC I 

255.7211 

.57I.621(K» 

1454.6020 

80.70756 

A^ 

195.2416 

428.45190 

930.1136 

81.27722 

k\ 

173.2.508 

.3.33.6I6(K) 

752.6219 

66-4(XX)7 

Ka 

174.6446 

.346.41980 

803.3182 

58.30882 

4 

167,3580 

.3(K). 87540 

680.8795 

58.83395 

* * S 

A-. 

166,2806 

301.16710 

686.7354 

57.94417 

K-, 

156.9077 

705,201KH) 

1515 8720 

104.47370 

''7 

A 

48.9879 

86.94968 

557.6660 

81.34181 

H 

A» 

40. .59199 

HI 35593 

693.3435 

73.51208 

ri.tf 

AC in 

40.62913 

in0..3(WK0 

770.5225 

84.03561 
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Table* 4. S^^I^^ED closed loop eigenvalues under system perturbations 


Solutions 


s 


1.2 


^1 

K. 

K, 

Ks 

K, 

K, 

K, 

K, 


-1.1718552 ±0.2796065; 
-1.1708877 ±0.3763229; 
-1.1380780 ± 0.3386860; 
-1.2014933 ± 0.19934(K); 
-0.8852484 - 2.0108064 
-1.0295902 -1..5206610 
-0.9958176 -1.5%88CK) 
-0.9%7350 -1.5944849 
-1.0229320 -1.4578643 
-1.1052814 -1.1749888 
-1,094996(1 -1.1867652 


-3.828196 ± 1.1704669; 
-3.7891.54 ± 1.2370240; 
-3.8619442 ± 1.2308790; 
-3.7986069 ± 1.1193070; 
-3.552054 ±0.6713790; 
-3.725676 ± 0.9925040; 
-3.70.3596 ± 0.9529098; 
-3.704390 ±0.9542037; 
-3.9413924 -6.0962797 
-3.5209724 -6.1988390 
-3.5184949 -6.1997573 


||c ||2 and We have given much more precise closed 

loop eigenvalues and eigenvectors in Table 2 for all these 
solutions. In fact, it can be verified that with the values given 
in this table, del (A ± flAX - .v,/) and ||(A ± fiAX - .y,/)vj| 
are all in 10 while the eigenvalues given in Chu et ai 
(1984) and Chen (1988), det (A ± 5AX - .y,A), f -1-4 have 
only reached 10 ‘ and 10 respectively. Therefore, our 
values given in Table 3 are much more accurate. Moreover, 
it can be seen from whichever sense that solution A, is not a 
good solution at all. 

6. Conclusion 

This paper has presented a simple, effective, complete 
parametric approach for eigen.structure assignment in 
multivariable linear systems via dynamical compensators. 
Using this approach, unified explicit and complete 
parametric representation.s of the compcn.sator coefficient 
matrices and the closed loop eigenvector matrices can be 
established with rc.spect to the closed loop poles and four 
groups of partially free parameter vectors which clearly 
reveal the design freedom in the problem By utilizing thiS 
eigenstructure assignment result, the problem of robust 
dynamical compensator design, in the sense that the clo.sed 
loop eigenvalues are as insensitive as possible to small 
variations in part of the elements of the open loop system 
matrices, is successfully treated. And an efficient algorithm is 
presented which allows closed loop poles to be optimized 
within desired regions, and is shown, either by analysis or 
computational results, to be simple, require less computa¬ 
tional work and possess good numerical property and better 
optimality. 

Acknowledgement —The author is grateful to the reviewers 
for their helpful comments and suggestions. 
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Abstract —We consider the problem ol disturbance decou¬ 
pling by means ot a single stale Ieedback lor a whulc family 
of systems depending on a parameter, p. We give a 
geometric necessary and sufficient condition for the existence 
of solution to such problem under the h\polhesis that the 
systems of the family are gencncally injective and the 
dependence on p is polynomial or, respectively, the set of 
possible values for p is Hiiite, 

1 . /nlroductian 

Tm PRGBi.r.M ot synthcsi/ing a state Ieedback law which 
decouples the output of a system with respect to a 
nonmeasurablc disturbance, oi Disturbance Decoupling 
Problem (DDF), is a relevant one m control theory In the 
case of linear systems, a complete solution to this problem 
can be given in geometric terms using the notion of 
controlled invariant suhspaccs (Basile and Marro, IMbM; 
Wonham, UJK 5 ) Suitable generalizations ol the geometric 
approach provide a solution to the [)D!* m several other 
contexts, concerning for instance nonlinear systems (Isidori 
e( al., 1981) linear periodic systems (Cirassclli and Longhi, 
198h), iwo-dimensional systems (Conte and Perdon, 1988), 
inlinile dimensional systems (C urtain, 1986), 

In this note, our aim is to study the DDP for linear systems 
in presence of uncertainly. More precisely we will consider a 
family of linear systems ij(p) described by the equations 

ji:(f) ^A{p){i) -t- /f(p)u(0 -f f){p)y^>U) 
y(/) = C'(p)a(0. 

where p ^ (p,, , . . , p,) e £2 c K' is a parameter whose 
presence models an uncerlainiy on the system coefficients, u 
IS the control variable and w is a disturbance. The goal oi the 
design problem is lo find a static stale feedback law u - F\. 
not depending on p, such that lor every p in £2 (he output of 
the compensated system X,,^(p) dnc.s not depend on the 
disturhance w. This problem, that we call the Robust 
Disturbance Decoupling Problem (RDDP), was first 
considered m Hhatiacharryya (1983) in the case in which the 
matrices defining l^/lp) depend linearly on the components 
of p The results obtained in Bhaliacharrvya (1983) 
consist essentially of a sufficient geometric condition for the 
existence of solutions and of a procedure lor tesling it. 

* Received 29 May 1991; revised H May 1992; received in 
final form 17 July 1992 Tliis paper was recommended lor 
publication in revised form by Associate Editor R. V. Patel 
under the direction of Editor H. Kwakernaak. 

t A preliminary version of this paper has been presented at 
fhc First IFAC Symposium on Design Methods ol Control 
Systems. Zurich, Switzerland, September 1991. 

I Dipartimenlo di Elcttronica c Autoinatica, tjnivcrsiti^ di 
Ancona, Via Brccce Bianchc, 60131 Ancona, Italy. 

Dipartimento di Matcmatica, IJniversilii di Ancona, via 
Brecce Bianchc. 60131 Ancona, Italy. 


However, being equivalent to the existence of a single 
suhspacc having the same invariance property with respect to 
all the systems ol the family, the sufficient condition of 
Bhattacharryya (1983) is far from being necessary. 
Sunsequcntly, a special case, in which the matrices defining 
the systems ol the lamily take values separately m a convex 
set, has been investigated in Cihosh (1985). In that paper it is 
assumed that p ^[p\, py, p^, P4) £ £2 = {.r ^ (jt, , 4.,, ^3, ^4) 
such that A{p) = A(p^) - p 

(1 = + p,)B,. C(p) = 

r(/>,) = /',r„+(l -p,K',, /)(p) = 0 (pj=p, 0 „ + (l- 

P 4 U), and necessary and sulficient conditions for the 
existence of solutions lo the RDDP are lound. 

Here, wc strongly improve the results of Bhattacharryya 
tl983) by providing a necessary and sufficient condition for 
the existence ol solutions to the Robust Disturbance 
Decoupling Problem in the case in which the matrices 
defining ^j{p) depend polynomially on the components p^s 
of p, £2 IS an open subset of R' and gencrically (that is for all 
p except those belonging lo the set ot zeros of some 
polynomial) ^,i{p) is a left invertible system. Note that the 
last one, when depends polynomially on the 

components oi p, is a relatively mild assumption. Our results 
represent also an improvement of those ol Ghosh (1985) 
since the family X,y(p) we consider is more general than that 
in Ghosh (1985). 

From a technical point ol view, wc start by considering the 
case (probably of more practical interest) in which C 2 is a 
finite set, then we reduce the general case to that one. The 
tools we employ are those ol the so-called geometric 
approach (sec Wonham, 1985) In particular, in proving our 
results wc make use ol the notion of sell-bounded control 
invariant siihspace introduced and studied in Basile and 
Marro (1982) The intersection of two self-bounded 
controlled invariant subspaces has been shown (Basile and 
Marro, 1982) lo be a controlled invariant subspace and this 
implies, in particular, the existence of the smallest controlled 
invariant subspace containing a given suhspace, By means of 
this ob|ccl, wc can slate a necessary and suflleicnt condition 
for the existence of solutions to the RDDP that, assuming 
left invcrtibiliiy for the systems of the lamily l^fp), can be 
practically checked by means of classic geometric 
algonlhms- 

rhe pa[>er is organized as follows. In Section 2, we slate 
the problem and we describe some preliminary results on the 
set of Ihe controlled invariant subspai es of a system. In 
particular, relying on ihc results of Basile and Marro (1982) 
on self-bounded controlled invariant subspaces, we show the 
existence of minimal elements of such set and recall a 
procedure, based on geometric algorithms, for constructing 
them. In Section 3, we provide our main result and we 
discuss an example. 

Through the paper, we assume that the reader is familiar 
with the basic concepts of the geometric approach and, in 
particular, with the notions of controlled invariant subspace, 
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conditionally invariant subspace and with the related 
algorithms, as they are described, for instance, in Basile and 
Marro (l%9) and Wonham (1985). 


Proof. Any controlled invariant subspace V containing K 
and contained in Ker C is self-bounded, hence the conclusion 
follows from Basile and Marro (19H2). 


2. Preliminaries and statement of the problem 

Let us consider the family of linear control system XAp) 
described by 

|i(f) = A(,p)xU) + B(p)u(t) + D(p)w{i) 
\yU)=C(p)x(t) > 

where jr e A" = /?” is the state, u € U - R"" is the control, 
w eW = is a disturbance, y eY - R^^ is the output, 
p = (p,, . . . , p,) e Q is a parameter and 

(A(p), B(p), C(p), D(p)) are matrices of suitable fixed 
dimensions whose entries depend on p. 

The control problem concerning the pair (£,^(p), Q) wc 
want to consider is defined as follows: 

2.1. Definition. Given the family of systems 2^(p) 
described by (1) and the value set Q for the parameter p, the 
Robust Disturbance Decoupling Problem (RDDP) concern¬ 
ing the pair (Xj{p), consists in finding a static state 
feedback law u = Fx, not depending on p, such that for every 
p in n the output of the compensated system Sj^fp) does 
not depend on the disturbance w. 

2.2. Remark. One may prefer to refer to the above design 
problem as to a “simultaneous” or “parameter independ¬ 
ent” disturbance decoupling problem, u.sing, as done in 
Basile and Marro (1987) the adjective robust to denote the 
case in which the parameter p is supposed to vary with lime 
and the output of the comp»ensated system is required to be 
independent on the disturbance with respect to such 
variation. No matter what terminology one adopts, it should 
be clear that our problem is equivalent to solving, by the 
same feedback, a family of disturbance decoupling problems, 
one for each value of p in Q, while the one considered in 
Basile and Marro (1987) is essentially a disturbance 
decoupling problem for a time-varying system, whose 
acceptable solutions are time-varying feedbacks. 

It is quite easy to derive from Basile and Marro (1969) or 
Wonham (1985) the following rc.sult 

2.3. Proposition. The RDDP concerning the pair 
i'Ljip), Q) is solvable if and only if there exists a family 
{^(p)}/;eu subspaces of X such that: 

(i) Im D(p) c V{p) cz Ker C(p) for all p e U; 

(ii) there exist a linear map F:X~^V such that 
(A(p) -K B{p)F)V{p) cz V(p) for all p e n. 

By generalizing, in a straightforward way, the classic 
terminology of the geometric approach, a family of subspaccs 
{V{p)}p^u which the Condition 2..1 (ii) holds will be 
called a feedback-type family of controlled invariant 
.subspaccs, and any map F.X^U such that (/4(p)-f 
B(p)F)V(p) Cl V{p) for all p e Q will be called a friend of 
the family. Clearly, the difficulty in dealing with the RDDP 
is in testing the Conditions 2.3 (i) and (ii). In oidcr to 
describe a procedure for doing so, at least for a meaningfully 
large class of families 2j(p). wc need to introduce .some 
auxiliary results about the set of the controlled invariant 
subspaces of a linear system, lo fix the notation, let us 
consider the linear system 

{xU)^Ax(t) + BuU) 

[y(t)-Cx(t), 

with X E X - R". Following the terminology introduced in 
Ba.sile and Marro (1982), a conirollcd invariant subspace V 
for 2 is called self-bounded if it contains (V^* film B). The 
basic properties of the self-bounded controlled invariant 
subspaces are studied in Basile and Marro (1982); here we 
recall in particular the following results. 

2.4. Proposition. Let V* denote the maximum controlled 
invariant subspace for 2 contained in Ker C, and let K a V* 
be a subspace for which the condition (l^"^ Pi Im fl) cz A" 
holds. Then, the set of all the controlled invariant subspaces 
for 2 containing K and contained in KerC has a minimal 
element. 


Assuming that (V'*nimfl)c: K holds for a given subspace 
/C of T*, let us denote by V^(K) the minimal element in the 
set of all the controlled invariant subspaces for 2 containing 
K. The following result of Basile and Marro (1982) relates 
T^fA^) to other geometric objects. 

2.5. Proposition (Basile and Marro, 1982). Given 2 and a 
subspace K(zV* with (V* n Im fl) c: A", let S*{K-^\mB) 
denote the minimum conditionally invariant subspace for 2 
containing (K-i-XmB). I’hen, V^(K) coincides with the 
intersection V* nS*(K Im B). 

The above proposition points out, in particular, that by 
using the well-known geometric algorithms, which provide, 
respectively V* and, with a suitable initialization, S - 
(K -t- Imfi), (see Basile and Marro, 1969) one can construct 

y,(K). 

II 2 is left invertible, since n Im R - {()}, the condition 
(n Im fl) cz A" and the conclu.sions of Proposition 2.4 and 
Proposition 2.5 hold for any subspace A of V* (in particular 
all the controlled invariant subspaces of Ker C are 
self-bounded). Assuming left invertibility of 2, one has in 
addition that B is injective and, hence, the following result. 

2.6. Proposition (see also vSchumachcr, 1983). Let 2 be 
left invertible and let V V* be a controlled invariant 
subspace for 2. Tlicn, wc have the following. 

(i) If F .X~-*U is a friend of V', F is also a friend of any 
controlled invariant subspaee V' c- V. 

(ii) If F. LI IS a friend of V and F': A—► U is a Iriend of 
V cz T, then F^ Fjv 

Proof (i) Let (A + BF)V r V and {A BF')V'cz V hold 
for Fand for some F' .X P Phen, lor any v' e V' c V^, we 
have An’— BFv'— vy^ BF with and i» ,e 

V* d V. Since V’ n Im F •• {()}, this implies in particular that 
v^ = U; E V and hence (A -t BF}V' c: V'. 

(ii) By the .same argument as above, BFv'-BF'v' and 
hence Fu' - F'n' for any v' e V 


3. Main result 

Let u.s start by considering the RDDP concerning a pair 
(2,y(p), ). where 2,y(/;) is defined by (1) and Q, 

(F„, . . . , FJ c: F' IS a finite set. 

3.1. Proposition Ciivcn (2\y(p), ) let us a.ssume that 

y(n\ 1-^(0 + Bip)uU) 

^ \y{t)-('{p)xH) 

(that IS the undisturbed system) is left invertible for any 
p E Qf. Then, the RDDP concerning (2,y(p), ii,) is solvable 
if and only il ]mD{p)cV*{p) lur all p^Uf and 
fT^(lm Dip)))^ E Qf is a leedback-type family of controlled 
invariant .subspaccs 


Proof The ‘if part is obvious. ( onversely, let us a.ssume 
that the RDDP at issue is solvable. By Proposition 2.3 there 
exists a feedback-type lamily ol controlled invariant 
subspaces {V'(aOL^o, ^uch that \mD{p)dV{p) lor all 
peQf. In particular, Im/)(p) is contained in V*{p) and, 
hence, V*(lm D(p)) is well defined for all p in, in particular, 
V^(\mD(p))r V{p). Let F be a friend of {V(p))^^,^^^\ by 
Proposition 2,6 (i) F is also a friend of T^flm Dip)) for any 
pEilj and, therefore (F^(Im D(p)))^,^^ is feedback-type 
family of controlled invariant subspaccs 


The relevance ol the above result is due to the fact that the 
necc.ssary and .sufficient condition described can be 
practically checked. There is actually no difficulty in checking 
its first part, while its second part can be checked by using 
the following procedure. 

3.2. Procedure. Step (0). Compute V^(lmD(F„)) and a 
friend Fn of it. Set A,, - V,(Im D(/;,)) and - f;,. 

Step (r 1). Compute F^llm D(P ,,,)) and a friend F, t, of it 
h)) “ lU’ write X 

F,(lmD(F,,,)))0A;, set X,, X,VJ\m DiP,,,)) and 
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define F.+.iJr-l/ by F,^,|^ = F, 

F,n\v,^l'nLt^p,^,)) and F, 


l+l|V.(lni i)) 


F|| A', Otherwise, Stop 

It is, in fad, easy to realize by Proposition 2.6(ii) that if 
the above procedure slops because jiff- 

crenl from »|A^,fiv*(imO(/>,^.,)), then {Z)(/ 7))) 
js not a feedback-type family of controlled invariant subspacesf 
Otherwise, i.e. if the procedure can be performed up to Step (r) 
included, it eventually produces a friend of 

3.3. Remark. In general, very little can be said in 

geometric terms about the possibility of achieving the robust 
decoupling of the disturbance with stability, except- <iuite 
obviously—that a necessary condition is the solvability ot the 
DDP with stability for any p (Wonham, I985). As.suming 
that the above procedure yields a friend F ol 
(V^(Im ^nd Idling X" denote the subspate 

(U,orH/^*(JmD(p))), any other friend coincides with F, on 
X*\ Then, a stronger necessary condition is that, for any 
p € ilf the eigenvalues of the restriction of (A(p) + 

to the largest controlled invariant subspacc of S(p) 
contained in X" have negative real part. 

3.4. Remark, It is obvious from 3.2 that 

{l/^(Im feedback type family if, in particular, 

each element in it has zero iiilLTscctioii with union of all the 
others. It follows then that the RODP is genencally solvable 
in the space of all families 2,y(p) verifying the conditions that 
for any p the corresponding DDP is solvable and that 
dim A'i^,^ij^dim Vjim Dip)). 

Wc can now remove the finitencss assumption on the value 
set for p. So, Id us consider the RDDP concerning a pair 
U). where !,/(/>) is delinetl bv ( 2 ) and Qc R' is an 
open sd. 

3.!>. Propo.Kition (riven U) let us assume that 

(i) the entries ol the matrices (Aip), ('(p), D{p)) 

dclining ^,jip) depend polynomially on the components 
o(p - ip,. . . .p.)\ 

(li) there exists a pMilynoniial P{p) such that 

ji(0 - -* fi(p)u(() 

lv(0 - ('(p)>r(f), 


IS left invertible for any peil\{p^il such that 
Pip) = 0}. 

Denoting by max deg^, M(p) the maximum degn‘e m p, of 
the elements ol a polynomial matrix Mip). let r, be an 
integer greater than or equal to 

(max deg^, C(p) A max deg^,^ Dip) 

in - I) max (max deg^, Aip). max deg^, iPip))) 

and chose arbitrary distinct values a,,,. . 

> i ^ ^ such that the finite set 

i, - t/T' for - I), , . . , f)) IS contained 

in such that P(/^) —d} I'hcn, the RDDP 

concerning iX^fip), U) is solvable it and only il the RDDP 
concerning (2^y(p), Qf) is solvable. Moreover, any solution 
of one of the problems is a solution of the other. 


Proof, The only if part ol the first group of statements is 
obvious, as well as the fuel that any solution ol the RDDP 
concerning iXjip), U) is a solution ol the RDDP concerning 
(I.y(p), £2,). So, Id us assume that the RDDP concerning 
(iL,y(pj, ) IS solvable and Id F .X ~*V be a solution 
This implies that the elements ol ihe matrices 
Cip)Dip), CipMMp)^ Hip)F)Dip), .. Cip)iAip)-^ 
Bip)Fy' ^D(p) arc zero for all p c £2y. Since these elements 
iirc polynomials of degree Ics.scr than or equal to n, in each 
Pi, this implies (see the Appendix) that they arc zero for all 
p ^ R' and, in particular, for all p e Q. Hence, the output ol 
Ihe compensated system I./,.(p) docs not depend on the 
disturbance w for any p e- t2 or, in other words, F is a 
solution of the RDDP concerning (2-,y(p). ^2) 

As a result of the above proposition, the existence ol 
solutions to the RDDP concerning (2,,(p), £2) can be 
cheeked by means of the Procedure 3.2 with Qf ~ {P,^, , ' 

(fli,,, . . , ) 6 R' for _ ,n,). (Obviously this implies 


a large number of computations, but it also provides an 
algorithmic construct'.m of a solution, if there are solutions. 

3.6. Etampie, Ixt us consider the family of systems 2j(p) 
described by (2) for 


/ I 0 ,r\ 

A(p)= I 0 p + I 0 I B(p) = | 

\p - 1 1 0 / 


mp) 


r\ 

1 . ^( f ^) = (0 0 


and p e U = R. For p = 0.1(0) is easily seen to be left 
invertible, hence l(p) is genetically left invertible. The 
dependence of l(y(p) on p is not linear, nor is there a single 
controlled invariant subspace containing Im D(p) for all 
p t f2 having a friend which docs not depend on p. Hence, 
^Ap) Oocs not verity the sufficient condition described m 
Bhallacharyya (1983) for the existence of solutions to the 
RDDP. Actually, the computation shows that Ker C’ is 
controlled invariant lor all p e ^2, and hence V*ip) - Ker C 
for all p e Q. but (kMp)},,ei3 is not a lecdback-type family 
of controlled invariant subspaces. Since r, = 4, we chose 
il, =- {0-4} and we compute VJlm D(p)) for p The 
computation yields 

VJim Dm - 10 }, 


l/JImD(l))- span 11^-1 


VJIm Di2)) span 


11 


-4 

0 


I'^dm /M3)) ^ span ^ 


V'^(lm D(4)) - span ^ 


and il IS not difficult to verily that T - (1 0 0); U is a 

fiicnd loi V'dlnif)(p)) lor all p e £2y. Hence (T*(p))^^ij^ is 
a fcedhuek-iypc family of controlled invariant subspaccs and, 
by Proposition 3.5, f ■ i\ 0 0) is a solution to the RDDP 
concerning (l\/(p). B) In as one can easily check, 

('ip)iAip) -I Bip)FyDip) -■ 0 (or all p t £2 and all / ^0. 

4 (’onr/n.vn^/r 

We have given a geometric necessary and sufficient 
condition toi decoupling, by means ol a single state 
feedback, the disturbance from the output of a parameter 
dependent family of linear systems. The main assumptions 
we made are that the systems in the family have coefficients 
which depend polynomially on the parameter and that they 
are genencally lefl invertible I he led inverlibilily is used for 
inferring the existence of minimal controlled invariant 
subspaccs and for staling the problem in geometric terms. 
The assumption about polynomial dcpctulence allows us to 
reduce the general case to the case of a finite family of 
systems and to compute practically a solution to the problem. 
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Appendix 

Proposition. Let n(p) be a polynomial in the components 
p^s of /7 = (p,, . . . , pj with degree lesser than or equal to r, 
in each p,. Let a^^^,. . . , a^,.^; fl 2 n» ■ • ■ * ■ ■ ■ i 

a^^^ € R be arbitrary distinct values and consider 
the finite set fi/c R" defined by fly - 

(fli„./or 1 ,^ 0 ,..., r^}. If n(/’) = 0 for‘all 

Pe then n(p) is the identically zero polynomial, that is 
n(P)-0for all Per. 

ProoJ. We use induction on the number of variables. The 
statement is true for .y= I, .since, in this case, n(p) is a 
polynomial of degree lesser than or equal to r, which 
vanishes in r, + 1 distinct points. Assume now that the 
statement is true for s = k and let p = (p,, . . . , p^,,). Write 
n(p) as a polynomial in p^^, with coeflicients ri,(p), the 
n,(p) being polynomials in p == (p,, . . . , p*) of degree le.sser 
than or equal to r, in each p,. By evaluating the coefficients 

n^(p) in (a,„. fl 2 „.we get a polynomial n'(p* + ,) in 

one variable, of degree lesser than or equal to % + , which 
vanishes in + l distinct points. This implies that 

IS the identically zero polynomial and, hence, that 
all the coefficients n,(p) vani.sh in (u,„, Ujo, . . , Ohi) 
repeating the same argument, we get that each coefficient 
n,(/;) vanishes in all P eQ} = {P,, = (Oi,, , . . , 6 P' 

for 1 ^ = 0... . , r^}. Then, by the induction hypothesis, all the 
n,(p) are identically zero polynomials, and as a consequence 
n(p) IS the !dentically zero polynomial. 
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Stability Robustness Characterization and 
Related Issues for Control Systems Design*t 

YANLIN Lit and E. BRUCE LEE* 

Key Words —Robustness; optimization; linear systems; control system design. 


Abstract—Two issues are addressed in this paper. First, wc 
show that the H" norm of a sensitivity function matrix for a 
stable multivariable closed loop system indicales its stability 
robustness in the sense of gain margin and phase margin. 
Second, we apply this result to the problem of synthesizing a 
robust stabilizing controller. The emphasis of the analysis is 
LO feedback system design, llie goal is to design an 
observer based controller such that the resulting closed loop 
system has the robustness property that an LO stale 
feedback system has. This is achievable if the sensitivity 
function of the observer based control system is vciy close to 
that of the state feedback .system. The closencs.s is measured 
by the //“ norm and classical A/'" optimization techniques are 
used for design. 

1. Introduction 

CoNSinf R A n-iDBAf K system with the configuration .shown 
in Fi^. 1. Here A^(i) is the transfer function of a stabilizing 
controller and P(s) is the plant model. One fundamental 
concern m feedback control system design is stability 
robustness. Gain and pha.se margins arc frequently used as 
indicators for stability robustness. In classical trequeney 
domain SISO control systems synthesis, the robustness issue 
IS easily handled by employing synthesis techniques based on 
Nyquist plots (or Bode or Nichols plots). In this paper, wc 
use gain margin and phase margin to describe stability 
robustness. 

For SISO cases, stability robustness in the sense of gam 
and phase margins can be described, by the nearest distance, 
,say from the point (-1,0) in the complex plane to the 
Nyquist plot of the feedback system. Suppose that the closed 
Uxip system in Fig. I is stable and its Nyquist plot doe,s not 
meet the circle with center at (—1,0) and radius r„. In this 
case, It is known that the system has gain margin 
(1/(1 +r„), 1/(1 “ r^)) and phase margin fy„-±cos '(1- 
{rill)) (Uhlomaki et tiL (19K1)). Therefore gain and pha.se 
margins have an explicit relationship with the radius r„ of 
such a circle which the Nyquist plot of a SISO system does 
not meet. Wc call r^^ the simultaneous stability margin. It can 
be easily shown that the nearest distance from the Nyquist 
plot to (-1,0) point is the inverse ol the H ^ norm of the 
system’s sensitivity function Hence, the H norm of a 
system’s sen.sitivity function has an explicit relationship with 
gam and phase margin for a SISO feedback system 

The dchniiion of gain and phase margin lor an MIM 
system means that simultaneous gain and phase variations m 
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all the loops of a multivariable feedback .system (Lehtomaki 
et at. (19R1)). One motivation of this paper is to show that it 
is also true that //” norm of a system’s sensitivity function 
has an explicit relationship with gain and phase margin for 
MI MO systems. 

A typical example revealing the relationship between the 
sensitivity function and stability robustness is LO state 
feedback systems. If one uses LOR design technique in the 
SISO rase, then the obtained feedback system has gain 
margin (^,^) and phase margin +60'’. This is because 
the norm of the sensitivity function is less than or equal 
to one (use Kalman inequality), or the Nyquist plot does not 
pass through the unit circle, with radius = 1 and center at 
(-1,0). Thus a system with LQR controller has 
simultaneously stability margin r,, ~ L 

Since the robustness property of a feedback sysiem can be 
described by its sensitivity function, one can design a control 
system with good robustness by putting some constraints on 
the .sensitivity function. This provides us with a guide on how 
to design a feedback system with guaranteed stability margin 
(simultaneous gain and phase margins). Characterizing 
stability robustness by gain and phase margins is practically 
useful in the sense that if one knows how big the infinity 
norm of the sensitivity function is (by proper system design), 
then one has an idea of how much to adjust the loop gain to 
achieve a desired response without destabilizing the closed 
loop system. 

In the last half of the paper, we investigate a well-known 
question: when can an observer based control sysiem 
achieve the stability robustness property of an LQ state 
feedback control system? From the preceding discussion, one 
can conclude that this is achievable if the sensitivity function 
of the observer based control sy.stem is properly designed so 
that it is close to that of slate feedback sysiem in the .sense 
of ir norm. I'his leads to the idea of sensitivity recovery. Tl 
will be shown that sensitivity recovery implies loop transfer 
recovery Thus wc obtain a systematic way of doing LTR 
.system design. An interesting result is that if the controlled 
system is minimum phase, then loop/scnsitivily recovery can 
be achieved as well as desired, which is the same as the 
cla.ssical result. Tf the controlled sysiem is non-minimum 
phase, then the recovery error can not be made arbitrarily 
small However, the error can be minimized by using an //’ 
optimization approach, which is belter that the classical 
re,sult. This proposed recovery approach is more systematic 
than the classical approach (Doyle, 1981). 

rhe notation to be used is standard; The infinity norm of 
G(,v)is defined by ||G|U sup {a(G()(n)): e K), where a 
IS the maximal singular value. The real part and imaginary 
pari of G arc denoted by Re (C7) and Im (G), respectively. 
The determinant of G is denoted by del(G). Single- 
inpul/single-oulput and multi-input/multi-output are den¬ 
oted by SISO and MIMO, rc.spcclivcly. 

The paper is organized as follows. Stability robustness 
issues will be discussed in Section 2. In Section 3 we resolve 
some interesting issues posed in Section 2, and in particular 
propose a methodology to design an observer based 
controller that has good stability robustness, which leads to 
sensitivity and loop recovery. Illustrative examples are 
oficred in Section 4. 
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K(s) 




P(s) 
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Fig. 1. Feedback systems configuration. 

2. Stability robustness characterization 

Definition 1. For a stable SISO feedback system as 
diagrammed in Fig. 1, the quantity r„ with 0 < r,, < 1 is said to 
be a simultaneous stability margin if and only if 

\-re^^^K(s)P(s)¥^0 for all 0 < « Zjt, 

Re (j)=2r0. 

llie above definition can be undcrstfxid in the following 
way. Assume that a closed-loop system is stable. Then its 
Nyquist plot will not meet the point (-1,0). Simultaneous 
stability margin means that the Nyquist plot docs not meet 
the point (-1,0) and further, it is kept a distance r„ away 
from the (-1,0)-point. 

If the Nyquist plot is kept away from the (-l,0)-point by 
at least Tq, then the system (SISO) will simultaneously 
possess the following gain margin and phase margin 
(Lehtomaki et al, 1981): 


'('-?) 


PhM = ±cos“ 


This means that if the loop transfer function K(s)P{s) is 
perturbed to Ar(j)P(.r)fre^'^, then the closed loop system 
remains stable for all {<p, a) such that 


'(l- “')< 0<+COS 



and 


- - - - < a ’ 

1 + r„ 1 - r„ 

Definition 2. For the stable MIMO feedback system as 
configured in Fig. 2 with m inputs and p outputs having 
p^m, the quantity r„ with (J<rp<l is said to be a 
simultaneous stability margin in if and only ii 

|/ + X(j)P(j)+ UHV\¥^{) tor Rc(,s);=ra, (2) 

where URV is the perturbation in polar lorni with U*IJ - /. 
W=/and |iR||.. 

Definition (2) means that in the multivariable ca.se, a 
simultaneous stability margin of implies that if the loop 
transfer function K{s)P{s) is perturbed to 

fC(.0P(^)diag (/,,/,.L), 

(see Fig. 2j, then closed-loop stability still remains, provided 
for all / = 1,2, . . , , m. 

-cos ' (l - <<!>,< +COS ' (l - 

for I, = (pure phases); 


to be real numbers (pure gains). 

lliis will be explained in more detail later. 

Remark 1. In Definitions 1 and 2, we consider only the case 
0<r„<l. One can also consider the case when r^vi, in 
somewhat the same way. 

Remark 2. In Definition 2, dual results hold if m ^p. The 
only difference is that the sensitivitv function is (/ + 
P{s)K{s)) ‘ instead of {I K(s)P(s)) \ and the perturba¬ 
tion is considered at the output of the system. 



Fig. 2. MIMO feedback system configuration. 


Theorem 2.1. Consider a SISO controlled system as 
configured in Fig. 1, P(s) i.s the transfer lunction model of 
the controlled system and K{s) is the transfer function of a 
stabilizing controller. This feedback control system has a 
simultaneous stability margin r„ if and only if the sensitivity 
function \(.v): - [1 + A^(j )P(.v)] ' satisfies 

l|.V(.v)||, . (3) 

Proof. It is similar to that of rheorem 2.2. 

Next we are going to present a result which bridges the 
gap between characterizing stability robustness of SISO 
.systems and that of MIMO systems. 1'his is done by I'hcorem 
2.2 below. One feature of Theorem 2.1 and Theorem 2.2 is 
that one can characterize stability robustness of a closed loop 
system in a uniform framework with only the knowledge of 
nominal system model and controller (if only analysis is 
required, then only nominal plant model is necessary) We 
do not have to make assumptions on uncertainties that the 
control system may face. Tins is different from the usual 
approaches. Stability robustness adopted here for MIMO 
systems is characterized by gain margin and phase margin m 
each loop. ITiis robustness characterization is more 
straightforward for engineers than those characterized by 
gap metric, additive or multiplicative uncertainties (Georgiou 
and Smith, 1990; Vidyasagar and Kimura, 1986) because 
engineers usually prefer to know how much gam change can 
be made to achieve a given goal without making the sy.stcm 
unstable. It will be .seen that this simple stability robustness 
result has important applications m /y,/LO(i/lTR design 
and stability robustness analysis 


Theorem 2.2. ('onsider linear MIMO feedback systems as 
sugge.stcd by Fig 2 and let Kis) be a stabilizing controller. 
1.^1 URV be the loop perturbation in polai lorm with 
U*U = I, VV*-I and ||K|U'-r„. Ihen system has a 
simultaneous stability margin r,,, i c. 

\I ^ K{.s)r{s)-^ lJRV\ yf i) foi Rc(.s) -U. 

if and only if ||.’‘(.v)||,^ ■ 1/r,,, where \(^) .^- \l ^ A'(.s)f^(0| ’ 
is the sensitivity function 

Proof. 

\l + A(.s)/*(.y) T l)RV\ 0 for Re (.i) ^ 0 if and only if 
\/^S{s)URV\i^{) for Re(.v) -M), 
since / + A’(i‘)/*(.r) is stable by assumption 
Sufficiency. Assume ||A(j)||, l/Zf,, then 





(4) 


Hence, |/-f A(.v)F(,0 ^ f/WVl ^ 0. for RefO^^O. 


Necessity. We show this pari by contradiction. Suppo.se that 
|/T A^(.i)F(.v) 4 (yRV|=^0 for all Rc(.s):^^() does not imply 
|15(,y)|U l/r„. Then |l.y(.i')l|.,. ^ l/r^,. Thus there exists a 

frequency such that (7(S(jw,^)) - i/r^, and A(/cU(,) can be 
decomposed as S{ja)^^)- (singular value decomposi¬ 

tion), where and arc unitary and I - diag (l/r„, 2,) 
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Now take t/ = ~ K,* and V - then 

\l K{J(o^^P(Jo)^^) + URV\ ^ |A(/w„) * f l/RV\ 

= iv:2 

-|V:(2 ^-R)U:,\=0, 

since I ' - f? = diag (0,which conlradius the 
ussumptinn q 

Corollary 2 3 If an MIMO feedback system has simul¬ 
taneous stability margin r,,, then simultaneously in each loop 
of the feedback system of Fig 2, there is a guaranteed gam 
margin (denoted GM) and phase margin (denoted PhM) 
given by 


PhM = ±.os ((,) 

The following lemma is cited with little change from 
Lehtomaki et at (1981) which vsill be used in the proof ol 
Corollary 2 ^ 

Lemma 2 4 For square matrices G and L, det (/ + GL)=^() 
if 

n({l + G) ')n(l ' /) >1 (7) 


model of the controlled system, and 

KU)-i (il - A + BF + LC) 'L, 

be the compensator transfer function Then 

(i) r(i) = i/ + /c(s)P(jt) + «F) 'x 

LC(il-A + LC) ‘fl, and 

(u) 5(j)-[/ +A:(i)P(j)] '-{I-F(sl-A+BF) '«}{/ + 
F(sl - A + LC) 'B), where ,S(j) is the sensitivity function 
and T(t) is the complementary sensitivity function 

Proof (i) 

+ + Lr) 'LCUl-A) 'fl| ' 

y. F{\l - A + Bt + L( ) 'LC{',I-A) 'B 
= r[/ + (i/ /l + flr+/f) - 4) 'BF\ ' 

x(t/-/l + i5F+/C) 'LC(sl-A) 'B 
-1 {\I - A + Bf- -i LC{^.I-A) ' 
x(i/-/t + flf)] 'y ({s/-A) 'B 
-F{\I 4 + ffr) '{/ + /C’(i/ A) ') ' 

X LCUl- A) 'B 

-FUl-A-tBF) 'LC{l-Hsl-A) '!() ‘( 1 / A) 'B 
^Kil-A + Bh) 'LC(<,I-A + L(.) 'B 


Proof of Corollary 2 1 (cf lehtomaki ti at fl9KI)) let 
(/(j) denote K(s)P(s) Assume that ft'(i) is perturbed to 
0(s)/- where 

I -t1iag(/| I , 

Suppose that one has found an appropriate conliollci suth 
that ||(/+ G) 'lU' l/r,. Then by lemma 2 4 dcl(/ + 
GL) #0 for Re (i) ' 0 if 

!(', '-I)hr„ (H) 

for all 1 since (8) implies (7) To obtain gam maigins let all 
/,s be real From (7) one has 

1 + r,| '' 1 

Similarly, let /,s be e'*^' where 0, is real From (7) one get 
the phase margins expression □ 

Note GM = 1/(1 + r„) is interpreted as that any gain cr, 
inserted in any one of the feedback loops in Fig 2 satisfying 
1/(1 4 r„) ^ or, ^ 1/(1 “ r„) will not cleslabili/c the closed loop 
system Similar interpretation holds for PhM itos ‘(1- 
(r/)/2)) Every feedback loop may allow a phase factor c'*^' to 
be inserted, provided |0J cos '|(l-(r„/2))| si' that 
closed-loop system remains stable 

^ Stability margins for L (JCt toriiro^ sv\ferns and rohiLstness 
recovery 

Considei a MIMO feedback system with stale space model 
dr 

-Ak-^Ru (9) 


where xe/?"*, C f R'’^" and/? 'm Assume that 

{A, W, C ) IS a stabili/dble and delectable triple It is known 
that an observer based controller for this system has the form 
F(a/- A 4 BfEC) ‘i , where A Hf and A-IC arc 
both stability malnccs, and f P, I ~ QC , with P^O 
and ^ 0 satisfying 

A P-^PA ~ PRB'P^C r-U (II) 

AQ^QA' ~QC CQ^ BB'-O (12) 

respectively Our goal is lo present an idea for designing an 
observer based controller We will show how lo obtain an 
observer based control system which has guaranteed stability 
margin And then we show how lo design an observer such 
that the output feedback system recovers the stability 
robustness of an LO stale Iccdback system 

Let P(s)= C(sl - A) ‘B be the transfer function 


I-act (ii) can be easily vcnlied by noticing r{^) S{s) — I 

□ 

It IS known from Ihcorcin 2 2 that stability robustness lor 
a MIMO feedback system is completely charactenzed by Iht 
inlinity norm of the sensitivity function So we focus on 
sensitivity function analysis It one uses an 1 OG controllei, 
then the infinity norm of the sensitivity function is 

IIVr)|| -\\[l-hGl- A + Rh) 'B\ 

X(/ + r(i/ 4 + yt) 'yyHI.. (iM 

and it can be computed by an existing method it A and I are 
known (Boyd et at 1^)8^) The invt rst. ol ||^(^)|| is the 
simultaneous stability margin r,, From (17) the infinity norm 
of the sensitivity function is hounded by 

iis(oii.'ii(y H^i-A + Bh)'inw 

X in/ + f(?/- 4 4 yc) 

^IH/4 f(,/-44L() 

(using Kalman inequality) The next lemma is a special form 
of a result due to Mageirou and llo (I'JTV) which is useful in 
deriving the mam result in this section 

Lemma 7 1 I et A B AC and / (identity matrix) be matrices 
of compatible dimensions Then the lolluwing stalerricnls are 
equivalent 

(i) A IS j stability matrix and ||(/4 A(i/-A) 'B}|| 

l/r„ 

(ii) r,)* 1 and there exists a positive dchnite symmetric 
matrix A" ^0 such that 

A X ^ XA l-(^B + K){B X 4X)(' V") 4 / = () 

rheonrn 4 1 If 'he following two Riccjti equations 

A A'4 JfA - A'B/J A" 4 C X-0, (14) 

AT4 YA'-2YC 'CY + {YK' 4 R) 

y{KY+B'){'~'')4 l-i' (IS) 

have positive definite solutions X and Y, respectively, then 
the observer based controller L{sl - A + BF + LC ) 'E with 
A B'X and E = YC will be a stabilizing controller and 
the obtained feedback system will have simultaneous stability 
margin 

Proof Assume that the system is conirollabe and observ¬ 
able If (14) has a positive definite solution then A - BA is a 
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stability matrix and if (15) has a positive definite solution 
then the matrix A — LC us a stability matrix. By separation 
principle, F(s/ - A -h BF -h LC)~'L is a stabilizing control’ 
Icr. It is easy to see that (15) having a (xisitive definite 
solution implies 

||/ + f(j/-/l + Z.C) 'fl|u< *. ' (16) 

^) 

Hence the sensitivity function of the observer based control 
system satisfies 

IIS(s)IU^II{f-^ fXsf-A + LC) (17) 

Thus, by Theorem 2.2, the obtained feedback system will 
have simultaneous stability margin r(,. □ 

Another observation can be made from (13). If F and L 
are chosen properly such that F(a’/- A ^ LC) 'ff is very 
'small' then the .sensitivity function of the observer based 
control system will approach the sensitivity function of the 
system with state feedback. Therefore, the robustness 
property of a state feedback system is recovered. This is the 
key idea for robustness recovery. Wc will use the infinity 
norm to measure the 'smallness'. Then optimization 
technique can be used to design an output feedback system 
such that it recovers the stability robustness property of LO 
state feedback systems. 

Let us assume that the system (9), (10) .satisfies the 
following four conditions. 

(Al) (A, B) IS stabilizable and (C. A) is delectable. 

(A2) C and B arc of full rank, respectively. 

(A3) mSzp^n. 

(A4) The transfer function C{sl - A) '/t is left invertible 
and strictly minimum phase, i.e 

. \A -sI B 

C o 

for all Re (j) 0. 

The following result will be used in describing our 
observations concerning loop recovery and robustness 
recovery. This result is due to Mageirou and Ho (1977). 
Similar results with different settings can also be found m 
Petersen and Holloi (1988) and Zhou (1988). 

Lemma 3.2. Assume conditions (A1)-(A4) hold, then for 
any 0 there exists d q* > 0 such that the Riccati equation 

D D' y/1;' r'Y 

AY-^YA’-^Q^q^-^-YC'CY^-. -0, (18) 

0 q^d 

has a po.silive-definite symmetric solution lor all q ^ q*^ 
where () is a given positive definite matrix. Furthermore, 
with L defined as L: - YC\ the transfer function 

G(5);-f (.s/ ~ A 4 LC) 
satisfies the bound 

IK-^(/f^^)ll. ^ 

Remark. Lemma 3.2 means that given any t) >0, Riccati 
equation (18) has a positive-definite solution provided q >0 
is chosen to be sufficiently large. It al.so means that if 
(A1)-(A4) arc satisfied, then ||G(/ai)||., can be made 
arbitrarily small by choosing L properly. Denote by 5’^, the 
sensitivity function of an LO state feedback sy.stem, and 
denote by the sensitivity function of a feedback .system 
with observer based controller. 7'hcn, from the previous 
discussion, it follow.s that 

SJs) = l/ Kix)P(s)] ^--{l-F(sl-A+BF) '«} 

X (I^F(sI -A 4 LC) ^B], (19) 

SJs)^{I-F(isl-A + BF) 'B). (20) 

To make the stability robustness of an observer based control 
system recovered to be that of a state feedback system, one 
needs to choose L (ob.server gain) suitably; such that A - LC 


is a stability matrix and ‘^wrlU minimized. Note that 

\\S„^.-SJ\^=\\{I-F(sI-A + BF) ’fl) 
x{F{sl-A + LC) 'B}1U 
^\\l~ Fisl-A+BF) ‘fl|U 

X ||F(j/-/l + LC) 'fl|U 

^\\F(sI-A + LCy'B\\^. (21) 

To minimize ||-5, hs ”‘i^rllcr, it is sufficient to minimize 
\\F(s/ - A 4 LC)' Rll,, by choosing L appropriately. This is 
a typical H infinity control task. From inequality (21) several 
observations can be made which are useful. 

(A) If one can find a L such that ||F(5/ - A 4 LC) can 
be made as small as possible subject to A - LC being a 
stability matrix, then the stability robustness property, 
which an LQ state feedback control system has, can be 
recovered as completely as possible. 

(B) From Lemma 3.2, if the Assumptions (A1)-(A4) are 

satisfied, then there always exists L such that ||F(.t/-A-l- 
LC) is arbitrarily small with A - LC a stability 

matrix. 

(C) Let G^^{s) be the loop transfer function of a feedback 
system with LO state feedback and G,,f(.0 be the loop 
transfer function corresponding to the observer based control 
system. Suppose that (A1)-(A4) are satisfied and G\,(5) and 
G„jfj) have no poles on the jw axis, then approaching 

in terms of the L,, norm implies that G„(.v) approaches G„,f.r) 
since 

^ lll^ + ^ob^] “ ^mIIU 

-ll(/4-G.)(5;,b>- a;,)(/4g,)iu 

'■ + ii(5;,hs-a\,)imi(/4Gjiu 

where A is a constant. Thus robustness recovery implies loop 
recovery. 

(D) If the plant i.s ol non-minimum phase, then there does 
not exist an L such that ||F(.s/— A 4 L(') 'B\\^ is made 
arbitrarily small with A - L(' a stability matrix However one 
can iteratively solve for L such that - A 4 LC) 

is minimized subject to A - LC being a stability matrix. The 
minimum error hound between 5',,, and can be therefore 
obtained and the degree of loop transfer recovery can be 
characterized 

From the above observations, the following important 
facts are apparent. 

(1) If the model of the plant is of minimum phase, then the 
stability robustness property of an LQ stale feedback control 
system possesses can be achieved asymptotically by an 
observer based control system provided that ||F(.v/-A4 
LC) IS made arbitrarily small, where Fis the LQ stale 
feedback gain matrix and L is the filter gain properly cho.scn 

(2) If the model of the plant is ol minimum phase, then loop 
transfer recovery can be achieved by an optimization 
approach. 

(3) If the model is of non-minimum pha.se, the observer 
ba.sed conlrtil system can never achieve the stability 
robuslnc.ss prtiperty that a full LQ state feedback control 
system has, i.c the simultaneou.s stability margin is always 
less than one. 

(4) If the model is of non-minimum phase, one can also use 
the LTR technique suggested here. The advantage of this 
approach is that it provides a way of minimizing the error of 
recovery, which is the key difference between our method 
and the existing ones. However, the lower error bound can 
never be zero. Its value depends highly on the structure of 
the non-minimal phase zeros. 

We now introduce a new two-step LQG/LTR design 
procedure as sugge.sted by the above. 

Step (1). Design an LQ regulator by solving for the positive 
definite solution X of 

A 'AT 4 ATA + N'N ~ XBB'Xjp ^ 0. 

where N and p are free design parameters and (A, N) is 
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a detectable pair, with desirable loop transfer functions and 
A - BF ^ stability matrix, where F B'Xfp. 

Step (2). Design a sequence of filters by solving 

AY YA‘ ^ Q q^~^-YC'CY+^^^^ 

where d is a small positive number, Q >0 and q is used to 
help search for solutions, with A ~ LC d stability matrix and 
L^YC\ When the plant is of minimal phase, 6 can be 
chosen as small as necessary. 

The compensator produced by these two steps has the 
following well-known form: 

K(s)-^ F(5lBF ^ LC) ^ L. 

Rentork. 4.1. The LOG/LTR approach suggested here is 
essentially a combination of and fT optimization designs. 
In Step (1) one solves an optimization task and in Step 
(2) one solves an optimization task If the design 
objective is control, then in Step 1 one can choose F 
using state feedback techniques. 

Remark 4.2. The two-step LOG/LTR design procedure has 
its dual form, i.e. one can design the filter first via filter 
Riccati equation and then design a .sequence of l.Q 
regulators by //“ optimization. 

Remark 4.3. This LTR procedure can also be used to 
recover a full state feedback //" control loop transfer 
function. 

4. Illustrative examples 

Two examples are offered below to demonstrate how to 
design an observer based control system using the suggested 
approach such that the closed loop system has good stability 
robustness in the sense of stability margin. Example 1 deals 
with a minimum phase controlled system and Example 2 
considers the case in which the controlled system is of 
non-minimum phase. 

Example I. Consider a controlled system as given by the 
transfer function 

^ ^ j ‘ -f 12 j 4- 20 

“ .? 4 7V2j^^Vi3 4s' 4 2.4 ’ 



FiCi. 3. Bode plot of target loop and recovery loop. 



Flo. 4. Nyquist plot of systems with stale feedback and 
output feedback 


which is of minimum phase. We want to design an observer 
based controller such that the feedback system is stable, has 
good stability robustness and satisfies given p)erformancc 
requirements. By 'good stability robustness', we mean that 
the Nyquist plot of the observer based feedback sytem will 
be kept away from the point (-1,0), or it almost docs not go 
into the unit circle centered at the point (—1,0). By the 
‘performance requirements' we mean the target loop Bode 
plot is asymptotically recovered. By the two-step design 
procedure suggested in Section 4, the controller transfer 
function is found to be 

f'(i7-v4 4RE4Ln ‘L, 

where {A, B, C) are obtained as controller form realization 
of P(.y); F and L are the state feedback gain and filter gain, 
respectively with 

F = (6.2604, 54.6716, 87.0749], 

L-(150.3813, 1.1590, 0.1230]'. 

Figure 3 is the Bode plot which shows the target loop 
transfer function has been recovered. The solid line in Fig. 3 
represents the Bode plot of the target loop with state 
feedback and dashed line is the Bode plot of loop transfer 
function with output feedback. Figure 4 is the Nyquist plot 
of the system with state feedback and observer based output 
feedback. It shows that the Nyquist plot with observer based 
output feedback (dashed line) only just goes into the unit 
circle centered at (-1,0) thus the observer based control 
system has good stability robustnes.s. 

Example 2. We now consider the case in which the 
controlled system is of non-minimum phase type. Let the 
controlled system transfer function model be 

P(s)^ __ 

" ' .t'' + 7.2j=+13.4,1 +2.4' 

Following the same procedure as in Example 1, we again 
obtain the controller transfer function to be of the form 

FJsl-A„ + B„F„ + L„(„) 'L„, 
where the state feedback gain and filter gain are as follows 
F„ = [45.8(KK), Bl-STSO, 14.7394], 

/.„= 10^1-8.99.51, -0.0934, -0.0003]', 

and (y4„, C„) is from a controller form realization of 

Pnis). 

The Bode plot of loop transfer function with stale feedback 
and with output feedback is shown in Fig. 5, respectively. 
The solid line is the Bode plot of target loop transfer 
function with stale feedback. The Nyquist plot of the system 
with stale feedback and observer based output feedback, 
respectively is shown in Fig. 6. The solid line is the Nyquist 
plot in the stale feedback case and dashed line is the Nyquist 
plot when a suitable observer is used. It is shown that when 
the controlled process is of non-minimum phase, the 
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FiCi. 6. Nyquist plol of systems with state and output feedback.. 


robustness property that the state feedback system has 
cannot be recovered by observer based control strategy, 
which also implies that LTR can not be achieved as good as 
desired. However, one can minimize the recovery error by 
applying the method we suggested here. Since we have 
taken care of the sensitivity when we design the system, from 
the Nyquist plot in Fig. 6, one can .sec that the output 
feedback system still has reasonably good stability robustness 
even though the system under control is of non-minimal 
phase. 

5, Conclusion 

It has been shown that the infinity norm of the sensitivity 
function of a closed loop system indicates stability robustness 
of the system in the sense of gain margin and phase margin. 
This result applies to both SISO systems and MIMO systems 
and therefore bridges the gap between characterizing 
stability robustness of SISO systems and of MIMO systems. 
Based on this result, we have suggested a way of designing 
an observer based control system which has 'good' stability 
robustness properties. If the controlled system is of minimal 
phase, we conclude that an observer based controller can be 
found with our approach such that the Nyqui.st plot of the 
output feedback system will only just go into the unit circle 
centered at (-1,0) in SISO case. If the controlled system is 


of non-minimal phase then one can never design an observer 
based control system to achieve this. We also suggest another 
way of doing an LOO/LTR design. It should be pointed out 
that the recovery quality is closely related to the stability 
robustness. 
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Stability Robustness of the Continuous-time 
LQG System Under Plant Perturbation and 
Noise Uncertainty* 

J. S. LUOt and A. JOHNSONt 

Key Wonis Robustness; stability; linear optimal rcsulator; optimal control: state-space method. 


Abstract -- In this piipcr the robustness of the continuous* 
time problem is studied. The system to be controlled is 
described by the usual stale space (ormulation that includes 
the structure of plant parameter perturbations and noise 
uncertainty. A simple method is given to prove the saddle 
point inequality of the closed*loop system performance to 
cope with noise uncertainty One suflicienl condition, which 
IS applicable to general linear systems, for system stability 
robustness under plant perturbation, is presented 

1. Notation 

A\f^]B 'IS applied clement by element to two rnairiees 
A B. A ~ B IS a positive semidefinile matrix 
||u|| is the norm of a vector a (||fi|| "). 

||A|| is the spectral norm of matrix A (Mil ^ 

A* is a matrix obtained by replacing the entries o[ A with 
their absolute values. 

2. Introduction 

Standard LQCj SYSitM design is based on the assumption 
that the plant parameters and noise covariances arc exactly 
known But unfortunately both the plant parameters and the 
noise covariances arc often approximations and subject to 
changes or are slowly varying. Vhc existence ol plant 
perturbation and noise uncertainty may degrade the system 
performance or even dcslabilii^e the controlled system. 

Recently C'hen and Dong (19S9) proposed a state space 
approach to robust LQCj control design lor a continuous 
lime plant with norm bounded uncertainly in the parameiers. 
llieir first step, based on minimax theory developed m the 
works of Looze el ul. (19H3), Martin and Mint/ (1983) and 
Verdu and Poor (1984), employed the minimax control 
scheme which minimizes the worst-case performance to cope 
with the noise uncertainty. In their second step, based on the 
Bcllman-Oronwall inequality, they derived a sufficient 
condition for robust stability of the l.OCi continuous-time 
system under plant perturbations. In this paper we will 
follow the same procedure as adopted by Chen and Dong to 
solve the complete problem. After describing the conven 
lional continuous-time LQG control problem in Section 3, 
we characterize the plant perturbation and noise uncertainly 
description in Section 4. In Section 5 wc assume that the 
plant model is correct except that the noise covariances arc 
uncertain. After the problem of noise uncertainty has been 
tackled, in Section 6 we derive a suDicicnl condition for the 

* Received 4 June 1991; revised 29 January IW2; revised 
17 June 1992; received in final form 8 August 19<J2. The 
original version of this paper was not presented at any IFA(’ 
nieciing. litis paper was recommended for publication in 
revised form by Associate Editor E. Kreindicr under the 
direction of Editor H. Kwakernaak. 

t Kramers l.aboralory, Delft University of Technology, 
Prins Berhardlaan 6 , 2628 BW Delft, The Netherlands. 


stability robusincss ol the continuous-time LOG problem 
under plant perturbation. 

A.S opposed U) Chen and Dong (1989), in this papei the 
plant perturbation and noise uncertainty are given in a more 
convenient form. In Section ‘i, instead ol employing the 
minimax theorem wc give a simple proof ol the vaddle point 
inequality nt the closed-loop system performance, and in 
Section 6 we derive a sufficient condition lor system stability 
robustness which can be applied to a general continuous-time 
linear system and which is Ic.ss conservative than the 
conditions proposed by Chen and Dong (1989) and by Sobcl 
el al. (1989). In Section 7 we give an example to illustrate 
that It IS less conservative and this is followed with some 
conclusions in Section 8. 

3. Description of the conventional continuous-time l.QCl 
control problem 

Consider the continuous-time stochastic dynamic system 


described by: 

jL(f) ^ Ax(0 + Bu(/) + iv'(0. (3.1) 

y(/)-(x(0 + i'(0 (3.2) 

It IS assumed that the noise processes iv(0 and ri(0 are 
stationary white Gaussian with the lollowing properties: 

f:{»v(r)} . -- n. (3.3) 

Co\{w{i), M-(s))= Wb{t-s), W--0, (3.4) 

Cov (i>(/). i'(.s)) - Vri(t - .s), V -0, (3.5) 

and (w(t)} and (i'(f)} are independent of each other 
The performance index to be minimized is chosen as. 

J - hm E{x'{l)Qx(l) + u'(t)liu(l)}, (3.6) 

where 

« = /{'.-(). (3.7) 


Throughout this paper the following assumptions arc taken 
to hold (Johnson, 1985). 

(1) V >U. 

(2) {A, C) IS detectable. 

(3) (A, W^'^) IS stabilizable. 

(4) R -0. 

(5) (A, R) IS stabilizable. 

(6) (A, ") is detectable 

I’hen a unique steady-state LOO regulator exi.sts which 

produces a stable closed-lcmp and minimizes J in (3.6). The 
standard form of the steady slate LQG regulator is well 
known as follows 

u{t) - -Fi(/), (3.8) 

i^L) ^ Ax(r) H- Bu{t) + Hfy(/) - Cf (t)], (3.9) 

where 

F^n 'B^P, (3.10) 
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0 = PA+A^P + Q-PBR 'B^P. 

(3,11) 

H = 1C^V 

(-3.12) 

Q = A1 + + W - IC^V 'Cl, 

(3.13) 


4 . The plant perturbation and noise uncertainty description 
Assume that the actual continuous-time LOG system could 
be described by: 

x(t) = [A -H AA(01jr(0 + [fi + Afi(01w(/) + w(tl (4.1) 

y(/) = [C + Ar(0W0 + v(0. (4.2) 

where A, B and C are nominal parameters for LOG design. 
We suppose that bounds are available on the absolute values 
of the maximum variation in the elements of AA(0, Afl(r) 
and AC(t) and that AA'", A5"' and AC'" are defined as 
matrices with entries and (AC,;)„„. 

Then the plant perturbations are represented as: 

{AA(0: AA^(/)[=^] AA'" V/ a^O), (4.3) 

(Afl(/): Afi^(f)[--s) AB'" Vf ^ 0}. (4.4) 

(AC(0: AC^(t)[<] AC^ V/ ^ 0). (4.5) 

Meanwhile w(t) and v(t) in equations (4.1) and (4.2) are 
stationary and independent Gaussian white noise, defined as: 

£{ h .( 0 } = £{ v ( 0 }= 0 , ( 4 . 6 ) 

Cov(H-(/). w(i)) = (VV +AW')d(/-i), 

W' + AiV = (lV +AH')'aO, tV = H''20, ' ' 

Cov(i;(t), i.(j)) = (V + AV)d((-5), 

KH-Al/ = (K +AV)''aO. l/=V'''>0. ^ 

where W and V are nominal noise covariances for LOG 

design. Suppose that upper hounds are available for the 
actual covariances, i.e. 

(AW: W-HAWS W"'}. (4.9) 

(AK: (4.10) 

where W'" >0 and V"" >0 arc symmetric matrices. 

Figure 1 shows the actual continuous-time LOG system 
with the assumed plant perturbations and noise 
uncertaintie.s 

j x( t ) « ( A4-AA(1) Jx( t )+( B + AB( t ))u(l )+w(t) 

u(t) I y 11 ) = ( C-^ACU ) )x ( L ) ■» V ( i ) I y(t) 

r H I I 

J Cdv(w( t ) .w(s) )«(W-*.AW)S(t-5 ) [ 

i j Cov ( V ( t ) , V Is ) )» ( V ) fi ( f - 5 ) I I 

I PLANT ' 


L . !-Fx( ll 1 ^ _ I x(l)*Ax(t) + Bu(l)4'H(y( t)-Cx(t)) f- - ' 

I ^ J 1_ _ ^ . _ _ _ ^ J 

FLEDBACK KALMAN FILTER 

Fig. 1. The configuration of the closed-loop system. 

5. Robust continuous-tirne LQG design under noise 
uncertainty 

In this section we assume that the system parameter 
matrices A, B and C are correct (i.e. actual and design values 
are the same) and the noise uncertainty is defined as in 
(4.6)-(4.10). Under such assumptions, the optimal controller 
design assuming complete state information is still the 
standard optimal controller design described by (3.8), 
(3.10) and (3.11). However the Kalman Filter (KF) designed 
by using the nominal noise covariances may act very 
differently from what we would expicct as a result of the noise 
covariances mismatch. 

llie standard KF design with nominal noise covariances is 
accomplished using formulas (3.9), (3.12) and (3.13), while 


the actual plant is given by (3.1) and (3.2) with (w(r)) and 
{u(0} described by (4.6)-(4.10). 

We define: 

jcV) = x(r)-i(0. (5.1) 

and we subtract (3.9) from (3.1) and pul (5.1) into the result, 
so that we have 

i\l) = (A- HC)x'\l) + w(l) ~ Hv{t). (5.2) 

Consequently the actual differential equation for the error 
covariance is: 

fvo = (A- HC)Y"{i) + 2:"(/)(/4 - JHCy 

+ W + AW + H(V + AV)H’. (5..1) 

where H is calculated from (3.12) and (3 13) using nominal 
noise covariances W and V Sangsuk-lam and Bullock (1988) 
proved that under assumptions (1)“(3) in Section 3, as /—►□o 
equation (5.3) always converges to some constant which 
satisfies the following Algebraic Lyapunov Equation (ALE): 

0 = (A - HC)l!' -H 2:"(A - HCy 

+ W -H AW + H{V + AV)H^. (5.4) 

Moreover, from (5.3) wc notice that the estimation 
covariance is a monotonic increasing function of the actual 
noise covariance, i.e. if 

W-t-AW, > W-^ AW. and V-K AE,--E + AV,. 

and assuming , , 

!'(()) ^X'(O). 

then 

l\t) V/-() 

Now we consider the worst-case noise covariance 

2"'(/)-(A-//(’)r"(r) fr"(i)(A -//r’)' -H w'" + 

It converges to 1'" which is the solution ol the lollowing 
ALE 

0 = (A - //C’lr" -f r"(A -//C )' + W'" -h (5 5) 

and this is the upper bound of the estimation covariance of 
the KF under any determined H. Using the notation 
ZiH, W, F) to emphasize that I is a function of H. W and V’, 
we get 

Z(H, W 3- AW, V 3 AV)- Z{H, W"'. V"'). (5.6) 

It is a reasonable choice lor us to take the worst-case noise 
covariances W'" and V"" to design the KF which minimize 
the upper bound of the KF performance, i e 

I(//'"W'"V'")-.2:(//W'"V'"), (5 7) 

where //"* is the optimal gain matrix of the KF designed for 
the fixed pair of worM-ca.se noise covariance W"' and E"' 
while H could be any other gain matrix. Combining (5 6) and 
(5.7), we have the following saddle point inequality. 

2(7/'", W-HAW, E 3-AV')^ I(//'"W'"E'") 

<I(7/W'"E"'). (5 8) 

Now we investigate the LOG regulator, whose performance 
IS measured by (3.6). 

Theorem 5.1. The .selection of W"' and E"* to minimize the 
upper l>ound of the KF performance as a strategy for robust 
KF design is also a strategy for robust LOG regulator design 
which minimizes the upper bound of the LO(i regulator 
performance measured by (3.6). 

Proof, vSee Appendix 

Remark 5.1. The saddle point inequality for the maximal 
elements of the covariance uncertainties has been mentioned 
by Poor and Looze (1981) and Loozc et al. (1983) for both 
the KF and LOG systems, respectively, as special cases of 
minimax theorems. Our proof here, when W"* and E"' arc 
available, is simpler and clearer. Moreover, wc give the 
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proof of the following property. If 
then 



if c-^'n„c'^’'dr)air('.' 

i [ e'"'n,c^’'dr) 


J[) / \ 

J„ 1 


where Uq, and S arc all symmetric positive semidefinite 
matrices. This property was used by Poor and Lx)oze (1980 
and LiM^ze et al. (1983). In the authors’ opinion, the proof of 
this property is important in understanding Theorem 5,1. 

6. A condition for stability robustness 

From the above discussion wc may conclude that, undei 
the stated assumptions, noise uncertainty has some influence 
on system performance but does not affect the asymptotic 
closed-loop system stability. However, the plant perturba¬ 
tions described by (4.1)-(4.5) may well cause instability of 
the system. In the following discussion, we derive a condition 
for stability robustness. 

Putting (3.H) into (4.1) we obtain 


i(0 = M 



+ lAv4(/) 


-A/i(r)F| 


x{t) 

f(0 


+ w(i). 


( 6 . 1 ) 


where /’ is obtained from (3 10) and (3.11) Putting (3 K) and 
(4.2) into (3.9) produces; 


i(f)-[//r A-Bf-nC] 
l-^U) 


4 \HAC(l) 


x(0 


xit) 
XU) 


and combining (6.1) and (6,2) gives 


jf(0 


A BF |Lt(0 

lie A - BF - Hi f(0J 
^AU) -AB{t)F\\xU) 

HACU) 0 J[.x(/) 


/ 0 K'(0 

0 H ii(/) 

Next wc define the following 

m- 


jr(r) 

r; A 

A 

- Bl 

i(0 

, Kf — 

HC 

A - BF HC 


AG(r)- 


A>t(0 -AB{t)F 
//AC’(0 0 


(6 2 ) 


(6.3) 


implies 

P'0I|s||A#||||2(0II-0. (6.8) 

So instead of jr(/), we could consider the .stability of z(t). 

Applying the similarity transformation (6.6) to (6.5), wc 
get 

z(t) = Az{t)-h M ' AGU)MzU), (6.9) 

with the solution 

2(r)-e^'2(0)-i- f ^ AG(T)M2(T)dT. (6.10) 

Wc know 

VraO, 

where 

A'‘ - Re (A). 


Taking absolute values on both sides of (6.10), we obtain 
z*(r)- e'''.’(0)4-| c'"' ’‘M ' AG(T)A/z(r)dTj , 

|£l(e^')^z + (0)+ f (e''''-'V(A/-AO”( t)A/"z*(t)</t, 

■lo 

|-^'l.v,(0 = (c''')"^"(0)4 AG'(T) 

Al 

X M>,(T)dT, 


|£],V,(0 = e''"'z■"(()) 4 [ c''"" '\M ')" &G'"M*y,Jx)dT, 
A) 

Obviously, if 

MaxReA,[A"+(Af ')" AG^'M * ] < 0, (6.12) 

we have 

2 ^(f)—>0, V2(0)<'33 as 1—(6.13) 
i.e. the system (6.5) is asymptotically stable. 

Because 

max Re A,[A'‘+ (Af 

-niaxRc A,(A) + MaxA,[(Af ')'AG"’M^], (6.14) 


the sufficient stability robustness condition (6.12) which we 
have derived in this paper is less conservative than the 
condition of Sobel et al. (1986) which requires the right-hand 
side of inequality (6.14) to be negative. In the following 
Section 7 we give an example to illustrate that the condition 
(6.12) is less conservative than both the condition of Sobel et 
al (1989) and the condition of Chen and Dong (1989). 


AG^O- 


AA^U) 

H'A('U) 


AB ' U)F' 
0 


and 


Obviously 


ACr 


AA"’ 

AC 


AB"‘F * 
1) 


Ac;(r)[ -1 Acr (r)I 1 A, V/ - 0. 


(6.4) 


For stability it is necessary and sufficient to consider the 
following system 

i(/)-((;AG)i:(0- 


Now use the similarity transform M to obtain 


7. Example 

Wc employ the same example as that used by Chen and 
Dong (1989) with a slight modification of AA for the 
convenience of comparison. Tlie real continuous-lime 
stochastic dynamic system is de.scribed by equations (4.1) and 
(4 2) with plant perturbations and noise uncertainly defined 
111 Section 4. and 


A - 


-1 -3 

1.5 -2 


AA 


^1 '-■Cl' -K :]■ 

-»=[:]■ 

]■ 


AHO- 


■ cos(0 
0 


0 

0.1 sin (0 


no = Mid) 

Suppose M is ihe modiil matrix to translorm G to the Jordan 
matrix, i.e. 

M 'GAf-A, (6.7) 

where A is a Jordan matrix. Since ||A^||^o^, if ll-(0ll“*^^’ 


where f is an uncertain value varying in [-0.05 0.05] 


W = 

[' "1 

and V = [ ^ 

? 1 . 

Lo 2 ] 

Lo 

ij 

|- 0.1 

0 " 

SAW'S ' 

oM 

L 0 

- 0 . 7 j 

lo 

0 . 7 J 
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and 


The performance index is defined as (3.6), and the weighting 
matrices are given as 


-0.5 

0 ■ 


" 1 

0 

-0.9. 

Lo 

O.9J 


'-[o a 


and /? = 1. 


First we check the stability robust condition of Chen and 
Dong (198Q) 

fi = min ill,, H:.}>m(2a + 2fi ||f|| + y ||//||), (7.1) 


where 


-^, = Max ReA,(/t - fiF) - -1.940K. 

A , 


F = [0-3345 0.5471] and 




= Max Re - HC) = -3.4879, 
||A>t(f)||^l.l - a, ||AB(r)|| ^0.05- 
i|AC(0ll^n.2= V, 

1.7414 -0.0669] 
1.2191 0.2687 r 

and m = 1 is a certain appropriate positive constant satisfying 

Vt>0. 

The result is 

m(2cr + 2^ ||F|| f y ||//||) - 2.4215. 

Therefore the stability robust condition (7.1) of Chen and 
Dong (1989) is not satisfied. 

Next we check the stability robuslnes.s condition of Sobel 
et al. (1989) and the stability robustness condition (6.12) we 
have derived in this paper. The feedback gain matrix F and 
KF gain matrix H are calculated from (3.10)-(3.13) using 
nominal parameters A, B, C and worst case noise 
covariances 


W" 


\y M 

. ^ [2.5 0 

and V"" = 

Lo 2 . 7 J 

Lo 1.9 


We get 

F = [0.3345, 0.5471], 




7006 -0.0735] 

-0.2793 0.257hJ 


M 


The modal matrix defined in (6.7) is 

r 0.1518-0.11711 0.1518 + 0.11711 

0.0220 - 0 . 103 If 0.0220 + 0.10311 
0.6415 - 0.40151 0.6416-1' 0.40151 

-0.4271 -0.4437/ -0.4271 +0.4437/ 

0.6014 - 0.1209; 0.6014 + 0.1209i-| 
0.0367 - 0.3497; 0.0367 + 0.3497; 
0.6014 - 0.1209; 0.6014 + 0.12W; 
0.0367 - 0.3497; 0.0367 + 0.3497; J 

The definitions (4.3)-(4.5) lead to 
AyC 


1.1 O' 



0 

.0 0 .' 

L 0 J' 

1 0 

0.1 . 


To check the stability condition of Sobel et al. (1989) 


-a + jr[(M ')^ AG"'M^]<0, (7.2) 

where a =-Max Re A,(A) and 7r[(M ‘)^ AG'^AY^] is the 
Perron-Frobenius radius of non-negative matrix (Af 
We obtain 


-fl + jr[(M Ar;"’M^]:= 0.2180. 

The sufficient stability robustness condition (7.2) of Sobel et 
ai (1989) is not satisfied. But 

MaxRe A,[A" + (M ')* AC^'Af ^ ]--0.0140, (7.3) 


the sufficient stability robustness condition (6.12) we have 
derived in this paper is satisfied, so the system is stable. 

8. Conclusion 

In this paper we have considered the continuous-time 
LOG problem. First wc have given a direct and simple way 
to prove the saddle point inequality of the LOG system 
performance, to cope with the noise uncertainty, for robust 
LOG regulator design. Secondly a new sufficient condition of 
stability robustness for the continuous-time LOG regulator 
subject to plant perturbations has been presented in a 
convenient form of the uncertain plant parameters. The 
result is applicable to general linear .systems. An example has 
shown that it is less conservative than two previous 
conditions presented by C hen and Dong (1989) and Sobel et 
ai (1989). 
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Appendix proof of Theorem 4. 1 
From (3-6) 

lim E{x’ {t)Qx[t) ;4^(f)Fu(f)} 

t -' 

= lim/f{.r'(»)C>.r(/)+ (/•«(/) - - /■■i(t))) 


= lim £.{t'(f)(C + F'RF)x(i)- x'U)F'RFx{t) 

t » 


-x'(0F'RFx(/)-n'(i)F'RFx(/)j 


- lim 
Xf|[ 


(J f'rf 

-F'RF 

xU)x'(i) 

mxUi) 


-F'RF] 
f'rf \ 
jr(()i'(/)]l 
x{l)x'{l) Jj’ 


(A.l) 


I’hen defining 



xit) x{t) 




xU)x'(l) I 


From (3.1), (3.8), (3.9), (.‘).l) and (.‘>.2), we obtain 
BF IfjrlOl r ‘*'(0 

U(r)J [ 0 /I - WC 11 i(()J lw(()-//!-(/) 

Next wc define 


(A,2) 


BE 

L 0 A~HCy 


QF F'RF 
-F'RF 


-F'RF] 
F'RF y 



and 
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^^rw + AlV W + AW n 

Lm^ + AW W + AW + H(V' + AV)H’J' 

Under conditions (l)-(6) in Section 3, with F and // 
calculated from (3.10)-(3.13), all the eigenvalues of A lie 
strictly in the left half of complex plane, and therefore from 
(A.2) wc have 


S(^)= [ 


dr 


(A.3) 


If wc assume 


and 


then since 


AWr AW^- 


Al^,-AK= 


we obtain 
Ail 

and 


(r a- - 1 : 


TAW 

dw 

1. 


dW\ 

[dW 

dW ^ H dVH' ]' 

■“ 1 dW 

f^W] 


0 




no 

HdVH' 

-M). 


(A.4) 


Ail - AQ', 




i.c. A£2 is a positive semidchnitc symmetric matrix. Similarly 
S IS also a square symmetric positive semidelimic matrix So 
there exists a square root of .S’ satislying 

= .S''-,S'' - -.V, (A.5) 

and therefore 

f)J = - J, = tr ,S'E|('y) - tr 

- Ir.VfEif*-) - E,,(^)) 

^tr.S'f e-^'Aik'-^'Mt 


Sf‘^‘ACic'*’' dt 


Bexause 


we have 


...f 

= f trAV-^'Ane^^'di 

Jfl 

= [ trA‘'^e'’'Ane''''.S"'^df 

■'n 

= f tr(5''-c'"') A£2(3'V')^dr. 
Al 

(i”V') AQ(.V''’e-"')' ^ t». Vl =? 0, 


tr(i''-e^') A£J(5''’c'^')’^ ^0, Vtad, 
and therefore 


I tr (.9''^') A£2(5''‘'e'’')’^ dr -» 0, 


y, aA, (A,6) 

which means that for any F and //, we will have the 

hdlowing inequality; 

J(H, VV -f AVV. V + Ay)fZj(/l W'^\ V"'). (A.7) 

If we choose the worst-case noise covariances W'” and to 
design the KF, the upper bound of the LOG regulator 
performance is minimized, i.e. 

JUr\ W"’, F"')-:7(//, IV'”. (A.H) 

where H'” is computed from (3.l2)-(3.13) using W"' and V"‘ 
and H could be any other matrix. Combining (A.7) and 
(A.8), we obtain the following saddle point inequality: 

7(7/'”, W + AW. V -f AV') -^7(/7'”, W”'. I/"') 

^.7(//, W'”. V"”). (A.9) 
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A.b 9 tract—In this paper, a relationship is derived between 
quadratic stabilizability of linear systems with convex 
bounded uncertainty domains and the existence of a po.sitivc 
definite solution to a well defined set of Riccati equations. 
Both continuous and discrete-time models are investigated. 
For continuous-lime systems, the re.sults rcptirled here arc 
compared with the ones provided in the literature, where 
norm-bounded uncertainty is considered A numerical 
example is included. 

1. Introduction 

It is wbll known that the Linear-Quadratic Problem plays a 
central role in linear control .system de.sign. In fact, among 
others, the controlled system exhibit.s several important 
properties as, for instance, the ones staled in Gcromel and 
Cruz (1987) and Safonov and Athans (1987). 

In thii^ connection, the results provided in Petersen (1988) 
and Zhou and Khargonekar (1988) arc of great importance. 
The authors showed that the existence of a positive definite 
solution to a modified Riccati equation is a necessary and 
sufficient condition for quadratic stabilization of uncertain 
syslems by means of linear state feedback control. As a basic 
assumption, both papers deal with uncertain, continuous- 
time linear systems in norm-bounded uncertainty domains. 

Iliis paper provides similar re.sults for linear uncertain 
systems with convex-bounded unccrtairilies Boih continuous 
and discrcle-Umc syslems are analysed and no additional 
structural assumptions are taken into account (e.g. matching 

conditions). , 

We claim (and we discuss this point by means ol a simple 
example) that the convex-bounded uncertainty assumption 
can tackle most practical problems at least as well as the 
norm-bounded one. In fact, an important special case o 
convex-bounded uncertainty is the well-known interval 
matrices case. 


discrete-time ones. Associated with (1), we define the 

extended matrices (Barmish, 1983), F c and G e 
A 

p = n + m: 


A 



.0 

oj’ 

L/J 


Matrices A and which define the model under 

consideration are not precisely known. In fact we assume 
that matrices A and B are uncertain and only their 
dimensions (n. m) are exactly known. The uncertainties arc 
modeled in two different ways. 

Norm-hounded uncertainty. For matrices D, E and F with 
appropriate dimensions, we define the uncertain domain 
as being (Zhou and Khargonekar, 1988): 

f = fj, + OVE' liril 1). (3) 

Convex-hounded uncertainty. Given a set {F,, t - \ ■ M) 

of “extreme” matrices we define the uncertain domain as 
being (Geromel et al., 1991): 

6 sr"" -C == X ^ 0; 2 A, = 1 j. (4) 

Relation (3) Jefines an ellipsoidal type of uncertainty 
domain and (4) a polyhedral one. The important case of 
interval matrices can be exactly modeled by He with 
anpropriale choice of the extreme matrices i = 1 • ■ • M). 
The same no longer holds for Indeed, the matrix 

con.straini ||ri|i 1 introduces some dependencies among the 
parameters, leading in this case to an approximate 
representation of the uncertain domain. 

In Zhou and Khargonekar (1988), il is proved that the 
continuous-time system (1) is quadratically slabilizable by 
linear control for all F e if and only if there exists OO 
such that the modified Riccati equation 


2. Preliminary considerations 

Let the uncertain linear system be defined a.s 

/)ljt(()] = .4jr(l)+ «»(/). 

where x( ) e :K“ is the state variable, u( ) 6 ;ir ^ 
vanable and d[ l is a linear operator hy^-^UfOl “ 

for continuous-time .systems and <^1-^(01 'v 
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F;^PPE,-P(GR ^G' - cOD')P ~^-E'E^ Q - (5) 

has a positive definite solution, where Q and R arc arbitrary 
positive definite matrices. Assuming that a P e uT‘**l'* 
satisfying (5) exists and partitioning its inverse in four blocks. 

( 6 ) 


[wi IV, J 


where W, e in'”*". W, 6 and IV, e a linear state 

Teedback stabilizing control as «'(')= Kx(0 « readily 
determined from (Zhou and Khargonekai. 1988): 

K = IV^JlVr <2) 

uilrlition with the Lyapunov function i;(*) = *'W', 'x, the 
sUbSSv of’the closed'loop system can be verified for aU 

Fe%- 

t Riccati eauatim for aic uncertain domain 

In this section we present the mam results of this paper. It 
is prov^d'that. for continuous and discrete-time systems with 
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uncertainties belonging to ^ the concept of quadratic 
stabilizability via linear state feedback is equivalent to the 
existence of a positive definite solution to a set of algebraic 
Riccati equations. To this end, we first recall a result of 
Geromcl et al, (1991), which states that the uncertain system 
(1) is quadratically stabilizable by linear control for all 
F € if and only if there exists iv = W' e such that 
(see also Meilakhs (1978) for systems where only A is 
uncertain) 

r^WT>[}, 

T’BXW)T<.0, / = 1 M, (8) 

W^i), 

where 7 e is an orthonormal matrix spanning the null 
space of G' and B,^ ), / - 1 ■ ■ • Af are linear matrix functions 
defined by 0,(1V) - F,W WFI in the continuous-time case 
and 0^1^) = F,WF,'- W in the discrete-time one. 

For W E partitioned as in (6), it is easy to verify that 
in the continuous-time case, the first M 1 constraints in (8) 
do not depend explicitly on Wv Consequently, without loss 
of generality, can be calculated in such way that W '>0. 
However, the same is not generally true for discrete-time 
systems. 

Theorem 1. The continuous-time system (1) is quadratically 
stabilizable by linear control for all F s if and only if 
there exist positive definite matrices Q, e / = 1 - Af 

and R e such that the set of Riccati equations 

F;P^ PF,- PGR Y;T-h0, = (). z -l - Af. (9) 
have the same pt)sitive definite solution P -- P‘ >0. 

Proof. Assuming that 7*-7*'>0 verifies (9) and defining 
W-^P '.we have 

7’'0,(VT)7 = T'{GR 'C' - WQ,y/)T, 

<T GR 'G'r = (). (10) 

The last equality follows immediately from the definition of 
T Since W = P ' >0. all conditions in (8) are fulfilled and 
the system is quadratically .stabilizable VF e . 

Conversely, suppose that for some W, conditions (8) hold 
From Finsler’s Lemma (Geromel and Cruz, 1987) there 
exists F ~ F’ such that 

0,(W)<. - GF 'G'. M (11) 

Defining positive definite matrices (7,, / - I M as 

Q, = W ^G'-S,(W)]W 1 = 1 W, (12) 

after simple algebraic calculations we can verify that 
P=W ' solves (9). This concludes the proof of the theorem 
proposed. 

Some remarks are now in order. First, if P satisfies the 
previous theorem, then (7) provides a stabilizing gain for all 
F 6 i/'f Furthermore, to each model F g we can associate 
a Riccati equation, with the same solution W - P ' To see 
this, define 

M 

( 13 ) 

I -1 

and multiply (9) by A, e J)f , and add terms to get 

F'P^PF- PGR 'G'F + 0 = U. (14) 

Note that this is possible since ElV i A, ^ 1. Unfortunately, 
the result stated in Theorem 1 does not allow by itself the 
numerical determination of the P matrix. Indeed, it just 
brings to uncertain systems control design the properties of 
the Linear-Quadratic Problem (Safonov and Athans, 1987), 
For numerical purposes concerning the determination of 
W = P it appears to be better to .solve directly (8) by 
means of methods described, for instance, in Boyd and 
Yang (1989) and Geromel c/ a/. (1991). 

We turn now our attention to the discrete-time case. In 
this sense, we need some results which can be easily proved. 


The first one is based on the convexity of the matrix function 
FWF' with respect to F for W s-O. Consequently (see 
Geromel et al (1991)) 

FWF' ^ 2) KF,WF:, \/F e (15) 

The second one refers to the following equivalence for 
F, K = V' > 0 and S = S' > 0 being matrices with appropriate 
dimensions: 

FVF’cS^F'S ‘F<F ( 16 ) 

Theorem 2. The discrete-time system (1) is quadratically 
stabilizable by linear control with W * 0, for all F e if and 
only if there exist positive definite matrices Q, e 
i={ ■ M and such that the sel of Riccati 

equations 

f;pf, -r- f;pg(r + o pc) 'cpf, + q, = o, 

i = l (17) 

have the same positive definite solution P - P' ^0. 

Proof. Assuming that F = /’' "0 satisfies (17), using the 
matrix inversion lemma we gel 

P > F;(P - PG{R + G'PG) 'G'P)F, 

= F;(P ' + GR 'G') 'F, I = 1 ■ ■ M (IK) 

On the other hand, defining W = P ' and using (Ih) in the 
above inequality, it follows that 

T'BX^)T <T'GR ‘G'F- (). (19) 

Since W = F ' is positive definite, (8) holds and the 
di.scrcle-time system (1) is quadratically stabilizable by the 
linear control law (7). Conversely, assume that (8) holds (or 
some W'^O. From Fmslcr’s Lemma, there exists a positive 
definite matrix R e such that 

0,(VV^)- GR, ‘G' GR 'G', i - 1 M. (20) 

So, F,WF;< (W + GR 'G'). f - 1 ■ ■ Af Using again (16) it 
follows immediately that 

W '^-F,\W^GR '(;■) 'F,. i-l - Af (21) 

Finally, defining the positive definite matrices (7, as 

(^^5 IV ' - F;(W f GR 'G') I - l ■ • W. (22) 

and applying the matrix inversion lemma to (22) we verify 
that actually P-W ' solves the set of Riccati equations 
(17). lliis concludes the proof of the theorem proposed 
Once again, in the diserele-lime case, a stabilizing state 
feedback gam is provided by (7) Furthermore, for any 
feasible model F e 1/f we have a Riccati equation associated 
to each of them 7o sec this, let us make use ol equation 
(15) in (17) to get 

rn Af 

XS A,/■;(/' ' + GR 'G') '/■; 

1-1 I I 

r P-F'(P ' 4 GR 'G') 'F 
-Q (23) 

This means, that it is always p(jssible to define for each 
Fe a positive definite matrix (7 satisfying (23) such that 
P - P' solves the Riccati equation: 

Q^P '^ F'iP ' 4 GR VF) 'F 
- F - F'FF 4 F7T;(R 4 G'PG) 'G'FF, VF c 7, 

(24) 

For numerical purposes, the algorithm proposed m 
Geromel et at. (1991) can also handle the diserctc-lime case. 
If one exists, a W g satisfying (8) and solving also (24) 
can be readily determined. 

For the sake of completeness, let us introduce a simple 
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example, consisting on the continuous-time model (1) where 

“o’]’ ® = (25) 

with or and fi being the uncertain parameters. The goal is to 
determine the largest bound y in such way Ver - 
(|a'“0.5|^y} the system remains quadratically stabi- 
lizable. The set is defined by the matrices: 



"o 

1 

p 

C/i 

0.5 


'l 

o' 


0.5 

0 

0.5 

, n = Y 

0 

1 


_0 

0 

0 


_0 

0_ 


; -:]■ t;j|. 


(26) 


Using the numerical method proposed in Zhou and 
Khargonekar (1988) which iteratively tests the existence of a 
positive definite solution of (5), the maximum value for y 
has been found: y;v = 0.271. Associated to this value, (7) 
provides the stabilizing gain 

= [9.8683 -30.0783]. 

At this point, it is important to add that for the given 
example the results of Zhou and Khargonekar (1988) arc 
only sulficicnt for quadratic stabilizability Indeed, the 
constraint ||r|| ^ 1 does not take into account the particular 
structure of matrix V which is in fact diagonal. 

Now, using the convex programming method propo.scd in 
Geromcl et al. (1991) wc have solved (8) for two different 
.situations: 

y = 0.271, a:-[-0.0233 - 1.1010|. 

y^y,^^ 0.363. K - [-0.28H4 -”5 9.34K] 

The value of y^ is the one which maximizes y. As pointed 
out before, y<^ > y/v Jind from Theorem 1 we conclude that 
the Riccati equation (14) admits a positive definite solution 
for all 0<ry*-y(. Clearly, this is an improvement when 
compared with the results of Zhou and Khargonekar (1988). 


4. Conclusions 

This paper establishes a relationship between the existence 
of a positive definite solution of a Rjccati equation and the 
quadratic stabilizability of uncertain linear systems. It has 
been proved that, for boih continuous and discrete-lime 
uncertain models in convex-bounded domains, a Riccati 
equation exists whose solution is invariant over the space 
defined by the uncertain parameters. 

Thi.s solution enables us to determine a stabilizing linear 
control law and brings to the uncertain system control design 
the well-known properties (c.g. robustness) of the Linear- 
Quadratic Problem .solution. 
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Transformation of Nonlinear Systems in 
Observer Canonical Form With Reduced 
Dependency on Derivatives of the Input* 
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Abstract —The transformation of nonlinear multi-input- 
mulli-outpul systems x = r(x, u), y = li(x, u) into an observer 
canonical form with reduced dependency on derivatives of 
the input is studied. Necessary and sufficient conditions for 
its existence and a straightforward algorithm for obtaining 
the canonical model are derived. The proposed method 
involves the solution of a nonlinear algebraic equation 
system and systems of first order linear partial differential 
equations. The nonlinear canonical form obtained permits 
global observer error linearization and it is a stage in the 
design of nonlinear observers The method is illustrated by 
an example. 


such that n, + + • -h = n. The system 

y, ^fi,(x,u), 
y, =DA,(x,u), 
y, =Z)"/i,(x, u), 

yl"' = ’/i,(x,u), (2-2) 

y2=- ^2(x,u), 
y2 = Dh.l%,u), 


J Introduction 

Thesymulsis Oh nonlinear observers has received .significant 
attention during the past few years. Tlie basic results lie m 
the transformation of nonlinear systems into observer 
canonical forms admitting observer error linearization 
(Krener and Isidori, 1983; Beslle and Zeitz, 1983; Krencr 
and RespKindek, 1985; Tsidori, 1989; Nijmcijcr and van der 
Schaft, 199(1). T ransformation procedures for some cla-sscs of 
systems arc developed (Zeitz, 1984, Keller, 1986a,b, 1987, 
Birk and Zeitz, 1988, Xia and Gao, 1989) Vhe two-slep 
transformation method proposed by Keller (1987) uses the 
generalized observer canonical form (GOCT) but it is not 
developed in the general case of nth order systems. 

A nonlinear observer canonical form with reduced 
dependency on derivatives of the inpui and an approach for 
the transformation of nonlinear mulli-input-mulii-outpui 
(MIMO) systems arc proposed here. 

2. Problem statement 

Con.sidcr the nonlinear MIMO system 

x = r(x,u), x(())^Xo- (2.1a) 

y = h(x,u), (2.1b) 

where x is a slate n-vecior, the input u is an r-vector, and 
the output y is an m -vector. The nonlinear vector functions 
f(x, ii) and l«(x. ■) are assumed to be real and sufficiently 
differentiable in the considered domains of x and u. The 
system can be time-variable if for example u, - f. It is 
assumed to be locally observable with observability indices 
^>0, I = 1, 2, . . , m, (Nijmcijcr and van der Schaft, 1990) 


(the observability map (Zeitz, 1984)) admits a solution of the 
type 


u = |u^u^, u^,. , (u^"^)^]^, i;^max(n,,n2 


X ~ x(y, u), 

where D = d/d/ and 

y = |y^y^y^ • ,(y''’ 

The observer canonical torm is 

z - Az-I- ■(y, u, u), z(0) = z,,, 

y = ci, 

where 

* ~ [-^l I I • ' t ^n\' ^ni i ] \ 2> > I 4 n 2 ' ' ' ' ’ 

a(y, u, u) = lfl,(y, u), fl,(y, u)-, 

.a™,+ ■ ■.‘«n(y. 

A and C arc block-diagonal n x n and m x n matrices: 


(2.3) 


(2.4a) 

(2.4b) 


rA. 

0 

0 - 

fc. 

0 

0 n 

0 

A2 

0 

0 

c. 

0 

. 0 

0 

A„,. 

0 

0 

■■ C,J 

ro 

0 ■ 

0 0-1 





1 0 
0 0 


0 0 
1 0 . 


C, [0 


0 1 ],. 


* Received 3 May 1991; revised 3 January 1992, revised 11 
1992; received in final form July 30 1992. The original 
'version of this paper was not presented at any IFAC 
ti^ecting. This paper was recommended for publication in 
revised form by Associate Editor A. Isidori under the 
direction of Editor H. Kwakemaak. 

t Control Systems Department, Technical University, 
Plovdiv, 18 Vasil Markov Sir., 4(XK) Plovdiv, Bulgaria. 


The proposed canonical form (2.4) differ from the GOCF 
for MIMO systems (Birk and Zeitz, 1988) in two ways—the 
output y depends linearly on the state z and the vector 
function ■ has a reduced dependency on the time derivatives 
of the input u. This form permits an observer of the kmd 

i = Az + ■(y, u, n) -I- K(y - J), 
j - CL 
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The observer error e = i “ i obeys the linear homogeneous 
equation e = (A ~ KC)e and the observer design can be 
achieved by pole assignment. 

If the system (2.1) is transformable into the form (2.4), 
then the transformations 

x=^T( 2 , u) with det ((Jr/i3i) ^ 0, (2.5a) 

i = T '(x,u), (2.5b) 

exist and the equivalence condition 

h(x, u) ~ Cz, (2.6) 

is satished for any u, Z(, = T '(X(,,u) or any z,,. u, and 
*<, = T(2,„ u). 

The transformation problem con.sists in finding the 
transformations (2.5) or the vector a(y, u, ii) directly. 

3. Necessary and sufficient rondtiions for transformahility 
Each of the m subsystems in the form (2.4) has a single 


ouput v,, state vector z, = ^. ^,1^ and a 

nonlinear vector function a, = [a^ f,,^^ ct,]’' 

its model: 







1 + 1 

S ' 


n, > 2. 

5-2 

J 



1, 2, . 

. . , m. 


where y'*' = /)*y. and = D^ h,. 

If the system (2.1) is transformable into the special form 
(2.4) then the systems (3.4) arc solvable and permit the 
corresponding unknown components of the vector function a 
to be computed. On composing the system (3.4a) the term 
Sajdu in the third equation has to be substituted by the 
right hand side of the second equation. 

It is well known that the systems of first order linear partial 
diflerential equations like (3.4) arc integrable if the mixed 
partials commute. The system (3,4a) is integrable if and only 
if 






s, k - \ ,1, , . . , m 2r, 


= + «,(y,ii.u), (3.1a) 

y, = 2 j, y = n,-f n, + ' ■ + fi,. i=L2, . ...m (31b) 

Let the system (2.1) be transformable into ihc form (2.4). 
The equivalence condition (2.6) holds and can be written 
componcniwi.se: 

I 

/!,(*, B) = z,. /-I. 2, ...m. (3.2) 

k I 

The sequential differentiation of (3.2) with respect to time 
and substitution in accordance with (3.1) reads 


Z)"-/i<(x, u) = O”' 'fl/y, u, u)-I-/2”' ‘a^ i(y.u) 

(3.3) 

The necessary conditions obtained are .similar to the 
generalized characteristic equation used by Keller (1987), but 
the left hand sides arc known through the original system 
(2.1) and the number of the unknown functions is one less. 

The vector x can be substituted by the solution (2 3) and 
so the left hand side of (3.3) bccumc.s a function of the 
vectors y and y. 'Fhcir components y,y,y. , y’”' and 
u, u, ii,. . . , can be accepted as generalized arguments, 
llic derivative /3"' 'fl^(y,o,u) depends on all of them. It 
has such a structure that the arguments y''’' “ and arc 
included linearly with coefficients 3ajdy and 3uJ3u, 
respectively. The next derivative O'"' i(y,u) docs not 
depend on y^'*' and Thus the terms 

fr‘ j^.^,(y,u). A:-2, 3, . , n, depend only on y,y,y, 
yin, A) ^ y, u\ . . . , u^"' and also linearly on y'"*' 
and ** Due to the structure of these derivatives and 
their specific dependence on the generalized arguments, the 
differentiating of (3.3) with respect to them yields the 
following equations a.s other necessary conditions. 

ao,(y,o, u) a/il"''(y,u) 

5y ■ 

3a,(y,a, i) a/i,'"''(y, u) 

du 


aa,(y,ii,u) 5/i!''''(y,u) 



where w -- |y^ u^ u^|^. /).*(w) = Ju,/9h'j or 


ilw) 


ah{"''(,) 
■" ■ 




ah;'''’(.) 


' f m \ I 


(w) ^ 




rhe intcgrability conditions for the .systems (3 4b) are 
analogous. 

Using the observability map (2 2), (2 3) and the 

necessary conditions (3.3) and (3.4), the lollowing theorem 
can be formulated 


Theorem 3 I The system (2.1) is translormable in the 
canonical form (2.4) if and only if. 

(i) there exist observability indices n, -0, t - 1,2, , m, 

/I, + ^2 3- ■ + ■“ n for which the observability map (2 2) 

has the solution (2.3); 

(ii) the systems of first order linear partial differential 

equations (3 4) arc solvable with solutions ol the kind 
fl,(y.u,u), „,,<(y,u); 

(ili) the functions 


0, = S/>'" 

; ^ n, + n, 3- • 'n,, i“l,2, 


(3 S) 


arc nonmull functions of the urgunicnts y and u. 

Proof. 

Necessity. The necessity of conditions (i) and (ii) follows 
immediately from the validily of the necessary conditions 
(3.4). The expressions (3.5) arc a diflerenf record of the 
neces.sary conditions (3.3). 


Sufficiency. I.el conditions (i)-(iii) hold From the condi¬ 
tions (li) and (iii) it follows that the canonical model is 
completely defined. The sequential differentiating of (3.1b) 
and substituting m accordance with (3.1a) and (2 4b) yields 
the following expressions for the generalized variables 
.> 1 ”' >' 2 . • • ■ ..V? ■ ’’ 

y. = ^r t + X '"..I 

t I 

x(Cz,u)3-D* 'fl,(Cz, u,|j), (3 6) 

/ - 1, 2, . . . , m, y n I + ^2 ■ 'f '‘m ^ ~ I - 2, . - , , n, “ 1 


Obviously, the generalized variables arc expressed as 
functions only of z and u. I'hc substitution of expressions 
(3.6) m (2.3) immediately gives the straight transformation 
(2.5a). Therefore the system (2.1) is transformable in the 
form (2.4) and the equivalence condition (2.6) holds. If Cz is 
replaced by h(x,a) in expressions (3.6) and the latter arc 




Brief Paper 


497 


substituted in the left hand side of system (2 2) then the 
inverse transformation (2,5b) is directly determined in the 
form: 

k I 

2 / * = 0 *h,(ii,u)- 2 D' \,,(h(»,u).u) 

I 

- [>* 'a/h(x. u). u, u) =- r, ^(X, u), 
z, = /!,(», u) = r, ‘(x.u), 

( = 1,. . , m, ; = n,+n,+■ • • + A = 1,2 .- 1 , 

The same form was derived by Keller (1<)K7) for single 
output systems. If in (3.7) the vector x is represented by 
(2.3), then the vector z is expressed as a function of the 
generalized variables, too. So the transformations (2.5) exist 
and are invertible 


4. Transformation algorithm 

The necessary and sulhcicnt conditions of Iheorem 3 1 
determine the following algorithm for transformation of 
system (2.1) into the form (2.4) 

(i) Find an m-tupic of integers n, ^ F 2, . , m. with 

n\ 02 + ■ ■ ^ n, for which the .lacohian matrix ol the 

observability map (2.2) with respect to the x vector (the 
observability matrix) has rank ri. If such indices do not exist 
the .system (2.1) is not ob.scrvablc and not transformable. 

(ii) Find the solution (2 3) of the system (2.2) and replace it 

in the derivatives 2)'’'/i,(x, u), / - 1. 2. . , m, to turn them 

into function.^ of the generali7ed variables y, u. 

(iii) For i ^ 1, 2, . . . . m, if n, -- 1 then go to (iiih), else go to 

(ilia) 

(ilia) Form the system (3.4a) and if il is solvable find the 

I 

solution a^,j ^ n. If n, *2 then lorni the systems (3.4b) 

and il they are solvable (ind the solutions 

n,- I, n, - 2, .... 2. System (21) i.s nontriinslormalilc it 

any ol the systems (3.4) is not solvable 

(mb) Determine the components a^ „ , , in accordance with 

(3.5) If they are non-null functions solely of the arguments y 

and u then the canonical model is determined, else system 

(2 1) IS nontransformable into (2 4) 

(iv) Form the expressions (3 h) and replace them in (2.3) to 
find the transformation x - T(i, u). 

(v) compute the inverse transformation T '(x, u) by the 
recurrent formulae following from (3.1): 

'(x,u)-/i,(x,u), 

-7 n, ^ T, iA(>l. U) - DT, |(X, U) 

- a, n,iki !(•>(*’ 

y - rii 3- ri;, "f • ■ ■ -t- n,, k — n, - I, n, - 2, . , . . 1. 

/--I. 2. . ,m 

The integration of the systems (3.4) in step (iii) ean be 
performed conveniently with zero initial conditions. Nonzero 
initial conditions can be used to scale the z vector The 
algorithm requires the conseculive comp<^sition and solving 
ol the lirsl order systems of linear partial differential 
equations (3.4). As far as ihe solution found for the 
component is used into the system of equations for 
the algorithm is recurrent. 

5 Example 
Consider the .system 

il ~2Vx,(x, u,)-4r,A4, 

^ {x^ -h 1)(X4 - Vx,) 3- (VT, ■ X4)U|, 

i, - 3" - M| , 

A4*'"(v(r, -A4)‘ 3 (A^-uJ'^'f VA|, 

y, A4 “ Vjt,. 


On «, = ^2 = 2 the observability map (2.2) takes the form 

V, =X4-Vt,, 

>1 = (V^I - + (-fi - “i)^ + Vxl 

-X, - xj - Uj + 2 Vx^X4, 
y2 = x,-U|, 
y3 = X 2 + x,-U|-ii,. 

Its Jacobian matrix has rank 4 and the solution (2.3) being: 

■*^ 2 = v? -yz + lii. 

A^-y. + Ui. 

■^4 = yi -"y 2 +Wp-^-yi- 
The derivatives /il"'^(x, u) are 

^ A, 3- X 4 3- U 2 -- ly/x^ A 4 3- 2(a^ - U|) 

X (Az 3-A, ’ u,) - 2 (a^- ujrii - li., 

~ (a, + 1 )(a 4 - Va^) 3- (Va, - A 4 ) 

X u , 3- A^ 3- A , - w 1 — li I w I , 
and after substitution of the solution (2.3) take the form 

Tile .systems (3.4a) read 


9a 2 


3a, ro- 


dy 

2 vJ 

dii 10 - 

i9u 

9a^ 

f’l. 

r 

da, \ 


LiJ' 

du L 0 

c?U “ 1 


These systems are solvable and have solutions fl> = y}-Wz. 
fl 4 = y 2 - ii|. FTom (3.5) there lollows 

“1 ■- fi) “ w)" vf 4- Wj, 

a , = h\,^'^{y, u) “ Da^iy, U, ii) - v, 3- y, v, 

rhese functions depend solely on y and u. Then the 
canonical model is 

i, = Z; 3- li^. 

22 = 2 , ^ U 2 , 

i,-\ ' i .2 4 ^ 7 '<- 4 > 

i4-~-^z,y z4~m,. 
yi = 
y2 = ^ 4 - 

The straight and inverse transformations are computed m 
accordance with steps (iv) and (v) of the algorithm: 

X ^T(z, u) Iz), z„ Z4 ■+ U|, z, + z^]^ 
i = T '(x. u) = |Vx,, X., - Vxi, X,, X,- u,l'. 

b. Conr/u.vion<v 

The method proposed and the two-siep transformation 
method of Keller (1987) are essentially similar. Due to the 
special kind of the introduced canonical form, the 
transformation method considered has been developed as a 
straightforward algorithm for nth order nonlinear MIMO 
systems The class of the transformable systems is restricted 
by the reduced dependency on time derivatives of the input 
and the linear output equation of the observer canonical 
model. 
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A Nonlinear Fuzzy Controller with Linear 
Control Rules is the Sum of a Global 
Two-dimensional Multilevel Relay and a Local 
Nonlinear Proportional-integral Controller* 
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control. 


Abstract—The author analytically proves that a nonlinear 
fuzzy controller with linear control rules and \ members for 
input fuzzy sets is the sum of a global iwo-dimcnsional 
multilevel relay and a local nnnhneai proportional-intcgral 
(PI) controller which adjusts the control action generated by 
the global multilevel relay. As N increases, the resolution of 
the global multilevel relay is enhanced but the n)lc of the 
local nonlinear PI controller in total control action is 
decreased. As N approaches the global multilevel relay 
approaches a regular linear PI controller while the control 
action from the local nonlinear PI controller approaches 
zero. T he role of the global multilevel relay and the local 
nonlinear PI controller in total control action is quantita¬ 
tively described, as is the degree ol nonlinearity of the tuzzy 
controllers with different /V. 

1. Introduction 

To ADVANCt FUZZY control technique, sound theory needs to 
be developed. The author believes that one way to develop 
such theory is to analytically investigate structures of luzzy 
controllers and relate the structures to nonfuzzy control 
theory. Such relations will provide solid frameworks for 
analytically solving many important but previously difficult 
problems in fuzzy control technique, such as stability, by 
utilizing abundant well-developed and powerful nonfuzzy 
control techniques. 

To reveal structures of fuzzy controllers and link the 
structures with nonfuzzy control theory, a novel method was 
initially developed (Ying, IM87), presented (Ying et ai, 
1M8R) and published (Ying et ai, I'he work showed 

that the simplest possible nonlinear luzzy controller with two 
members for the input fuzzy sets, “error” and “rate change 
o( error” (“rate” for short) was equivalent to a regular linear 
PI controller when a linear defuzzification algorithm was 
used or to a nonlinear PI controller when a nonlinear 
defuzzification algorithm was used. Using this method, the 
results on the linear propierties of the fuzzy controller were 
generalized to fuzzy controllers with more members for the 
input fuzzy sets and different fuzzy logic, first by Siler and 
Ying (1989) and then by Buckley and Ying (1990) and 
Buckley (1989a). Moreover, the Limit Theorems for linear 
fuzzy control rules were developed (Buckley and Ying, 1989) 

‘Received 11 July 1991; revised 10 March 1992; received 
in final form 8 April 1992. The original version of this paper 
was not presented at any IFAC meeting. 1 his paper was 
recommended for publication in revised form by Associate 
Editor M. Tkeda under the direction of Editor A. P. Sage. 

t Department of Physiology and Biophysics, Biomedical 
Engineering Center and Office of Academic Computing, 
University of Texas Medical Branch, Galveston, TX 77550, 
U.S.A. 
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and extended to multiple-input-multiple-output fuzzy con¬ 
trollers (Buckley, 1990). Following the generalization of the 
results on the linear properties, the results on the nonlinear 
aspects of the fuzzy controller were also mathematically 
generalized to the fuzzy controller with more members for 
the input fuzzy sets, first by Buckley (1989b) and then by 
Wang pffl/. (1990). 

In this paper, a nonlinear fuzzy controller with linear 
control rules is first defined. The author then analytically 
derives the explicit structure of the fuzzy controller and 
relates the resullani structure to the multilevel relay and PI 
controller of nonfuzzy control theory. 

2. Theoretical analysui of structure of the nonlinear fuzzy 
controller 

2.1. Components of the nonlinear fuzzy controller. If T 
denotes sampling period and r\T (n is a positive integer) 
denotes sampling time, then the scaled inputs at sampling 
time nT arc 

p* - GE • e{nT) = OE[y(nT) - setpoint], (2.1) 

r* - GR • r{nT) -- GR\e(nT) - e{nT ~ T)], (2.2) 

where e{nT), r{nT) and y{nT) designate crisp unsealed 
error, rate, and process output at sampling time nT, 
respectively, and e{nT - T) specifics crisp un.scaled error at 
sampling time {n — 1)7', GE and GR are the scalars for the 
crisp error and rate. 

Let the number of members of the luzzy sets “error” and 
“rate” be the same and the membership functions be 
identical. This condition can easily be met, since if the 
number of members differs, some members can be added to 
the smaller set to attain equality. Assume there are 7(7 1) 

members for positive “error” (“rale”), 7 members for 
negative “error” (“rate”) and one member for zero “error” 
(“rale”). Therefore, there are a total of 

N = 27-tl-3, (2.3) 


members for the fuzzy set “error” (“rate”). Index systems 


{E 

and 

...,E 

E„,E,.. 

..,Ej 

„Ej}. 

{E 

.R , 

, R[), R\ 

Rj , 

. Rj). (2-4) 


arc adopted to establish relationships between the indexes 
and the names of the members of the fuzzy .sets “error” and 
“rate”. E, represents a member of the fuzzy set “error” and 
/?, represents a member of the fuzzy set “rate”. The positive 
indexes specify the members for positive error (idte), the 
negative indexes denote the members for negative error 
(rate) and the index 0 corresponds to the zero error (rate) of 
the fuzzy sets. The membership functions corrc.sponding to 
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the members in (2.4) are expressed as: 

ift-Ax), I* .Mn(jr). 

,(jc), juy(x)). (2.5) 

Denote the central value of the membership function as 
ki and define k. j = -L, A^) = 0, and kj = L. Also, let the 
space between the central values of two adjacent members 
be equal. Then the space S is: 

S = J, (2.6) 

and consequently the central value of ^,(jr) is A, = / ■ .V. It is 
obvious that the base of each member is 2S, It should be 
noted that the equality of the bases docs not imply loss of 
generality because the bases of the members of ‘‘error” and 
“rate” are different with respect to the actual unsealed 
inputs, e(nT) and r(n7’). 

The membership function ^,(x) in this study is the 
commonly-used triangular-shaped membership function sat¬ 
isfying the following conditions: 

(1) For r =1, -y+ 2, . . ,7-2,/-I. 

;4,(x) = i[z-(i-1)5], if z 6 [(i - 1).5, i5], 

=-^[Jt - ('+ if z e[f5, (/+1)5], 

M,(jt) = 0, if z«[(/ -l).5, (. + 1).5] 

(2) For I = y or t = -J, 

= 1)5], if z 6 ](y-1)5,75], 

= if jt e (75, 

and ^y(jr)-0, if jt ^ ((7 - 1)5,+o^), 

^ 1)5), if jr e [-75. (-7 + 1)5), 

if x€i-^JS\ 

and fi /(jr) = 0, if jc ^ (-71)5). 

It IS obvious that 

^Sx) + ju,^ ,(x) = 1, xe -1-'^). (2.7) 


Figure 1 shows an example of such a membership function 
with N = 7 (7 = 3) and 5 = 5. In this paper, /i,(e*) is denoted 
as the membership for E, and /i,(r*) as the membership for 
Rr 

Assume there are 2N — 1 (or 47+1) members in the fuzzy 
set “output”. Among these, 27 members are for positive 
“output,” 27 members are for negative “output” and one 
member is for zero “output”. Using the index system ( 2 .^^), 
the members of the fuzzy set “output” can be described by 

{u U U f/,„ U, .„ U._j]. (2.K) 

The central values of the members of the fuzzy set “output” 
are designated as y,. Let y y„ = 0 and Yij H 

Further, let the space V between the central values of two 
adjacent members be equal. Therefore the space is 


and the ith central value can be written as 

( 21 ") 

For the luzzy set “output,” the author requires: ( 1 ) the 
membership function be symmetrical about its central value; 
and ( 2 ) the shape of the membership functions of all the 
members be the same. 

It is necessary to use /V" fuzzy cimtrol rule.s to cover all the 
possible combinations of N members of the fuzzy set “error” 
and N members of the fuzzy set “rate”. In the study, the 
fuzzy control rules must comply with the following rule; 

IF “error” is E, and “rate” is 

THEN “output” IS U (2.11) 

In other words, the index of the member of “output” is 
always equal to the negative sum of the indexes ol (he 
members of “error” and the members of “rate”. Such a 
control rule i.s called here a linear control rule. To illustrate 
this rule, take N ^ 5 as an example. If the members of the 
input fuzzy sets are (negative medium (NM), negative small 
(NS), zero (ZO), positive small (FS), positive medium 
(PM)) and are represented as {F f i- ^ 

2 * ^ 1 ' ^( 1 - ^ 1 ' ^^ 2 )' corresponding nine 

members of the fuzzy set “output” can be indexed as {U 4 , 
(7 ,, fy 2 . I. ^< 1 - ^ 1 . ^ 2 > fA) which may be 

interpreted as (negative very large (NVL), negative large 


membership 



-15 -10 -5 0 5 10 15 


Fig. 1. An example of a triangular-shaped membership function with seven members (N ~ 7). T he space 
between the central value of two adjacent members is 5 (5 - .S) 
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Tabif 1 An ixampil io shdw 



t ,(NM) 

R ,(NM) 

(/,(PVL) 

R ,(N.S) 

(A(PI.) 

/?„(ZO) 

<./,(PM) 

W,(PS) 

JA(PS) 

K,(PM) 

UuUO) 


HOW lo CONST Run 2S HI/7Y C 
RUII (2 11) whfnN^ S 


E ,(NS) 

U70) 

JS(PL) 


t/,(PM) 


fA(PS) 

l/„(ZO) 


IJ |(NS) 

U ,(NS) 

U ,(NM) 


ROl Runs ACCORDINC. 10 IHI 


£,(PS) 

i\(PM) 

[/,(PS) 

C/„(ZO) 

U„(ZO) 

II ,(NS) 

V ,(NS) 

U ,(NM) 

LI ,(NM) 

LI ‘.(NL) 

V ,(N1.) 

LI ,(NVL) 


(NL), NM, NS, ZO, PS, PM, positive larj^e (PI ), positive 
very large (PVL)} The corresponding 2S (u/zy control rules 
satisfying the rule (211) are shown in 1 able 1 

Zadeh fuzzy logic AND is used to execute the 11' side of 
the fuz/y control rule That is 

p(i.y)-Min(//,(e*), fx,(r*)). (2 12) 


Being lu//jfied, the memberships of e* and r* arc obtained 
as 

- [(c*-0 4 1)A], !(<’*-'•'>]. (- <«) 

''mI"'(^19) 


where is the membership ol the member ot the hiz/y 

set "output'’ obtained when t, and R, arc used m the IF side 
Because the membership function ol "output” is symmetrical 
about Us central value, the central value of the member 
y (n,) Y (.!/)■ 1^*^' resultant membership Irom the IF 

side, namely ^(/./), are used lo calculate the 1 KLN side ol 
the fuz/y control rule i e 

i;(/. y) -//(m) y (M/)- -Mm(p,(p") p,(rM) (/^ylV. 

(2 \^) 

where i'(f y) is the incremental contiol output contributed by 
the fuzzy contiol rule (2 11) II more than one mcmbciship 
results, say u, and . from executing two dillcrenl luzzy 
contiol rules Lukasiewicz fuzzy logic OR is used to get 
combined membership, because the conditions being 
ORed are maximally negatiyely correlated I hat is 


which are the memberships lor the members E,, ,, R^ and 

R^,t, respectively Membership lor all other members of 
"crior” and rate” is zcio Therefore only the following 
lour fu/^y control rules are executed 


(rl) If "error 
^ .. M.M 

(r2) It ‘error” 


IS + , and Tate” is , , then "output” is 
IS and rate” is then "output” is 


(rl) If "error is E, and "rate” is then “output” is 

(r4) If "error” is E, and "rate" is R then "output' is 

Applying equation (2 IJ) to each of the fu//y contiol lules 
results in the following 


p - Min (^1 -I- jUi 1) (2 14) 


Recall that the shapes ol the membership lunctions ol 
output ' were required to be the same In the deliizzihcation 
piocess theielore, the contiibution from the members of 
‘ output in the I III N side ol the luzz> control rules should 
be weighted by their memberships calculated from the IF 
side ( onscquently, the scaled crisp incremental output 
(ill Ai/(/i7) can bi c.dculatcd by the lollowmg dcluz/ihca 
lion algorithm 


T. »'(' }) 

(lU \u{nl)-LtV ^ 

})y 

- LL 

>: /) 


1^) 


Finally, a new crisp output ol the luzzy conliollei at sampling 
time nl is calculated as 


u{nl)^u{nl 1 )^(jV Aw(/i/) (2 16) 

where (jU is the scalar for incremental output and 
u{nJ - T) IS the output ol the lu/zy controller at sampling 
time (n - 1) / 

2 2 Analytical analysis oj stria turv oj ihc nonlinear Juzzv 
controller with linear control rules 

Theorem 1 I he structure of the nonlinear fuzzy controller 
With linear control rules is the sum of a global 
two'dimcnsional multilevel reia\ and a local nonlinear PI 
controller 


Proof The author first proves the theoiem in the situations 
III which both e* and r* are within the interval [ L, I \ 
Others situations will be dealt with Idler 
(A) Both e'* and are within the interval \-L,L\ 
With losing generality, assume I hat 


iS e* ' (/ -f 1)V, 
y.S-r*- (y+ 1).S 


(2 17) 


(rr) oil T L / + 1) - -Mm (p,, ,(e*), 

p,,.(r’^)) (i+yT2)V 

(r2^) v{i -+ L y) - -Mm (p,, ,(<'"). (i+y + 1)V 

(r'P^) n(r,y -t- 1) - -Mm (/i,(r*). ju,, ,(r*)) (/ Fy + 1)V 
(r4*) n(i,y)- Mm (|U,(e*), iU,(r)) (/ fy)V 

To determine the results ol the Mm operations m (rL) to 
(r4*) the author conhgures a square by the intervals 
[/,S (/+ 1)A] and |/S,ly+l).S| and divides the squaic into 
eight regions as shown m Fig 2 In diflercnl legions, /i,(c*) 
^ 1 , ,,(f’*), //,(/*) and have diflcrcnl relationships in 

terms of the magmiudes of the membcrshijis The outcomes 
ol evaluating the Mm operations arc illustrated in I able 2 
Since the fu//y control rules r2 and r‘\ generate two 

GR*r(nTl 



Fn» 2 Possible input combinations (IC) ol scaled error, e*, 
and scaled late change of error, i*, ol process output which 
must be considered lo carry out the Mm operation in (rP) to 
(r4*) when both e* and r* are within the inteival [~L, L] 
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Table 2. Results of evaluating the min operations in (rl*) to (r4*) for all 

COMBINATIONS OF INPUTS USING ZADEH FUZZY LOGIC' AND (MIN) WHEN SCALED ERROR 
AND RATE CHANGE OF F:RR0R OF PROCESS OUTPUI ARE WITHIN THE INTERVAL \ -L, L\. 
THE INPUT COMBINATIONS OF SCALED ERROR AND RATE CHANGE OF ERROR ARE SHOWN 

GRAPHICALLY IN RC. 2 


Region 

rl* 

r2* 

r3* 

r4* 

rCI and 1C2 

/i„ ,('■*) 

ftCr*) 


ft(«-‘) 

IC3 and IC4 

ft. !(<■’) 

ft(^*) 

ftCf*) 

ft('-*) 

IC5 and 1C6 

ft. i(e’) 

ft.,(^*) 

ft.,(^*) 

ft(r*) 

IC7 and ICS 

ftM(r*) 

ftn(f') 

ft,,(^*) 

ft(f') 


memberships for the same member, the equation 

(2.14) is needed to calculate the combined membership for 
V. 1 ). For the ICI to IC4 regions, 

0.5)51 ^ I 

because 

0ii[e*-(i + a5)A^] + lr* - U-^{15)S]^S. (2.21) 

Similarly, it is easy to prove 

M /1 \(f*) ^ 1, for IC5 to 1C:H regions. (2.22) 

Hence, the combined membership for t/ (, + i.s always the 
sum of the memberships being ORed. Replacing the Min 
operations in (rl*) to (r4*) with their corresponding 
outcomes in Tabic 2 and using the defuzzification algorithm 

(2.15) in connection with (2.7), (2.9), (2.IK) and (2.19), the 
scaled incremental output, GU Au{nT), for all eight 
regions can be found as follows: 

for ICI, IC2, IC5 and IC6 regions 


GU • Au(nT) = —(i -E) + 1) 


GU H 

r 


(2.23) 


_ \ GE • e (nT) - (i + 0.5)5] + [GR - r{nT) - (/ -F 0.5)51 
2S - 2l GE^nT) - ’(74 0.5) 

GU ■ // 

N^[ 

for IC3, IC4, IC7 and IC8 region.s 
GIJ H 

C17 Au(nr)=-(,+;+1)"^-- 

_ |(^g f(>'7')-(i + Q.5).V) + K;W -^nT)- (/^^).5).V| 

2.9 - 2 |GR ■ r{nT) - (/ + O Sj.vi 

GU II 

x-^VT- C 


GU • Au(nT) consists of two parLs. The first part is 
“(i 4/4 l)Gf/■ A//(A/- 1), which is a two-dimensional 
multilevel relay with respect to i and y. Note that the 
multilevel relay, denoted as Relay(/,/) can be rewritten as 

« . / V . ..GU H 

Rclay(i. ;)=-((+;+ 1) 


- - ((/ + 0..5).S + (y + ().5).9). 

= -((i + 0.5).S + (y + 0.5)5) (2.2.5) 

The point ((i 4 0.5)5, (7 4 0.5)5) is the coordinate of the 
center of the square shown in Fig. 2. Evidently, the 
multilevel relay contributes its control action according to the 
absolute position, with respect to the entire scaled input state 
plane, of the center of the square in which the current scaled 
input state {e*, r*) lies. Therefore, the author calls the 
multilevel relay a “global” multilevel relay. The second part 
of GU' Au(nT) is a nonlinear nonfuzzy controller, which is 


denoted as du(f, y). The equations (2.23) and (2,24) indicate 
that 6u{i, j) is calculated according to the relative position of 
the current scaled input state (GE c(riT), GR ■ r(n7')) with 
rc.spect to the center of the square, ((/4().5)5, (/4 0.5)5), 
in which the current scaled input state lies. Therefore, one- 
can see that the role of the nonlinear controller is to locally 
adjust the control action generated by the global multilevel 
relay. The author calls such a controller a ‘ local * nonlinear 
controller. 

A regular discrete-lorm linear PI controller whose output 
becomes zero when it.s inputs, e{nT) and r{nT), reach a 
steady-stale ((/4 0.5)5/G/’, (j ■n).5)S/GR), can be ex¬ 
pressed as 




-t- A' 


r{nT) 


(y + 0 5)5 
GR 


). (2.26) 


where K^, and K, arc the proportional-gain and integral-gain, 
respectively. Therefore, the local nonlinear controller i.s 
actually a nonlinear PI controller with a local and changing 
steady-state ((/ 4 0 5)S/GE, (/ +().5)5/G/?); 






GR 


The proportional-gain and integral gain change with input 
stales and are described in equation (3.1) 

(B) Either e* or r* is outside the intcival ( L, L\. 

To analytically describe the behavini ol the nonlinear 
fuzzy controller when either r* or r* is outside the interval 
{-L, L], the author divides the scaled input state plane 
outside the square configured by the interval [-L, L] on the 
sealed error axis and the interval L, L] on the scaled rate 
axis into 12 regions, as shown m Fig. 3. By using the same 
method described above, GU ■ Af/(nT) can be analytically 
derived for the regions, as shown in Table 3. According to 
Table 3, the nonlinear fuzzy controller becomes the sum of a 
global one-dimensional multilevel relay and a local linear P 
controller with a local and changing steady-stale for the IC9, 
R. 10, KJ3 and 1C14 regions, and the sum of a global 
one-dimensional multilevel relay and a local linear integral 
(I) controller with a local and changing steady-state for the 
fCll, R‘:i2, IC15 and IC16 regions. The nonlinear fuzzy 
controller generates its maximum increment {GU H) and 
decrement { — GU ■ H) in the ICT9 and iri7 regions, 
respectively. For the ICIR and IC20 regions, the increment is 
zero. B 


It should be noted that when a scaled input state {e*, r*) is 
on a boundary of two adjacent regions, GU Au{nJ) 
calculated by using the formula of cither region is the same. 
In other words, there is no discrepancy in control action. 


3. Properties oj the nonlinear fuzzy controller with linear 
control rules 

3.1. Dynamic change of local nonlinear FI controller 
gains. Comparing (2.23) and (2.24) with (2.26), one can see 
that the proportional-gain and integral-gain of the local 
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ICIB 

i 

1C12 

U;R*r(nT) 

ini 

L 

IC17 

in 3 



ICIO 

IC14 



‘ GE*rlnl) 

irg 

ICl 9 

IC15 

1 

l( lA 

IC20 


Fio ^ Possible input combinations (IC) of scaled error, e* and scaled rale Lhan(»c ol error, r*. ol process 
output which must be considered In carry out the Mm operation in (rl*) to (r4*) when either r* or r* is 

oulsidi the interval | / L\ 


lABir 1 Tin- S( A1 t 1) INCRIMINIAI OinPUl Ol THl H)//\ (ONIROLLLRS 
GV AinD, WHIN nmFR srAiii) l rror (7f e(n7) OR St At ID RAH 
cHANCii Oh I KROR GR r{nT), or proci ss ouipur is ouisini ihi intirvai 

I-/ /| ntl INPIK l OMHINAIIONS Oh SI Airo tRHOK AND RAIl (HANOI Ol I KROR 
AR[ SHOWN r.RAPHK Al l> IN hU. 2 


and 

, Gif 




, av H 

1C 10 


1 

(GR 

r(Ai/)- 

> 2/ 

ICO 1 and 

, (,1/ 

H 



,('.U H 

IC 12 


1 

{Gl 

e(n 1 ) 

' '' 2/ 

IC^n and 

(,U 

H 



,(,V H 

IC 14 


1 

(GR 

r{nl) 

> 21 

ICIS and 

, , all 

H 



, (,U H 

K^lh 


1 

((,L 

c(n1) 

' 21 

IC 17 



-Of; 

H 


ins 



0 



K P) 



(f(/ 

// 


IC2() 



0 




nonlinear PI controller vary with input stale and are 

__ct_au H 

4L ' ^ 


(3 1) 


Obviously, the local nonlinear PI controller can automati¬ 
cally adjust the proportional-pain and integral gain to adapt 
to different scaled input states I he further the current scaled 
input state {GF e(nf) GR r{nl ) is Irom the center ol the 
square ((i + 0 ^)S (/4()S)S) the larger the proportional 
gam and integral gam With the constraints on v* and r* 
spccihed in (2 17), the range of the value ol the nonlincai 
function r*) is calculated as 

1 - 2 (^ 1 ) 

Hence, the ranges ol the proportional-gain and mtegral-'gam 
are 


where 


P(e\r*) = 


S - \GF 


S 

~einT) - (i 4 0 5)S\' 

for ICl, 1C2, l( 5 and IC6 regions, 


and 


(3 2) 


P{e*.n- 


5- \GR 


S 

r(n7)-(/ +0 5)51’ 

for IC3, 1C4, 1C7 and K'K regions 


GR GU H ^ ^ , GR (>U // 

' AL ' 21. 

0 4) 

3 2 The role of the global multilevel relay and the loial 
nonlinear PI controller in total control action and degree of 
nonlinearity The absolute value of maximum Relay(i,y) is 
Rclay^^, - (/V - 2)C;L/ fII(N- 1), which is achieved when 
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= nr '|'he absoluU* value of maximum 

^u(i, /) Ls - OU ’ H/(M “ 1), which is achieved when 

C£ • p(nT) - (1 I 1 )S and G/i r(nT) - (j -f 1 )S ur when 
GE ■ e(nT) - / .V and GE ■ r(nT) - J - .V. The author deftnes 
the ratio 


regular linear PI conlrollcr when N is (p is zero). The 
corresponding and K, arc 


GR GII H GE GU H 

. 2ir^~ . 2l'~ 


(3.12) 


P = 


du,_ 


Relay,,, 


X 1(K)% - ,- X 1(K)%. 

N - 1 


(3.5) 


to describe (1) the role of the local nonlinear PI controller 
and the role of the global multilevel relay in total control 
action; and (2) the degree of nonlinearity of the nonlinear 
fuzzy controllers as N changes. The smaller ihe ratio p, the 
less .significant the role of the local nonlinear PI controller in 
total control action and the more sigmricanl the role of the 
global multilevel relay in total cfmtrol action. When yv = 3, p 
reaches its maximum, 50%, which indicates that the local 
nonlinear PI controller plays as important a role as docs the 
global multilevel relay. 

According to (3,4), the ranges of the proportional-gain and 
integral-gain are independent from N. I’hal means that the 
ability of the local nonlinear PI controller to adapt locally to 
input slate is the same for fuz,7y controllers as N changes. 
However, it should be noted that the role of the local 
nonlinear PI contrcdler in total control aetinn is governetl by 
the ratio p and Ihercfoic is dilTcrenl when N is diffcicni. 

The ratio p also describes the degree of nonlinearity of the 
nonlinear fuzzy controllers. 3 he smaller the ratio p, the finer 
the resolution of the ouipul of the global multilevel relay and 
therefore the less nonlinear the fuzzy contioiler 

3.3. Thv siruviiAre of the nonlinear fuzzy controllers when 


Theorem 2 (Limit theorem) The nonlinear fuzzy controller 
with linear control rules becomes a linear PI controller as 


Proof When the control action from the 

local nonlinear PI controller approaches zero according to 
(2.23) and (2.24), that is du{i, /)—►(). On the other hand, the 
control action from the global multilevel relay becomes 


Rclay(i,y) - -(i + /+ 

2J 

Because 


GV Hii i\ 


(. 1 . 6 ) 


I.V - I (i + 1).V - (i + I) , 

yS' --) J. (/+ i).s = (y + \)j. 

the inequalities (2.17) can be writ ten as 


(3.7) 


/ c* 
L 


/ 4 I 

T 



r* 

L 


7 + 1 
J 


and hence 


(3,K) 


L 


I 

J 


r" 



} 


(3.9) 


when (thcrcfnie / jc, ;x and / —► '^). Substituting 

(3.9) into (3.6), yield 


Rcluy(/./) 


GU H ^ ^ 

-2L- (' 




2L 


' einT) 4 


2L 


r(nT)). (3.10) 


and hence 


4. Relationship between fuzzy controllers with N ^ 3 and that 
with N - 2 

In Ying ct ai (1990), we analytically proved that a simplest 
possible (N -- 2) nonlinear fuzzy controller constructed in the 
same way as those in this paper was a nonlinear PI 
controller: 

for ICl, IC2, ICS and IC'6 regions 


, ('r. ■ -f GR ■ T(nT] GUj H 

2L - GE ■ \e(nT)\ 2 ' ^ ^ 


or for IC3, IC4, IC7 and ICS regions 

GE e{nT) 4 Off r(n^l') GU II 


duinT) - - 


IL GR ■ |r(nT)| 


(4.2) 


where 


GE - |c(n7')| L and GR • \r{nT) -- L. (4. 3) 

Based nn (4 1) and (4.2), the simplest possible nonlinear 
fuzzy controller docs not incliule the multilevel relay and 
hence the nonlinear PI controller is a global eontrollcr having 
one single global and fixed steady-state (0, 0) It can be easily 
proven that the fuzzy cfintroller with /V - 2 can be expressed 
as 


duinT) ^ r^)e{nT)-\ K^,{c\ )/-(aiT’)|, (4.4) 

where the gains and the ranges of the gains are Ihe same as 
those in (3.1) and (3.4), whieh iiidiLales that the loeal 
nonlinear PI controller can adjust its gains to adapt to 
different input states as much as the global nonlinear PI 
controller can. However, there is a fundamental difference 
between these two nonlinear PI controllers. That is, the role 
the local nonlinear PI controller can play in total control 
action is Ic.ss significant because the role is governed by the 
ratio p. The role is small when N is large. On the other hand, 
the global nonlinear PI controller coniribuies sole control 
action and therefore its ratio p is UM)% by definilion, which 
also means the fuzzy controller with N - 2 is more nonlinear 
than any other fuzzy eontrollers. 

5. Conclusions 

The author eoneludes that the nonlinear fuzzy controllers 
with N consist of a global mulnlevel relay and a local 
nonlinear PI controller similar to the global nonlinear PI 
controller when N - 2 but with a Uieal and changing 
steady state ((i 4 ().5).S767:, ( j-i l).S)S/GR). 3’he conse- 
quenees of employing more than two members (/V 3^3) for 
input fuzzy sets are (1) introducing the global mullilevel relay 
with resolution GU • HUN - 1); and (2) reducing the role of 
Ihc local nonlinear PI controller in total control action from 
1(K)% to l/(/V 1). Larger N makes the fuzzy controllers less 

nonlinear. As N approaches the nonlinearity disappears 
and the fuzzy controller becomes a linear PI controller. The 
degree of nonlinearity of the fuzzy controllers and the role of 
the local nonlinear PI controllers in total control action are 
quaniiialively described by introducing the ratio p. The fuzzy 
conlrollcr with N-2, whose p is 1(K)%, is the most 
nonlinear fuzzy eorilrollcr. 
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GU ■ AM(n/) - Relayfi, j) 
(GV H GE 
2L 


, GV H GR 
einT)-^- 


r(«7')). 


(3.11) 


Therefore, one can immediately conclude that the nonlinear 
fuzzy controller (and the global multilevel relay) becomes a 
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Polynomial LQ Optimization for the Standard 
Control Structure: Scalar Solution* 

K. J. HUNTt and M. SeBKKJ 

Key Wonb —Optimal control; aipcbraic system theory; polynomials 


Abstract —The problem of LO optimization lor the 
“standard” control structure is studied We obtain a soluticm 
for the scalar case of this problem using the pKilynomial 
equation approach. In solving the optimal regulation 
problem using polynomial techniques wc derive a couple t>f 
linear polynomial equations which together define the unique 
optimal regulator. The coelficients of these equations arc 
obtained from polynomial spectral factorization. 

I. Introduction 

In this work we focus on the synthesis of I .(J optimal 
control systems using the polynomial equation approach 
(Ku6era, 1979). In particular, we derive the optimizing 
solution for the scalar case of the Standard Control 
Structure. This polynomial approach provides a di.stinctive 
alternative to the state-space (Kwakcniaak and Sivan, 1972) 
and Wicner-Hopf (Youla e( al., 1976) optimization 
procedures. 

The standard structure (Fig. 1) has been used as the basis 
of optimal control problems (Francis, I9K7; Kwakernaak, 
1990). Il has also been studied in 7/,-optimal control 
problems; firstly using the Wiener-HopI approach (Park and 
Bongiorno, 1989) and then the stale-space approach (Doyle 
et al., 1989). 

In the multivariable case all sensor outputs arc 
incorporated intoy, while the regulated variable is the vector 
z. The vector f denotes all exogenous inputs to the .system 
including, for example, disturbances and references The 
vector u reprevSenls the control inputs. In the paper wc 
restrict our attention to the case where all thc.se signals arc 
scalar. This provides an important first step towards solution 
of the multivariable problem, which is currently under study 
(sec Hunt ct al., 1991). 

The stability of the scalar standard structure was lully 
analysed in Kui^era (1986). Under the assumption of full 
internal stability of the overall system a solution for the LO 
optimization problem for the standard structure was obtained 
by Hunt et al. (1992). The results presented in that work arc 
closely related to the solution given here. Here, however, wc 
recognise that the standard structure is an artificial 
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construction incorporating a variety of control problems and 
therefore require only stability of the lecdback pari of the 
system. 

A preliminary version of this paper was published by 
HunI and Sebek (IWl). The solution ol a closely related 
problem may be lound in Hunt and Ku6era (1992). 

l-l. flotation All systems con.sidercd in this work arc 
assumed to be linear, time-invariant and discretc-iime. The 
systems are described by means of real polynomials in one 
indeterminate d. Where d is to be interpreted as the unit 
delay o|Krrutor described by the relation dx{t)~x(t I) for 
any sequence A(t). The reader is leferred to tin. work by 
Kut'era (1979) for details. 

F'or simplicity, the arguments o( fKilynomials arc usually 
omitted; a polynomial X{d) is denoted by X For any 
polynomial X(d) wc denote its adjoint in negative powers of 
d by X*id) For any polynomial A’(^J) we define (.V) as the 
constant term independent of d Stable polynomials aic those 
with all their zeros having magnitude greater than unity. 

Problem Jormulalion 

2 1 Plant. T he plant under consideration is governed by 
the polynomial model 

A^y - B^u + ( 1 ) 

A,z - C> -f /Xt. (2) 

where y is the measuremeni, z is the varialile to be 
regulated, u is the contiol input, and C is a sloehastic 
disturbance. A^,B^,F^ and A^,C\,D. arc two iriplc.s of 
coprime polynomials. 

Further, we wrile A^ and A in terms o( their 
slabic/unstable faciorizaiions as lollows. 

A, -a; A,, (3) 

A,- A! A,. ( 4 ) 

where ' denotes a stable polynomial and denotes an 
unstable f>olynomial. 

Moreover, it is a.ssumcd that. 

(A.l) the disturbance t is a stationary, zero-mean while 
noise .sequence with mten.siiy 0 .^0; 

(A.2) the Iran.sfcr function is .strictly causal so that 

-0 and (A,) - 1; 

(A.3) the unstable part of divides A^, i.c there exists 
.some polynomial K such that A, K = 

Remarks 

(1) The plant description (l)-(2) is quite general and 
corresponds to a plant described by a transfer matrix 
where the denominator of each entry equals the least 
common denominator of the corresponding row. 

(2) Assumption (A.3) is a standard assumption (Park and 
Bongiorno, 1989; Doyle et ui, 1989) meaning that all 
unstable modes appearing m z must also be seen in y to 
be controlled. 

2,2. Regulator. The linear regulator which operates on the 
plant measurement y is described by 

u - ~Rv. (5) 
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Fin. 1. Standard feedback system. 


The iranslcr function R is written in rational lorm as 

R = X 'T. (6) 

where A' and Y are polynomials, and (A') - 1. Thus, we seek 
a regulator described by (S)-(6) and are only interested in 
those with X((J) invertible. 

We also write the regulator transfer function in the lorm 
K=P '</. (7) 

where p and q are rational functions to be further specified 
below. 

A deep analysis of the internal stability o( feedback 
systems has been given by KuCcra (1979, 1986). This analysis 
shows that to ensure stability ol the feedback part of the 
system we must have: 

(I) p and q must be stable rational functions which satisly 
the relation 


the given data make the problem regular (that the above 
conditions hold), i.e. that; 

(A.5) there exist stable solutions /A and Df to equations 

(l())-(ll). 

We may now summarize the solution to the optimal 
regulation problem: 

Theorem 1. The optimal regulation problem using measure¬ 
ment feedback has a solution if and only if; 

(CM) the greatest common divisor of and is a stable 
polynomial. 

(C.2) there exists some polynomial H such that A //- 

The optimal regulator is a realization of the unique 

transfer function 

'cm;. (12) 

Here, P and Q is the unique solution having the property 
(Z) -0 of the linear polynomial equations 

D*l)}Q~Z*A, - C,, (13) 

DtDfKP ^ L,, (14) 

where the polynomials and C, satisfy 

(15) 

L,-- D,DIA*R,A' \ (16) 

Proof. 

1 he control and mcasiuement sequences may be wriltcn as 


pA, + qB,--\. (8) 

(2) R is realised without unstable hidden modes. 

We return to the relation (8) in the sequel when 
considering questions of stability. To satisfy the remaining 
c.ondition we proceed under the assumption: 

(A.4) The regulator is free of unstable hidden modes. 

Note here that in considering stabilizability ol the system 
we require internal stability of only the feedback part ol the 
system (later, when aiming to achieve a Unite cost we also 
require stability ol the closed-loop transfer tunction between 
t and 2 ), This concept recognises that the standard control 
structure is an artificial construct incorporating a variety of 
control problems, some of which lead naturally to a 
realization in the standard form which is not completely 
internally stable In oui previous solution to the 1.6 
optimization problem tor the standard structure we required 
complete stability of the overall system (Hunt ei ai, 1992). 
In our approach presented here the solvability conditions arc 
relaxed. 

2.?> Co^t. The desired optimal regulator evolves Irom 
minimization of the cost function 

Q.fp,), (9) 


u^-qE^C. (17) 

z^A \l)^ -qCJi^,)l. (18) 

The corresponding correlation functions are 

(p^ - qE^,(l)E*q\ (19) 

0,-A,'(D,M.J0(O -r/r.F'jM/' (20) 

Introducing the spectral factorization (11) 0,^ becomes 

0 .. ( 21 ) 


The proof now proceeds in three parts. In part (a) we 
determine the optimizing regulator, in part (b) we examine 
the stability ol the closed-loop transfer function m order to 
determine whether the cost is finite, and in part (c) we show 
that the feedback part ot the system is mtcinally stable. 

(a) Oplimizalion 

rhe cost lunction (9) may now be written as 

J^(R,ql),Dlq^ •+ Q,A. ^C.qD^DJqH -~ 

- qC.E^ H*0 + n D*(p)A, *’). (22) 

Introducing the spectral factorization (10) this equation 
becomes 


where 0„ and 0. are the correlation functions of u and z in 
the steady-stale, respectively. R, and are real, 

non-negativc weights, not simultaneously zero 
The optimal regulator problem is to minimize the cost (9) 
subject to the constraint that the feedback part of the system 
be internally stable. 

3. Problem solution 

Wc define the spectral factors 77 and 77, by the relations 

A*R A, C*Q,( Dfl\. (10) 

E^tpE* ( 11 ) 

and note that wc are interested in solutions /7,, D, which arc 
stable polynomials. It is clear that necessary and sufficient 
conditions for stable spectral factors to exist are that: 

(1) 77, A, and (7M . Ii^ve no common factors with zeros on 
the unit cirefe. 

(2) has no zeros on the unit circle. 

For simplicity, we proceed here under the assumption that 


7- (A, 'ql),DJ):nfq*A, 'M q \q*C:E:D,(P 
A qC, E„ 77 ! 0 - 77,77 * 0)A, ' ’). (2.3) 

After complefing the squares, and some algebraic manipula¬ 
tion, the cost becomes 

7= (vrv,)+(K), (24) 

where 


ipQ,DJ):R^ 


^2- 


77 77* 


(25) 

(26) 


We note that only the term depends on the regulator; 
does not include q. We therefore proceed by attempting to 
minimize the term 

The second term in V^ contains both causal and non-causal 
parts. These parts may be split by introducing the 
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polynomials Q and Z as follows 

c*t;n(t>Q _Q /* 

A d*d; a d*d; 


(27) 


Nott that this equation is just tht linear polynomial equation 
(M) V', now reads 


' A A n*i): 


(2K) 


IIndcr the eonstraint (V) — (V*) -0 Iht conlnbution of tht 
find term in (28) to the cosl vanishes To mmimizt. iIil losI 
WL therefore set the eausal part of V^ to zero 


A A 
or 

<?= 

^ O Z), 

hrom tht relation (K) the remaining regulator term is 


r^) 


n-A/{\ qB^) 


(M) 


ind in (10) and (11) together with (7) we have tht rigulalor 
which minimizes the cost Moreover due to the assumption 
ol the existence of stable spcttril factors L) and Df ( ^ S) q 
defined by (10) is seen to he stable However it is not clear 
Irom (H) that p is a stable r.ilional function To show that 
the feedback part of the system is stable we must still show 
that p IS stable 
(b) finite cost 

I he cost function will be hnite if ind only if all the r itional 
lunetions in equations (17) and (IK) nc stable St dulity ol q 
has already been pi oven so we must ^till e1e il with 

j A '(/) q( t )w (12) 


C learly proving that the optimal cost is Imite amounts to 
showing that the transfer function between md i, (the 
closed loop iransler lunction) is stable 

It IS straighiforw ird to u write the above expresHon m the 

'"™ A '(/) A r ((I >'> i< tvit (ii> 


After substituting /A, - A K and caneclling the A term 
ihioiighout this IS seen to bi the linear equation (14) 

Moreover it is novx cleai from (lb) that p is stable due to 
the definition ol the spectral laetors I) and !)( (and 
Assumption A S) and the feedback system is seen to be 
stable Assumplion (A 4) ensures that no unstable hidden 
modes can deslrov stability of the sysle-ni Putting togeihci 
equations (AO) and (lb) with equation (7) ihe optimal 
reguiatoi transfer funeiion (12) results 

f in illy we note that ( onelilion (C 1) in 1 henre'in 1 as 
well as ensuring st ibili/tibilily of the plant ds(i ensures the 
LXislenee of \ unique solutuin to ihe pair of equations 
(H) (14) with (V ; ()(sieK.uil ra (1970) for details) 

( orollan 1 Ihe polynomials /’ mil Q equalions (ll)-(14) 
also satisfy the line ir polynomial equation 

A /M « 0 fh 

Broof Substitute lor p and </ from equations (10) and (lb) 
into eqinlion (S) (the result ean 4ilso fie pioved by 
eliminating the eouplmg term / from the equalions 
(HI (14)) 

4 ioni fusions 

A solution It Ihe oplinul lef’ulation problem lor the 
standard eoiitiol sliueUirc has been given We ennsidereel i 
scalar extern il polvnomiil model of the plant Ihe optimal 
rcguljior polyiiomi ils were shown to satisfy a p ui ol coupled 
linear polynomial cquilions Ihc eoellicienls ^1 these 
equioons are ohl lined bv two speelral I u lori/alions 11 was 
shown th II one ol the polynomial ti|uations ginranteed 
opliinililv md (he other si ihililv ol Ihe iLedbaek part ol the 
system 

A( know Ud^i mints K ) Hunt is eurrentlv •! Personal 
Keseareh fellow ol the Roy d Soeiety ol I dmburgb Ihe 
ii'thors acknowledge the eomribiilion nl lolliague Vhidimir 
Kueera ibrough ni my diseussions surioiinding this line ol 
work 


Here q has already been shown to be e stable lationil 
lunetion Wc poslponi discussion ol the slabilitv ol p until 
part (c) below Then taking aeeounl of the Assumption 
(A 1) It IS clear that the remaining necessary md sutlieient 
condition for stability ol the closed loop Irinslei lunelioii 
(and therelorc for finite ness of the cost) is 

(I) The unstable part of A must divide (f / I) B ) \ c 
there must exist some polynomi il // such ih U 
A ff~C L ~ D B^ I his IS Condition (C 2) in (he 
theorem 

We note tinallv that m arriving it the above eondition my 
possible cancellations between the unstable part ot A and q 
in (11) WLie ruled out due to the constr unts (A A) and (H) 

(e) Internal stability 

We now invcstigdle the stability ot the rational function p 
r rom equation (H) p is given as 

p (1 B,(f)A, ‘ (1*1) 

Substituting firstly lor q from (10) and then foi () from 

(II) ^ may be wiitleii alter some rearrangemenl is 

p~{L*C*<pi)A H^l,<pb*AHA* /*A B,) 

X {P D,P*f)IA ) ' (^‘^) 

The denominator of this expression meludes the unslahk 
faelors D*Pf anti A^ In order to obliin a stable p the 
numerator must include these unstable terms as factors i c 
there must exist some polynomial P such that the numerator 
of the above expression is equal to O^P*A^ P Substituting 
this value into the numeralor we obtain 

P-, a. o<>) 

' l) D,A 

Also from the substitution we obtain 

d*d;p 

- Frc A /y + f ;A « A* 7*A B^ (17) 


Ri f( ri ni t s 

Dovle J ( K (dover P P Khiigoiukai md B A 
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Decentralized Estimation and Control with 
Overlapping Input, State, and Output 
Decomposition* 

ALTLJ6 iFTARtt 

Key Words —Large-scale systems; control theory; estimation theory; decentralized control; 
decentralized estimation; overlapping decompositions; linear systems. 


Abstract— The extension principle, which was introduced 
recently, is generalized to the case of input, stale, and output 
expansion. Estimator and controller design problems are 
considered within the framework of the extension principle. 
Decentralized estimator and controller design with overlap¬ 
ping dccompositionvS is also discussed within the same 
framework. It is shown that if the extension principle is used 
then any estimator or controller designed in the expanded 
spaces is contractible to the original spaces for implementa¬ 
tion. Furthermore, it is shown that if an estimator designed 
for the expanded system achieves good estimation and/or 
asymptotic estimation then the ennti acted estimator also 
achieves good estimation and/or asymptotic estimation for 
the original system. Similarly, it is also shown that if a 
controller designed for the expanded system achieves 
stability and/or good performance then the contracted 
controller achieves stability and/or good performance for the 
original system. 

1. Introduction 

Many gf today s technological and social problems involve 
systems which are so complex that it is very costly, if not 
impM)ssible, to handle such large dimensional systems as a 
whole. For the purposes of estimation and control, it is 
usually necessary to decompose the given system into a 
number of interconnected subsystems Once a decomposition 
is available, each subsystem is to be considered independ¬ 
ently and the solutions to the subproblcms arc to be 
combined to obtain a solution for the original problem. 

In order to obtain a useful decomposition, it is essential to 
identify the parts of a system that are weakly interconnected, 
flowcvcr, many large scale sy.slems (c.g. see Ozgiiner et al., 
1988) may consist of subsystems which are strongly 
connected through certain dynamics (the overlapping part), 
but weakly connected otherwise. For those systems, disjoint 
decompositions may easily fail to produce useful results. 
However, it has been demonstrated that the recently 
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introduced overlapping decompositions (ikedu and Siljak, 
1980) may produce useful solutions in such ca.ses (c.g. sec 
iftar and Ozgiiner, 1987). 

The estimator or controller design approach within the 
framework of the inclusion principle (Ikeda ct al., 1984) 
starts with expanding certain spaces (e.g. state, input, and/or 
output) of a dynamic system with overlapping subsystems 
into larger spaces such that the subsystems appear as 
disjoint. Decentralized estimators or decentralized contrid- 
lers arc then designed in the expanded spaces and arc finally 
contracted to the smaller spaces for implementation on the 
original system. 

The earlier results on the inclusion principle, however, 
were restricted to the expansions and conlractions of the 
stale space only. Expansions and contractions of the input 
and output spaces were first considered by Ikeda and Siljak 
(1986) and by Ohta ct al. (1986) Later Iftar and O/guncr 
(1990) considered coni roller design with stale and input 
inclusion, and introduced a special ca.sc ol inclusion called 
extension. It was shown that if the extension approach is 
u.scd then any control law designed in the expanded spaces is 
contractible to the original spaces for implementation. 
Contractibility is required in order to preserve the desired 
rclation.s l>elween the expanded and the original systems 
following the application of appropriate controllers nr 
estimators 

In the present paper, the extension principle is generalized 
to the case where the output space is also expanded besides 
the stale and the input spaces. Estimator and controller 
design problems are considered within the framework of the 
extension principle. Decentralized estimator and controller 
design with overlapping decompositions is also discussed 
within the same framework. It is shown that if the extension 
principle is used then any estimator or controller designed in 
the expanded spaces is contractible to the original spaces for 
implementation, Note that, this property may not hold if 
.some other form of inclusion is employed (c.g. sec Ikeda and 
Siljak, 1986). Furthermore, it is shown that if an estimator 
designed for the expanded system achieves good estimation 
and/or asymptotic estimation then ihe contracted estimator 
also achieves good estimation and/or asymptotic estimation, 
respectively, for the original system. Similarly, it is shown 
that if a controller designed for the expanded system 
achieves stability and/or good performance then the 
contracted controller also achieves stability and/or good 
performance for the original system. 

Throughout the paper, denotes the A:-dimcnsional real 
vector space, 4 denotes the identity operator on IR^, 
denotes the space ol m x n real matrices, .V" denotes the 
space of M X M symmetric positive semi-definite real matrices, 
for x e IR" and W e .V'’, \\x\\y^ denotes the weighted 

semi-norm x’Wx, and (■)^ denotes the transpose of f ). 

2 Extension principle 

In this section a special case of inclusion, called extension, 
is introduced. Consider the following linear time-invariant 
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(LTI) systems: 


Proof. Given in the Appendix. 


n 


and 


i = A-r + Bu, 

y - Cx, 


^ X ~ AxBu, 


(I) 

Nexl we consider the outputs; 


H’ ^ Dx G IR *. 

(7) 

(2) 

of the system 2 and 

vv - Dx e 

(H) 


Here jr e IR", aefR'”, and yEU‘ arc, respectively, state, 
input, and output vectors of the system 2. Similarly, x e [R”, 
u e IR'", andJ'elR^ are state, input, and output vectors of 
the system Z. The outputs y and y are assumed to be 
measurable. It is also assumed that h ^n, and 1^1. 

In the sequel, the system 2 is referred lo as the original 
system and 2 is referred to as ihe expanded system. The 
state, input, and output spaces IR”, IR"", and Ifi' of 2 arc 
called, respectively, original state, input, and output spaces; 
similarly, the spaces IR”, IR"*, and of 2 arc called 
expanded state, input, and output spaces. 

Consider the transformations: 

and 

RR^'^L. (3e) 

Definition 1. The system 2 is an extension of the .system 2, 

and 2 is a disextcnsion of 2, if there exist transformations as 
in (3a)-(3c) such that lor any initial .state a„ 6[R” of the 
system 2 and any input u(t) e IR"^, l):s r of the system 2 
the choice 

if, = 7>„. (4a) 

and 

u(t) = Ru(t), V/^0. (4b) 

implies that 


of the system 2. These outputs arc not necessarily 
measurable, but it is assumed that estimates of these 
quantities are desired. It is also a.ssumed that k ^ n, k-^n, 
and k^k. In addition to the transformations (3a)-(3c), we 
define the following transformation: 

rank(C>)-Jt. (9) 

We now introduce the inclusion principle lor the outputs to 
be estimated: 

Definition 2. The output (R) of the system 2 includes the 
output (7) of the .system 2 if there exists transformations as 
in (3a), (3b), and (y) such that for any initial state e of 
the .system 2 and any input w(r)elR'”, c», of the 

system 2 the choice (4a)-(4b) implies that 

w(/; x) - C^vv(t; jf), V/ :^(). (10) 

The necessary and sufficient conditions for the inclusion ot 
outputs to be estimated arc provided by the following 
theorem: 

Theorem 2. The output (R) ol the system 2 includes the 
output (7) of the system 2 if and only if there exist 
transformations as in (3a), (3b), and (9) such that 

DA‘ - QDA'T, 

and 

DA’UR = QDA’B, (lib) 

for all / c {(), 1.2, .. ) 

Proof The proof follows similar lines lo the prool ol 
Theorem 1. fJ 


rank (T) - n, 

(3a) 

rank (/?) - m. 

(3b) 

rank (.V) ~ /, 

(3c) 


(3cl) 


j(t; jf,), u) = 7x(t; Xf), w), V/^O, (5a) 

and 

y(/; x) - 5’y(t, i). Vt^O. (5b) 

Remark 1. Ilie extension defined above is a generalization 
of the extension first defined by Iftar and Ozgiiner (1990) to 
the case where the output space is also expanded bc.sides the 
state and the input spaces. It is a special case of inclusion 
defined by Ikeda and Siljak (1986). In fact, it is a 
generalization ol unrc.striction, first di.scus.scd by Ikeda et al 
(1984), to the case of .state, input, and output inclusion 
However, it is dififerenl than the unreslriction defined in 
Ikeda and Siljak (1986), where unrestrictioii was defined for 
an arbitrary input u(0 in the original input space, and the 
input in the expanded space is obtained by a transformation. 
u{t) - R*^u{t). Here, on the other hand, the extension is 
defined for an arbitrary input ii(0 in the expanded input 
space, and the input in the original space is obtained by the 
transformation given in (4b). I hercforc, for the unrcslnction 
the allowable set of inputs for 2 at any time i.s only an 
m-dimensional subset of IR"’, but for the extension it is IR'". 

The necessary and sufficient conditions for the extension 
are provided by the following theorem: 

Theorem 1. The system 2 is an extension of the system 2 if 
and only if there exist transformations as in (3a)-(3c) such 
that 


If the expanded system is an extension of the original 
system, then a simpler condition may be found. 

Corollary 1 Given that the .system 2 is an extension ol the 
system 2, the output (8) of the sy.stem 2 includes the output 
(7) of the sy.stcm 2 if there exists a translormalion as in (9) 
such that 


l) Qi)I\ (12) 

where T is the transformation satisfying (6a) 

Proof. For i - 0 (lla) holds if and only it (12) holds. Given 

that (6a) holds, (iia) hold.s Vi r. (0, 1.2_} if (12) holds. 

given that (6a)-(6b) hold, (llh) holds V/ t {(), 1.2. .. .) if 
(17) holds. Hence, the result follows. □ 


3. Estimator and controller design with exienMon 

In this section, we discuss the design of LTI estimators and 
controllers by using extension. A LII estimator for the 
system 2 can be de.scribcd by: 


i -= fz + Gy 3- Eu, 
V = Hz + Ky, 


(13) 


where z is the state and neIR* is the output of the 
estimator 1'^; the output v of Iis an estimate of w given in 
(7). 

A LTI controller for the system 2 can be described by: 


TA - AT, 
TBR - B, 

5’C=CT. 


(6a) 

(6b) 

(6c) 


z - Fz -\r Gy, 
v = Hz-^ Ky, 


(14) 


where z e IR'" is the state and e R"' is the output of the 
controller F,.; the output v of F^ is applied to the input of the 
system 2 in order to control it; i.e,: u v. 
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A LTl estimator for the system t can be described by: 


jr, . z = Pi Gy, 


(15) 


where is the state and is the output of the 

estimator the output v is an estimate of w given in (8). 

Similarly, a Lfl controller for the system 2 can be described 
by: 


z ~ Fz Gy, 
ff = Hz-\- ky, 


(16) 


where i e R'’ is the stale and 0 is the output of the 
controller \\, the output u is applied to the input of the 
system 2 for control purposes; u = v. 

It is assumed that the dimensions of r,. and T, do not 
exceed the dimensions of V,. and respectively (i.e. p ^p 
in both cases); this assumption may be justified since 2 is a 
part of 2, and thus should not require a larger dimensional 
estimator or controller (Ikeda and Siljak, 1986). In addition 
to the transformations (3a)-(3c) and (9), wc define the 
following transformation: 


rdnW{P)^p. (17) 

Wc now introduce contractibility for estimators and 
controllers: 


Definition 3a. The estimator (l.^i) is contractible to the 
estimator (13) if there exist transformations as in (3a), (3c), 
(9), and (17) such that for any initial state of the 

system 2, for any input M(r) € 0 :i -1 < of the system 2, 

and for any initial stale z„ e R'' of the estimator f,. the choice 


tibility, in this case, must be defined for an arbitrary u(/) in 
the expanded input space. The outputs w(/) of and u(r) of 
must be such that when the loops arc closed (i.e, when 
M(r) = u(r) and u(f)-i)(r)) the condition (20b) is satisfied. 
The requirement (21b) ensures this condition. 

It is important to satisfy contractibility in order to preserve 
the desired relations (.such as stability, good performance, 
and good estimation) between the expanded and the original 
systems following the application of the appropriate 
estimators or controllers. First, let us introduce the following 
definitions. 

Definition 4a, Given a system (such as 2), we say that an 
estimator (such as F^) achieves good estimation with respect 
to a given weight W e and a given scalar tolerance 
function g(0 if ||u(0” ”^’(011 w — where p(/) is 
the output of the estimator, and w(r) is the output of the 
system which is to be estimated. 

Definition 4b. Given a system (such as 2), wc say that a 
controller (such as F^) achieves good performance with 
respect to a given trajectory function /(/)eR", , a 

given weight W e S", and a given scalar tolerance function 
S(t) if ||x(0-/(/)|lw “^(0- V/^O, where x(/)elR'’ i,s the 
state of the rontrolled system. 

Now wc can prove the following resulls; 

Theorem 3a. Suppose that the output (H) of (he system 2 
includes the output (7) of the system 2 and that the estimator 
F,. is contractible to the estimator F^.. Moreover, suppose 
that the estimator is applied to the system 2, that the 
estimator f\. is applied to the system 2, and that (18a) -(18c) 
hold. Then. 


X(i — Txf], 

(IHa) 

u(t) = R**u{t), \ft^{}, 

(l«b) 

and 


^(1 “ 

(18c) 

implies that 


z(t; z„, >. u) - Pz{t\ i„, V. m), V/ =^:(). 

(19a) 

and 


v{t; z, y) =- Qo{t\ z. v), Vt -^0. 

(19b) 


(a) If \\ achieves asymptotic estimation then ]\ achieves 
asymptotic estimation, i.e. if lim, ... (u - w) ^ 0 then 
limf (i' - w) - 0 , 

(b) If F,. achieves good estimation with respect to Q^WQ 
and g(0, ihcn F^ achieves good estimation with respect 
to VT and j?(0. 

Proof, (a) Note that, by (19b) C>f' = n, and by (10) ()vv w\ 
tnerefore, (i^ - w) - (>(0 - w), from which the desired result 
follows. 

(b) Note that, II i'(0 “ -- #f(0 implies 

||C?(fi(0 ~ w(f))l|^,^ g(/). Wc have alieady shown that 

Q(v -w)^ (v - M^); thus ||ii(r) -- iv(0llw ' ^(0- □ 


Definition 3b. The controller (16) is contractible to the 

controller (14) if there exist transformations as in (3a), (3b), 

and (17) such that for any initial stale x,, e of the ^slem 
2, for any input u(t) t R"’, O^ r- of the system 2, and 
for any initial state i„ c IR^’ of the controller f. the choice 

7>„. (20a) 

M(0-f^M(0- V/5r0, (20b) 

and 

z., Fzn. (20c) 


implies that 


Theorem 3b. ^Suppose that 2 is an extension of 2 and that 
the controller F^ is contractible to the rontroller \\. 
Moreover, suppose that the controller F^ is applied to the 
sj'slem 2 and that the controller I\^ is applied to the sy.stcm 
2. Then; 

(a) Stability (respectively asymptotic stability) of the 

expanded closed-loop system (obtained by applying \\ to 
2) implies the stability (respectively asymptotic stability) 
of the original closed-loop system (obtained bv applying 
F, to 2); _ ^ 

(b) Assuming that Fx,, and z^^ = /%, if G. achieves good 
performance with re.spect to f{t) = Tf{t), 
W=(T^)^WT**, and g(0 then l\ achieves good 
performance with respect to /(/), W, and g(/) 


z(t] y, u) = Pz{t- 2n, y. w), Vr 0. (21a) 

and 

u(r; z, y) ^ y?r»(/; z, y), Vr=^0. (21b) 

Remark 2. Note that contractibility for an estimator is 
defined for an arbitrary input u{t) in the original input space, 
and the input u{t) for the expanded system 2 is obtained by 
the transformation (IHb). This is the natural choice, since the 
actual input applied to the real system 2 is M(r). In this case, 
there is no restriction on the input u(/) which is assumed to 
be applied to the fictitious system 2. In the case of a 
controller, however, since the controller is designed in the 
expanded spaces, u(r) must be arbitrary (othcrwi.se, there 
will be a restriction on the controllers that could be designed, 
e.g. see Ikeda and Siljak, 1986). Therefore, the contrac 


Proof, (a) If the expanded closed-loop system is stable 
(asymptotically stable), then the overall state vector 
(x(f; x„. z„)^f(t;X(), tbe closed-loop expanded 

system is bounded for all (and 

lim, [i(/;Xo. ^(t; Xo. ^ ^^) ^ 

all z„e By (5a) and (21a), the overall state veclor of the 
original system has the properly: 


rx(r.Xo.z„)l^rF*^ ()-|fx(t;Fx,.F''z,)l 

Lz(/;xj>, z„)J [ U p\[z(t\Tx^^, P^Zi,)\' 

where P* satisfies PP** = and thus, 

[x(i; Xti, z,i) 2 (t; x„, Zp) ] , 
is also bounded for all r art) (and 

lim, ,^lx(t;x,„ Zo)^z(r;Xo, z,,)^]' 


( 22 ) 
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for all Xq^ T^U" -U'’ and for all = which 

implies that the original closed-loop system is also stable 
(asymptotically stable). 

(b) If \\x{t)-m\w^S{0 then \\T^J{t)-T**m\\w^ 
g{t)\ furthermore, since T^T - T**f(t)=f{t) and (5a) 

implies that T^x(t) - x(i); thus ||Ji(0-/(Olliv ^ #r{0 □ 

Remark 3. Recall that the estimator or the controller is to be 
designed in the expanded spaces and then contracted for 
implementation. Thus, in the case of an estimator, if good 
estimation with respect to W and ^(/) and/or asymptotic 
estimation is desired for the original system under inputs 
u(t)e^c:R"’ and initial conditions a^, efclR" and 
2 oe 3 ^c:(R^, then the estimator must be designed to 
achieve good estimation with respect to Q^WQ and g(r) 
and/or asymptotic estimation for the expanded system under 
inputs u(/) 6and initial conditions Jt„ e # 3 7’.':? 
and i,, e 3 Similarly, in the case of a controller, if 

closed-loop (asymptotic) stability and good performance with 
respect to f{t), W, and g(0 are desired for the original 
system under initial aindilions a^ e c and e ci R'’. 
then the controller must be designed to achieve 
closed-loop (asymptotic) stability and good performance with 
respect to Tfit), and g(/) for the expanded 

system under initial conditions A(, e 3 7 F and 4 e#3 

The necessary and sufficient conditions for the contrac- 
tibility of estimators and controllers are given by the 
following theorems. 

Theorem 4a. The estimator (15) is contractible to the 
estimator (13) if and only if there exist transformations as in 


(3a), (3c), (9), and (17) such that 

FP=PF, (23a) 

GCAf = PGCA’TM Vy e {0. 1, 2. . . .). (23b) 

GCA^B = PGCA'BH**, Vy e (0, 1, 2, . . . (23c) 

E=PER^, (2.3d) 

HP-QH, (23e) 

KCA' QKCA'T, Vy e {(), 1,2,...}, (23f) 

and 


E=PER^, 

(25c) 

HP=QH, 

(25d) 

KC=QkSC, 

(25e) 


where S is the transformation satisfying (6c) and is a 
matrix satisfying RR^ I„,, where R is the transformation 
satisfying (6b). 

Proof. Given (6a)-(6c), it can be shown that (23b) and (23c) 
both reduce to (25b), and (23f) and (23g) both reduce to 
(25e). □ 

Corollary 2b. Given that the system I! is an extension of the 
system 1, the controller (16) for the system ^ is contractible 
to the controller (14) for the system 1 if there cxi.sts a 
transformation as in (17) such that 


FP = PF, 

(26ii) 

GC = PGSC, 

(2Ab) 

HP - RH, 

(26c) 

KC^ RkSC, 

(26d) 


where .V and R arc the transformations satisfying (6c) and 
(6b), respectively: 

Proof. Given (6a)- (6c), it can he shown that (24b) and (24c) 
both reduce to (26b), and (24c) and (24f) both reduce to 
(26d). [1 

Since the estimator/controllei is to be designed in the 
expanded spaces and then contracted lor implemcnlahon, it 
is important that any estimalor/controller designed in the 
expanded spaces be contractible. In fact, if X is an extension 
of Z then such a property holds. 

Corollary 3a. 11 the system Z is an extension of the system Z 
then any estimator of the form (15) for the system Z is 
contractible to an estimator ol the form (13) for the system Z 
with. 


KCA'B = QkCA'BR“. V/ e (0. 1,2, . . (23g) 

Proof. The proof (ollow.s similar lines In the proof of 
Theorem 1. U 

Theorem 4b. The controller (16) is contractible to the 
controller (14) if and only if there exist translormaiions as in 
(3a), (3b), and (17) such that 


FP - PF, 


(24a) 

GCA' =- PGCA'T, 

V/ e {0, 1, 2. . 

, .) (24b) 

GCA^BR - PGCA'B, 

V)e (0. 1,2,, 

..) (24c) 

HP = RH, 


(24d) 

KCA' = RkCA^T, 

Vy f (0, 1,2, . 

■ ■) (24e) 

and 



KCA^BR = RkCA^B, 

V/e |0, 1, 2, 

,,,}, (241) 

Proof. ITie proof follows similar lines to 
Theorem 1. 

the proof of 

□ 


The above conditions reduce to simpler ones if the 
expanded system is an extension of the original system. 


G = GS, 

ll = QH, K = QkS. 


(27) 


Proof. With P Ip, F, G, E, //, and K delined above satisfy 
(25a)-(2-5e). □ 

Corollary 3b. H the system Z is an extension of the system 
Z, then any controller ol the form (16) for the system Z is 
contractible to a controller of the form (14) for the system Z 
with; 


F^f\ G^GS, II^RH, K = RKS (2H) 

ProoJ With P ^ I^, F, G, fl, and K defined above satisfy 
(26a)-(26d). n 

Remark 4. Hodiid and Siljak (19H6) have suggested the use 
of aggregation for the design of estimators for large .scale 
systems. They showed that, for the case of stale expansion 
only (i e. when the original and expanded input and output 
spaces arc identical), if the original system (1) is an 
aggregation of the expanded system (2) then any estimator of 
the form 

i - Ax ^Bu-\^ k\y - Ct|, (29) 


Corollary 2a. Given that the system Z is an extension of the 
system Z, the estimator (15) for the system Z is contractible 
to the estimator (13) for the system Z if there exist 
transformations as in (9) and (17) such that 

EP = PF. (25a) 

GC^PGSC, (25b) 


for the expanded system is contractible to an estimator of the 
form 

A ™ y4jf + Ru 3-fC[y - C a], (30) 

for the original sy.slem. Here aeIR" and jpeIR" are the 
estimates of the slate vectors of the expanded and the 
original systems, respectively. Corollary 3a, on the other 
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hand, shows that if extension is used then any estimator of 
the form (15) (which is a more general form than (29)) can 
be contracted for implementation. The use of the more 
general form ((15) over the form (29)) may be useful, since it 
allows the design of decentralized estimators (note that (29) 
is not a decentralized ewStimator unless the system matrices A, 
B, and r can be obtained as block diagonal matrices, which 
can happen only in trivial cases). As will be illustrated in the 
next section, decentralized estimators for the expanded 
system that are designed using extension result in estimators 
for the original system which have a special overlapping’ 
decentralized structure Such a structure may be useful in the 
implementation of the estimators (the estimator (3U) lacks 
this structure unless the system matrices A, B, and C have a 
special structure). 



4. Overlapping decompositions 

In this section decentralized estimator and controller 
design with overlapping decompositions is discussed within 
the framework of extension. For nolational simplicity, only 
systems with two decentralized estimation/control agents arc 
considered. The extension of the results to systems with 
more agents is straightforward. 

Consider the system 2 given m (1). Suppose that the state, 
the input, and the output arc partitioned as; 


and 


Here it is assumed that jt^. u,, and vs correspond to the 
overlapping parts of the stale, input, and output spaces, 
respectively. Then, by choosing appropriate transformations 
and complementary matrices, an extension I, as described in 
(2), of I can be obtained (for details see Iftar, IW). If an 
estimator is to be de.signcd, we also partition the variable to 
be estimated as follows; 


Fig. 1. Implementation of overlapping decentralized 
estimators. 




where G’leIR''' 






Gi e M'" 




f - 1. 2, 


K\ 


\ t'l c 
elR"-’"''. a:;. 




(35b) 


and p, 


x-=^[xlxj.x{)'. 

jr, e ff 

? = L2, 3. 

(31a) 

given by (13), where 


u = (uj. u', u')', 

M, 6l 

f - 1. 2, 3, 

(31b) 

F - F, G - 

fGl GJ 

L (1 Gr 

y = (y^. v/. vD', 

V, fP'', 

/ - 1. 2, 3. 

(31c) 


{F\ 0 


is the dimension of z, (/ = 1, 2). The contraction of the above 
described overall expanded estimator to the original spaces is 


(36a) 


w'^ (W|^ wj, K', eK*', f^t,2, 3. 

The matrix D in (7) is also partitioned compatibly: 


(32) 


[) - 


On 


thy 

thy 


Ih^ 

l>2y 


//- H\ 
0 


Hi 


K'z, 

0 


tth + K\, 

Kh 


IT- 

^ IZ 


(36b) 


The implementation of these overlapping decentralized 
estimators (which form the contracted estimator Pc) is 
illustrated in Fig. 1. Assuming that the decentralized 
estimator P^. achieves asymptotic estimation and/or good 
estimation when applied to the expanded system 2, by 
I'hcorem 3a, the above described contracted estimator I\. 
achieves asymptotic estimation and/or good estimation when 
applied to the original system 2. 

Next, suppose that local controllers described by: 


I'hc corresponding variable to be estinialed for the expanded 
system is given by (8), where, for an appropriate 
transformation (1, 


D ^ 


Ou 

Oz. 

0 

Om 


Z),; (I D,, 

0 

0 D,*. D,, 


Suppose that local estimators described by: 


p;.: 


2, - b\z, -H G,y, E,u„ 

o, = H,z, + a:,v, 


/ - 1. 2. 


(33) 


are designed such that the overall expanded estimator I ^ 
described by (15), with 

F - blockdiag {F,, F,). C; = blockdiag (C/,, G.), (34a) 

blockdiag {/Zi, ZA). blockdiag {F’j, F 2 ), (34b) 

and 

k = blockdiag (A',, K^), (34c) 

satisfies the design requirements (such as asymptotic 
estimation or good estimation) when applied to the expanded 
system 2 . Let us partition the matrices of the local estimators 
as follows: 


c, = [G', cy, £, = [£'1 £2], P‘'“) 


f, — F,z, + G^y\ 


i = 1. 2. 


(37) 


arc designed such that the overall expanded controller Y\ 
described by (16), with 

F -- blockdiag {F|, Z^), G ^ blockdiag {G,, G^}, (3Ha) 
/Z= blockdiag {//,, IF], A - blockdiag {A,, Kj], (38b) 


satisfies the design requirements (such as stability and/or 
good performance) when applied to the expanded system 2. 
Let the matrices of the local controllers be partitioned in a 
similar way to the partitioning of the local estimator matrices 
given in (35a)-(35b). The contraction of this overall 
expanded controller to the original spaces is then given by 
(14), where the matrices F, G, H, and A are obtained from 
the matrices of fj and P^ as described in (36a)“(36b). The 
implementation of this contracted controller, which consists 
of overlapping decentralized controllers, is illustrated in Fig. 
2. ASvSuming that the designed decentralized controller P,, 
achieve.s stability and/or good performance when applied to 
the expanded system 2 , by Theorem 3b, the above described 
contracted controller P^ achieves stability and/or good 
performance when applied to the original system 2. 

Remark 5. Comparing the contracted controllers obtained in 
this section and those obtained by Ikeda and Siljak (1986), it 
is seen that they both have the same structure. However, 
there is an important difference in the intermediate design 
stages. Since unrestriction is used in Ikeda and Siljak (1986), 
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Fig. 2. Implementation of overlapping decentralized 
controllers. 


the controller designed for the expanded system may not be 
contractible for implementation on the original system. It is, 
therefore, required to modify the designed controller (as 
shown in Ikeda and §iljak, 1986) before contraction. This 
modified controller, however, need not satisfy the design 
requirements (e.g. stability or good pjcrformance) even if the 
unmodified controller does. Therefore, either the expanded 
closeddoop system with the modified controller or the 
original closed-loop system with the contracted controller 
must be tested for these requirements; and the design must 
be repeated if these requirements are not satisfied. On the 
other hand, the controllers designed using the present 
approach can be contracted directly without the need of any 
modifications, testing, or redesign, since the contractibility of 
these controllers is guaranteed by Corollary 3b. 

5. Conclusions 

The extension principle has been generalized to the case 
where the output space is also expanded besides the state 
and the input spaces. This principle is conceptually close to 
the unrestriction defined by Ikeda and Siljak (1986). 
However, the unrestriction was defined for an arbitrary input 
in the original input space, and the input in the expanded 
space was obtained by a transformation. This fact brings in 
the restriction that a particular estimator or controller 
designed for the expanded system by using unrestriction may 
not be contractible to the original spaces. It has been shown, 
however, that with the approach undertaken here any 
estimator or controller designed in the expanded spaces is 
always contractible to the original .spaces. 

Furthermore, it has been shown that, if an estimator 
designed for the expanded system achieves good estimation 
and/or asymptotic estimation then the contracted estimator 
also achieves good estimation and/or asymptotic estimation 
for the original system. Similarly, if a controller designed for 
the expanded system achieves stability and/or good 
performance then the contracted controller also achieves 
stability and/or good performance for the original system. 

Finally, decentralized estimator and controller design with 
overlapping decompositions has been discussed within the 
framework of extension. Although systems with only two 
decentralized agents have been presented for notational 
simplicity, the extension of the results to systems with more 
than two agents is straightforward. 

Acknowledgement —^The author wishes to thank the referees 
for their constructive comments. 
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Appendix 

Proof of Theorem 1. The responses of the two systems (1) 
and (2) are given by 

.v(f) = C''''»-|, + [ c"'' '’«M(T)dT. (V)) 

and 

J(r) = c^'jf„4 [ e'^*' ^‘Rw(T)dr. (40) 

J[) 

respectively. By substituting (4a) and (4h) lor Ci ^nd M(r). 
respectively, and using the power senes expansion lor the 
matrix exponentials, it can be shown that (5a) holds lor any 
and any u(t) if and tmly if 

A'T^TA\ (41a) 

and 

A'B = TA'BR, (41b) 

for all i e (0, 1,2, . . .) Similarly, (5b) holds for any x^^ and 
any u{t) if and only if 

CA'T^SCA', (42 a) 

and 

CA'B --SCA'BR, (42b) 

for all 1 6 (0, 1, 2, ...). Now it is straightforward to show the 
only if part, since (6a) can be obtained from (41a) with / -^ 1. 
(6b) can be obtained from (41b) with i -0, and (6c) can he 
obtained from (42a) with i ~ 0. To show the if part, note that 
for i = 0 (41a) holds trivially; for i = 1 (41a) holds if and only 
if (6a) holds, lor i ~0 (41b) holds if and only il (6b) holds; 
and if (6a) holds, then (41a) holds for / =^2 and (4lb) holds 
for 1 ^ 1 . Similarly, for / - 0, (42a) holds if and only if (6c) 
holds, and given that (6b) holds, (42b) holds if (6c) holds, 
and for i sjt 1, (42a) and (42b) hold if (6a)-(6c) hold. Hence, 
the result follows. 
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Spectral Factorization of Linear Periodic 
Systems with Application to the Optimal 
Prediction of Periodic ARMA Models* 

SERGIO BITTANTlt and GIUSEPPE DE NlCOLAOi 

Key Words —DivScretc-time systems; time-varying systems; spectral factorization; zeros; prediction 
theory; Kalman filters; ARMA models; periodic systems; cyclostationury processes. 


Abstract —A cyclostationary process is a stochastic process 
whose statistical parameters, such as mean and auti>correla- 
tioii, exhibit suitable periodicity. In this paper, we consider 
the cyclospectral factorization problem which consists of 
finding a Markovian (state-space) realization of a given 
cyclostationary process. It is shown that a significant class ol 
periodic state-space representations is in a one-to-one 
correspondence with the periodic solutions of a ddference 
periodic Riccati equation. This result is applied to the 
solution of the prediction problem for ARMA models with 
periodically varying coefficients. If the periodic ARMA 
model is minimum-phase, the optimal predictor is given a 
simple input-output expression that generalizes the well- 
known one for time-invariant ARMAs. Otherwise, the 
computation of the optimal predictor calls for the solution of 
a cyclospectral factorization problem. 


1. Introduction 

A CYCLOSTATIONARY (CS) process is characterized by the 
fact that its statistical parameters (mean, variance, 
autocorrelation) are periodically time-varying (Gardner and 
Franks, 1975). Processes of this type naturally arise 
whenever some form of periodicity is involved and their 
applications range from the modeling and prediction of 
seasonal phenomena, see e.g. Vecchia (19155), to the 
synthesis of communication systems (Gardner, 1991). In 
trying to develop a system theory approach to the study of 
such processes, they have been characterized as the output of 
a linear system (fed by while noise) whose impulse response 
exhibits some suitable periodicity (Gardner, 1975) or 
alternatively, as the output of linear input-output models 
with periodic coefficients (periodic ARMA models) (Jones 
and Breisford, 1967; Pagano, I97rt). So, apart from basic 
issues dealt with in Bittanti (1986), little is known about the 


• Received 4 October 1990; revised 3 March 1992; received 
in final form 8 June 1992. A preliminary version of this paper 
was presented at the 11th IFAC World Congress on 
Automatic Control at the Service of Mankind which was held 
in Tallinn, Estonia during August 1992. The Published 
Proceedings of this IFAC Meeting may be ordered from: 
Pergamon Press Ltd, Headington Hill Hall, Oxford, 0X3 
OBW, U.K. This paper was recommended for publication in 
revised form by Associate Editor T. Ba^ar under the 
direction of Editor H. Kwakernaak 

t Dipartimento di Elettronica, Polilccnico di Milano, 
Piazza Leonardo da Vinci, 32 , 20133 Milano, Italy. Fax; 
+ 39 . 2 . 23993587 , email: BITTANTiro^IPMELLPOLIMLIT. 

I Centro Tcoria dei Sistcmi-CNR, c/o Dipartimento di 
Elettronica, Politecnico di Milano, Piazza Leonardo da Vinci 
32 , 20133 Milano, Italy. Fax; + 39 . 2 . 23993587 , email; 
DENICOLAO(a)IPMELLPOLIMI.lT. 


Markovian (state-space) rcprcscnialion of CS processes. In 
particular, it would be highly desirable to characterize the set 
of Markovian representations of the same CS process as well 
as invc.stigate the existence of innovations representations 
leading, in analogy with the stationary case, to stable 
whitening fillers. 

The main objective of this paper is just to lay the 
foundations of a state-space theory of discrete-time CS 
processes. In particular we will examine the following 
problem: given a CS proces.s which has a finite-dimensional 
linear periodic realization, characterize a whole set of 
alternative periodic state-space representations of the 
process. The main result of the paper is that there exists a 
bijective correspondence between, the innovations-like 
representations and the periodic solutions of a certain 
Difference Periodic Riccati Equation (DPRE). Furthermore, 
by making reference to the notion of zeros of a periodic 
system, it is shown that there exists a unique minimum-phase 
(canonical) representation corresponding to the (unique) 
stabilizing periodic solution of the DPRE. 

Besides being of independent interest, such a “cyclosp- 
cctral” factorization theory is shown (Section 5) to play a key 
role in the prediction problem for (univariate) ARMA 
models with periodic coefficients, i c. 

- i)yit - i)+^ eXf - f)e{t - + C{l), (1) 

»’ I (1 

where fly( ) and r,(’) are periodically varying parameters; 

aXt) = aXt^T), r,(0-(t+7). 

and f() is a While Gaussian Noise (WON) with nonsingular 
periodic variance: 

c()'-WGN(0, R()). H{i)^R{t + T), V/ 

In the paper, wc will provide a complete solution to the 
previous problem for both minimum- and nonminirnum- 
phasc PARMA models. 

rherc IS a strict parallel arrangement between our theory 
and spectral factorization theory for stationary stochastic 
processes. Now, it is well known that discrete-time periodic 
systems admit an alternative representation in terms of a 
time invariant system with augmented input and output 
vectors (lifted representation) see Section 2. This kind of 
isomorphism between periodic and invariant systems 
somewhat justifies the striking analogies between spectral 
and cyclospectral factorization theory, and one may also 
expect that, by the use of the lifted representation, the 
discrete-lime periodic results can be derived as (more or less) 
straightforward corollaries of the time-invariant ones. 
However, this is not entirely true. As a matter of fact, the 
analysis carried out in Bittanti and De Nicolao (1990) mainly 
relied on the correspondence between a periodic system and 
its time-invariant lifted representation. This line of reasoning 
allowed us to show that any positive semidefinitc solution of 
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a suitable DPRE gives rise to a representation of the given 
CS process, but failed to extend the same result to negative 
semidehnite and nondcfinite periodic solutions. Conse¬ 
quently, no bijective correspondence could be established 
between periodic solutions and innovation-like 
representations. 

Another possible approach to the study of the 
cyclospectral factorization problem could consist of specializ¬ 
ing to the periodic case known results in the spectral 
factorization of nonstationary processes, sec e g. Anderson et 
al. (1%9), Kailath (1969), Anderson and Moylan (1974) and 
Anderson and Moore (1979). In these papers, nonslationary 
spectral factors are associated with the .solutions of a 
time-varying Riccati equation. The natural way to select the 
periodic spectral representations out of the infinite 
time-varying ones is to eliminate the transient effect due to 
the particular choice of the initial condition of the Riccati 
equation. However, according to Anderson et al. (1969) the 
factorizations obtained by moving the initialization time of 
the Riccati equation towards the infinitely remote past, turn 
out to be independent of the particular initial condition. 
Hence, such an approach would provide only one periodic 
spectral factor. 

In the present paper, most of the analy.sis is carried out by 
exploiting tools belonging to the theory of linear periodic 
systems, with the time-invariant lifted representation playing 
a marginal role. Such a ‘periodic approach' performs better 
than lifted representations or the ‘periodicity’ ol time-varying 
results, in that it succeeds in establishing a bijective 
correspondence between innovations-likc rcpre.sentations 
and periodic solutions of the DPRE. The main results of the 
paper are the periodic extension of classic lime-invariant 
results, such as in Lancaster et al. (1986), to mention a recent 
discrete-time reference. However, it is worth mentioning that 
the translation of the proofs of Lancaster et al. (1986) to the 
periodic case is not trivial. In fact, when dealing with 
periodically time-varying systems one cannot resort to zeia 
transforms, and alternative rationales m the time domain 
have to be worked out. Among other things, our analysis 
calls for a new property of the periodic solutions of the 
DPRE (Theorem 2). 


2. Frelimmanes 

2.1. Periodic systems and cyclostafionary processes. 
Consider the .stocha.slic periodic system 

Jt(t J) = ^■(0■^(0 C7'(t)v(t), (2a) 

y(t) = H(t)x(t)i' h^(f). (2b) 

Matrices T().Z —G(and H().Z~* 
are periodic of period 7 ; i;( ) and iv( ) are white 
noises with Ciaussian distribution, zero expected value and 
covariance matrix; 


Var (i;(/)'w(f)’ 


QU) 

S{ty 


S{t) 

RiO 




where Q(t) = QU + T), S(t) - SU + 7 ), and RU) = R(t ^ T) 
are periodic matrices of appropriate dimensions, and 
R{t)>i), Vr. 

Denoting by W;.(t, t) the transition matrix relative to F( ), 
let 


i^^{t) = yp,it-^T, t), 

be the monodromy matrix at r In general 0;.(t) depends on 
t; however its eigenvalues, which arc called characteri.stic 
multipliers turn out to be time-independent, see e g. Bittanti 
(1986). The characteristic multipliers arc the periodic 
equivalents of the poles of a time-invariant system. Indeed, 
system (2) is (asymptotically) stable iff all its multipliers 
belong to the open unit disk If this stability condition is met 
with, then any solution y(’) asymptotically converges to a 
zero-mean cyclostalionary process (Gardner and Franks, 
1975) of period T, i.e. a proce.ss 5() such that 
y(t, /) y(T+ T, f T), where y(t, t) == Eff(T)5(01 
In the sequel, the following observability criterion, that 


straightforwardly follows from Lemma 1 in Bittanti et al. 
(1986), will prove useful. 


Observability criterion 

System (2) is completely observable iff rank [VfO] n, \ft, 
where 

V(t) = lH(ty ^,(/-f l.f)'7/(r+l)' 

+ 1 , tyHit-hnT-\yy. ■ 


As for stahilizability and detectability, several equivalent 
criteria have been worked out. In particular, one of these 
criteria stales that system (2) is stabilizabic (detectable) iff its 
uncontrollable (unreconstructable) pari is asymptotically 
stable. A modal notion of stahilizability can also be 
introduced by making reference to the .stabi’oy of the 
so-called unreachable characteristic multipliers. Precisely, a 
characteristic multiplier A is said to be (F( ), G( ))- 
unreachable at time t (Bittanti and Bolzern, 1985) if there 
exists f} ^ 0, such that 

cp,.(T)G; = Ar/, G(/- l)''F,(T.y)'7; = 0. 

Vy e |t - /' 4 1. t|. 

For more discu.ssion on the basics of periodic systems see 
Bittanti (1986). 

2.2. ZeroA of periodic systems. A consistent definition of 
zeros of discrete-time periodic systems has been first given in 
Bolzern et al. (1986). To be precise, consider the system 

A(t4- \)-A(t)x{t) 4 fl(f)w(/), (3a) 

y(/)-n0jr(r)4Z)(/)u(r), (^b) 

with A( ), Z?( ), r( ), D( ) periodic matrices of period T 
Letting 

H^ = \yVJr-^T, T4 l)/f(T) ML,(T-h T r 4-2)71(1 4 1) 

• H(i:+r- 1 )l, e/?"""’ 

c;-|c'(T)' T^(T-f 1 , T)'r(T+D' 

■ • + r-1. T)'C(T * /• 1)']', 

D. i.j-l.2 .7. 

f 0 !<./ 

(D,)„ = ]d(t + /-I) i-i, 

[ (’(t -t- I - + I - I, t +y)/J(T +; - 1) i >] 

u,(A) = |u(t + TcT")’ u(T + A7'fl)' 

■ • • u(r + (A + 1)7' - 1 )']' 

consider the lime-invariant system 

x^(k 4 1) - A.x^ik) -I- (4a) 

yA^) = ^ D^u^(k). (4b) 

Definition 1 (Bolzern et al., 1986).z is a zero at time t of 
system (3), if il is a zero of the time-invariant system (4), i.e 
if del N{z) — 0, where 


N(z) = 


zI~A, 

Cr 



As discussed in Bolzern et al. (1986) and Gra.sselli and 
Longhi (19HH), the non-zero zeros together with their 
multiplicities are in fact independent of r. In Bolzern et al. 
(1986) It has also been proven that one can associate to any 
zero of the system a suitable initial state and input function 
that result in the null output. In other words, the 
transmission-blocking property that characterizes the zeros of 
linear time-invariant systems holds in the periodic case too. 
In Grasselli and Longhi (1988), it has been shown that zeros 
of periodic systems are invariant wilh respect to state 
feedback. The invariance with respect to output feedback, as 
stated in the following lemma, also holds (Bittanti and De 
Nicolao, 1992a). 
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Lemma 1 Consider the following periodic system generated 
from system (3) by means of a j>eriodic output feedback 

i(r + 1) ^A{t)x{t) + Eiorn + (5a) 

y(0 = C(0x(0 + O(0M(0. (5b) 

where £(•) is an arbitrary periodic matrix of suitable 
dimension. Then, the zeros at time t of (3) coincide with the 
zeros at r of (5). 

3 The DPRE and its periodic solutions 
Associated with system (2) is the discrcte-lime periodic 
Riccati equation 

p{t +1) = F{t)PU)F{ty + G{t)omty 

- \GU)S(t) + F{t)P(t)H{iy\lH(t)Pii)H{ty 
+ R{t)\ ‘\G{l)S(i) + F{l)PU)H(iy\', (6h) 

or, equivalently, 

P(t-H) ^ A(i)P{t)A{iy + ii(i)B(iy 

- A{i)P(i)uuy\H{,)PU)H{iy 

+ «(()) 'H{l)P{l)AUy. (6b) 

where 

AU)= F(l)-GU)S{l)R(n '//(ri. (7a) 

fl(f)fi(f)' = 6'(0C^(J)6’(f)' - 

- f;(/).v(r)/?(() '.s(n'c;(()'. (7b) 

Several results on the periodic solutions of such an 
equation are available, sec Bittuiiti et al (1‘^KHa, lOMO, 1991), 
dc Souza (1989a, 1991). In the sequel, particular attention 
will be paid to the periodic solutions delincd below, where 
the symbol A(i) denotes the closed-loop matrix, which can 
be given two equivalent expressions as follows 

A(i)=-AU)-\{t)!i{i)- t U)- («a) 

K(t) = \G{i)S{t)^ F{i)P{t)U{t)\ 

x|W(t)r(()W(0'+ «(«)! (Kb) 

A(l)P{l)H(iy[H{l)PU)HUy + RU)] ‘ (Sc) 

Definition 2. A symmetric periodic solution Pi ) ol the 
DPRE is said to be 

maximal if P{l)^Pi ), Vt, for all symmetric periodic 
solutions Pi ) ol the DPRE; 

strung it the charactenslic multipliers ol /1( ) belong to the 
closed unit disk; 

stabilizing if the characteristic multipliers of A( ) belong to 
the open unit disk. ■ 

As for the Symmetric Periodic Positive Semidefimte (SPPS) 

solutions, the lollowing results can he found m the literature 
(Bitlanti cr fl/., I9KKa, 1990; dc Souza, l9K9h, 1991), 

Theorem 1. 

(1.1) If (>!(■),//(■)) is detectable, then the DPRE admits a 
maximal symmetric periodic solution P^i ), which is in 
fact positive .semidelinile and strong. 

(1.2) If (A( )fl( )) is stabilizable and (/4( ), Hi )) detectable, 
then the DPRE admits a unique SPPS solution, which 
is in fact stabilizing. 

(1.3) The DPREi admits a stabilizing solution ill (A( ), //(■)) 

is detectable and there is no unit-modulus (/!( ), ff('))- 
unreachable characteristic multiplier. Moreover, such 
a stabilizing solution Ps( ) is unique and coincides with 
the maximal one. ■ 

Methods for the numerical compulation of the maximal 
solution include direct integration, quasilinearization (Bit- 
tanti et ai, l9K8a) and time-invariant reformulation (Bittanti 
ei al., 1990). A comparison between the merits of these 
different approaches can be found in Bittanti et al. (1988b). 

Later on, we will resort to the following theorem, the 
proof of which, given m Bittanti and De Nicolao (1992b), is 
here omitted for the sake of conciseness. 


Theorem 2. Assume that the DPRE (6) admits a maximal 
solution P*( ). Then, Pi ) being any symmetric periodic 
solution ol the DPRE (6), H{t)Pii)H{ty + R(t) > 0, V/. ■ 

4. Periodic innovations representation of cyclostationary 
processes 

Suppose that F( ) is asymptotically stable. Then, system 
(2) defines a unique cyclostationary process >^(-), and can be 
.seen as a stale-.space representation of such a process. We 
will now introduce a family of alternative state-space 
representations of the following typic 

i(i + l)-f(/)i(/) + L(r)F(/). (9a) 

>-(/)-H(f)i (0 + E((), ( 9 b) 

where e(/) is a zero-mean white noise with a periodic 
covariance matrix R(f)-R(f+r). Moreover, Lit) is a 
T-periodic matrix. If, for some R( ) and Li ), the 
cyclostationary process defined as the output of (9) coincides 
with the cyclostationary process obtained as the output of 
(2), then (9) will be said to be an innovations-like (IL) 
representation ot the process. If, in addition, the zeros of (9) 
belong to the open unit disk, such a representation will be 
termed innovations representation. In this section, it will be 
shown that there exists a one-to-one correspondence 
between the symmetric periodic solutions of the DPRF: (6) 
and the IL representations (9). 

Theorem 3. Suppose that /*'( ) is stable and (F( ), Hi )) is 
observable. Consider any symmetric periodic solution Pi ) of 
the DPRE (6), with the associated gain Kf) defined by (8b). 
rhen, letting Lit) = Kit) and Rit) Hit) ^ Hif)Pit)Hity, 
equation (9) provides an IL representation of the 
eyeloslationary process v( ) defined by (2). Moreover, 
different solutions of the DPRE lead to different II. 
representations. 

Proof The observability notion is output-feedback in¬ 
variant. Hence, the observability of (/ (•), Hi )) is equivalent 
to the observability of iAi ), Hi )). 3hen, Theorem 1.1 
implies the existence of a maximal symmetric periodic 
solution P^i-), so that, by 3'heorcm 2, R(0 + 
/y(0P(t)/^(0' 'd. Vt. Thus, Rif) can indeed be interpreted 
as a covariance matrix 

C'on.sidcr now the covariance matrix function A(t, t) ~ 
Elv(0>’(^)'] ^he cyclostalionary process defined by (2). 
This function is given by. 

A(^ r) 

^\Hit)^>,it. r4 1)1 E(t)A^(t)//(t)' +r;(T).V(T)], t>T, 

\Hit)Xit)Hi(y Rit), t^x, 

( 10 ) 

where the covariance matrix of the state, A"(r) = E|jc(/)jc(/)'|, 
IS the unique symmetric periodic .solution of the Lyapunov 
equation: 

Xit^ \)- Fit)Xit)FUy + Git)Qit)Gity. ( 11 ) 

On the other hand, the IL representation (9) defines a 
unique cycloslationary process too. Its covariance matrix is 
given by 

A(f, t) 

H(t)^^it, T 4 \)lFir)X(T)HiTy T LiT)Rix)], t > r, 
HU)Xit)Hity ^ Mt), 

( 12 ) 

where A'(f) ^ E|.f(r)i(r)'] is the unique symmetric periodic 
solution of: 

Xit -f 1) = Fit)Xit)Fity -h Lit)Rit)Lity. (13) 

Now, by subtracting (ha) from (11), it is easily .seen that 
A"(0 ~ Xit) — Pit), Consequently, after .some cumpntalions. 
it follows that A(f, r) = A(t, r). 

Finally, it is proven that two different solutions P|( ) and 
E 2 ( ) of the DPRE cannot give rise to the same IL 
representation (9), Indeed, by contradiction, assume that 
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there exist two symmetric periodic solutions Pj(') and P 2 ( ) 
of the DPRE such that and /Cj( ) = 

where R,(r) = /?(/)-»■ Af(0P,(0«(0' and /C,(r) = [G(05(ri + 
A(r)P,(r)//(01(//(r)P/(0/^(0' + A?(0]‘\ » = 1- 2. Then, let 
Xi(‘) be the unique symmetric periodic solution of the 
Lyapunov equation 

XXt‘^i)^F(t)XXi)FUy + KMRX^^^^^ 2 . 

Since the known terms of these two equations do coincide, it 
follows that XXO^^iiO ^n the other hand. P,(0 = 
X(t) - XXO’ St) that /',(■) and F 2 ( ) niust in fact coincide loo. 


Conversely, once the cyclostationary process satisfying (2) 
is given an IL representation, one can wonder whether there 
exists a symmetric periodic solution of the DPRE generating 
such a representation. The answer is affirmative, as stated 
precisely in the following. 

Theorem 4. Suppose that /*'( ) is stable and (F( ), //(•)) is 
observable. Consider an IL representation (9) for the 
cyclostationary procCsSs generated by (2). Then, there exists a 
symmetric periodic solution P( ) of the DPRE (6) such that 
L(i) = K(t) and R(t) + H{t)P(0N(ty. 

Proof. Let A'(/) and X{t) be the covariance matrices of the 
state x(r) of the original representation and the state jf(r) of 
the IL representation, respectively. By assumption, the 
associated output covariance matrices A(f, r) and A(r, r) do 
coincide. By comparing their expressions, (10) and (12), for 
r = T, r+l. ..,T + NT, it follows that 

R(t)^R(t)-^H(t)lX(t)-X{tmty 

V(T+l)[F(T)Af(T)/y(T)'^G(r)5(T)l 

= ViT^i)\F(T)X(T)HiTy + L(T)/?(r)l. 

In view of the observability of {F( ), Hi )), the Observability 
Criterion (Section 2) guarantees that K(t+ 1) is full-rank, 
Vt. Therefore, 

L(/) - [G(r)J(0 + F(0iX(t) - XU))H{ty]R(t) V 

By substituting the expressions for R(t) and L(f) in equation 
(13), the conclusion is drawn that A'r)-^( ) satisfies the 
DPRE (6). Consequently, R(/) and L{t) can be seen as 
generated by, a solution of the DPRE, i.e. R(r) = R(ri + 
and L(0 - (C(«).SW + ' 

with P(f) = Jf(() - ^(') ■ 

From Theorems 3 and 4, one can conclude that, under the 
observability assumption, there is a one-to-one corre.spon- 
dence between the symmetric periodic solutions of the 
DPRE and the IL representation of a given cyclostationary 
process. 

As in the stationary case, among the various IL 
representations, a major role is played by the innovations 
representation, which is characterized by the property that 
all iLs zeros belong to the open unit disk. As shown below, 
this is equivalent to requiring that the associated symmetric 
periodic solution of the DPRE is stabilizing. Indeed, an IL 
representation can be rewritten as 

xU-^ \)^A{t)x{t)-^ K(t)y(t), (14a) 

y(t) = H{t)x(t) -I- f(r), (14b) 

where A(t) = F(t) - F(f)//(0- As seen in Lemma 1, a 
periodic output feedback does not affect the zeros of a 
periodic system. Therefore, the zeros of the IL repre¬ 
sentation (14) coincide with the zeros from f( ) to y(‘) of the 
.system: 

x(r+l) = A(0x(/), 

y(0 = /f(0^(r)+c(0- 

According to the definition of zeros of a periodic system 
given in Section 2.2, such zeros turn out to be the 
characteristic multipliers of A('). Since A() is the 
closed-loop matrix, the conclusion is that the innovations 
representation exists if and only if the DPRE admits a 
stabilizing periodic solution. 


5. PARMA models: minimum-phase condition and 
innovations representation 

In this section the notions of poles and zeros of a PARMA 
model are introduced. Moreover, the problem of the 
existence of a canonical (minimum-phase) PARMA repre¬ 
sentation of a cyclostationary process is discussed. This is 
done by resorting to a state-space representation of the 
original PARMA. The input-output model (1) is equivalent 
(i.e. has the same impulse response) to the following linear 
periodic system 


where 


xO + 1) = F(t)x(l) + G(0p(0. 
y(0 = H(iU(0 + e(r). 




"o 

0 

... 0 

1 

0 

... 0 

0 

1 

... 0 

_0 

0 

... 1 


- 

«„(') +C„(l 



i(') + '„ 

r)- 


:(') + ‘n 




“t(') 


t’lO) +1|(<) 


(l-Sa) 

(15b) 


(Ifiu) 


(Iftb) 


Correspondingly, we will say that p is a pole of the PARMA 
(1) if it is a characteristic multiplier of F( ). As for the zxros, 
in view of Lemma 1, the zeros of (15) coincide with the zeros 
of the system 


x(t-hl)=A0)x(f), (17a) 

yO)^H(t)x(t)-heOy (17h) 

where A(t) = F(t) — G(r)/Y(f) is given by 



0 

0 

... 0 



1 

u 

... 0 

|(0 


0 

1 

. . . 0 



_() 

0 

... 1 



According to Definition 1, such zeros turn out to be the 
characteristic multipliers of A{ ), thus leading to 

Definition 3. z is said to be a zero of the PARMA (1) if it is 
a characteristic multiplier of /4( ). ■ 

It is immediately verified that, in the particular case of a 
time-invariant ARMA, such definitions coincide with the 
classical notions of poles and zeros. As a natural 
consequence of Definition 3, we will say that the PARMA 
model (1) is minimum-phase, if/!( ) is stable. 

Note that the state-.space representation (15) of the 
original PARMA model (1) coincides with system (2), 
provided that F(i), G{t), H(t) are defined according to (16). 
As for the variances Q(t), R{t), and the cross-covariancc 
S{t) of the disturbances u( ) and iv( ) appearing in (2), one 
has to take Q(t) S(t) - R(t), where /?(t) is the variance of 
c(/). As a consequence, matrix B{t) defined in (7b) turns out 
to be a zero matrix Vt, so that the pair (/4( ), B( )) is 
completely unreachable. Conversely, by means of the 
Observability Criterion (Section 2.1), the pair (A( ),/Y( )) 
turns out to be completely observable. 

Since B(f) = 0, Vt, the DPRE relative to the stale-space 
representation (15) of the PARMA is given by 

F(/ -h 1 ) - A{t)P{t)A(ty - A(t)P(t)Hity[H(t)P(t)Hity 

-^R{t)]^H(t)P(t)A(ty. ( 19 ) 

We now consider the case when the PARMA is .stable, i.e. 
all its poles lie inside the unit circle. Then, as already 
observed in Section 2.1, the output of the PARMA 
converges to a cyclostationary process: we will say that the 
PARMA model is an input-output representation of such a 
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cyclostationary process. Now, one may wonder whether, as 
in the stationary case, there are several PARMA 
representations of the same cyclostationary process and 
whether a canonical (minimum-phase) representation exists. 
The answer to these questions can be found in the following. 

Theorem 5 {Periodic spectral factorization theorem). 
Consider a stable PARMA and the associated cyclostation¬ 
ary proces.s v(‘). Given any symmetric periodic solution P{ ) 
of the DPRE (19), let - eft) + ft), where yft) is 
the fth entry of the Kalman gain f (f) defined in (Rc). Then, 
the PARMA 

n n 

yiO = 2 “((' - '))'(' - i) + ^ c,(l - i)e(l - i) + e(l), (20) 

I « I I I 

where e( )~ WGN (0. «(■) + H( )P( )A/( )'). provides a 
representation of the cyclosialionary process y( ). 

Furthermore, there exists a minimum-phase representation 
iff the additional assumption is made that the PARMA (1) 
has no unit-modulus zero. Then, the minimum-phase 
representation is unique and is obtained in correspondence 
of the unique stabilizing solution P.,( ) ol the DPRE. 

Proof. Consider the state-.space rcrcsentation (15) ol the 
PARMA. In view of Theorem 3, letting /^(■) — R( )-t- 
N( )P( )//( )' and L( ) = fC(-), system (9) provides an IL 
representation of the cyclostationary proce.ss v( ) Note, that, 
since S(f) ^ P(t), from (7a), (Raj it follows that A"(/)- 
C/(0 + 1 (0* Recall that (J) is the input-output repre¬ 
sentation of the triple (T( ), G( ), //(■)) given in (16): by 
comparing the triples (F( ). G'l ), /-/(■)) and (F( ). G( )+ 
r( ), A/( )), it is ca.sy to sec that (20) is in fact the 
input-output representation of the IL system (9). 

Finally, the zeros of the PARMA (20) coincide with the 
characteristic multipliers of A{ ) If ) - K{-)H{ ) ~ A{ ) - 
r'(•)//(■). Therefore, the canonical minimum-phase repre¬ 
sentation is obtained in correspondence with the stabilizing 
solution Ps( ). Since (/!(•), //(•)) is completely observable 
and (A( ), //(*)) completely unreachable, I’heorem 1.3 
implies that such a stabilizing solution exists iff A() has no 
unit-modulu.s characteri.stic multiplier, 

6. PARMA models, optimal prediction 
This section is devoted to the solution of the optimal 
one-step-ahead prediction problem lor PARMA models. 
Our approach involves the application of Kalman prediction 
theory to the state-space representation (15) in order to 
derive the optimal predictor for the PARMA (i). 

It IS well known that the optimal periodic predictors 
associated with (15) can be expre.sscd as follows 

x{t + l/r) - P{t)x(t/t - I) -^ K{t)\ v(0 -y{t/t -- 1)] 
^Ait)x{t/i-\)^\ft)[y{t) 

-y(t/t~\)]^Git)y{t), (21a) 

y(tlt-\)=-H{t)xitlt-\), (21b) 

where r(/)[A(0] conforms to (8c)((Rb)), in which P( ) is an 
SPPS solution ol the DPRE (19). 

We arc now in a position to solve the optimal prediction 
problem in the minimum-phase case (Theorem 6) as well as 
in the nonminimum-pha.se one (Theorem 7). 

Theorem 6. Consider the PARMA given in (1) and assume 
that it is minimum-phase. Then, the optimal periodic 
steady-state predictor i.s 

fi 

>■((/( - 1) = - ^ f,(( - i)yU- ill - I - 1) 

/ - 1 

+ X|c,('-') + fl,('-')l>'('-0- (22) 

f - I 

Moreover, such a predictor is stable. 

Proof. The minimum-phase a.ssumption entails that A( ) is 
asymptotically stable and ihcrelore system (17) is stabilizable 
and detectable. Consequently, the unique periodic solution 


of the DPRE (Theorem 1.2) is the trivial one. F(0 = Oj Vr, 
which is also .stabilizing. It is a matter of simple computations 
to derive (22) from (21). The stability of (22) follows directly 
from the stability of A( ). As a matter of fact, the zeros of a 
minimum-phase PARMA become the poles of the associated 
optimal predictor. ■ 

If the minimum-phase condition is not met with, A( ) is 
not stable and the uniqueness of the SPPS solution of the 
DPRE (19) is no more guaranteed. Depending on the initial 
covariance of the .state of system (17), there are, in general, 
.several optimal periodic predictors, each of which cor¬ 
responds to an SPPS solution of the DPRE. Among all the 
predictors it is then wise to choose the stabilizing one, if any. 
This motivates the following theorem. 

Theorem 1. Consider a PARMA system with no unit- 
modulus zero, Then, the stable optimal predictor is given by 

y(r/f -\) - - ^ eft - i)y(t - i/t - f - 1) 

I I 

/I 

+ zL|e.('-') + a,(f-01>'(0. (23) 

1.. I 

where c,(/) = r,(0 + ^ i(0. Vf^) being the / th entry of the 

Kalman gain [ (r) associated with the stabilizing solution 
Pf ) of the DPRE (19). 

Proof. By a.ssumption /4( ) has no unit-modulus characteris¬ 
tic multiplier. Moreover, wc know that the pair (v4( ), //(■)) 
is observable. Wc can then apply I'hcorcm 1.3 to conclude 
that the DPRE (19) admits a stabilizing solution, which is 
positive semidefinile. Consider now the corresj^nding 
optimal periodic predictor (21). The PARMA (23) is in fact 
the input-output representation ol such a predictor. The 
stability of (23) stems from the stability of the closed loop 
matrix A( ) = A( )-r( )//( ). ■ 

From Theorem 5, 6 and 7, it is apparent that solving the 
prediction problem for a stable nonminimum-pha.se PARMA 
is equivalent to finding a minimum-pha.se PARMA 
realization ol the as.socialed cyclostationary process >(■) and 
then applying (22) to such a “canonicar’ PARMA. 

7. Conclusions 

In this paper, the factorization pn.)blem for cyclo.srationary 
procc.sse.s and the optimal prediction problem for ARMA 
models with periodic cocHicients (PARMA models) have 
been studied. It has been shown that there i.s a onc-to-nne 
correspondence between periodic factori/.alions and periodic 
solutions ol a difference periodic Riccati equation. On the 
basis ol such a result, the prediction problem for PARMA 
inodeKs has been given a solution that can be outlined as 
follows 

(a) Check the minimum-phase condition by verifying 
whether the zeros ol the PARMA system arc stable, i.e. 
whether the multipliers of A( ) given in (18) belong to 
the open unit disk. 

(b) If m the affirmative, the optimal one-slcp-ahead 
predictor is given by (22) 

(c) Othcrwi.se, if some zeros of the PARMA lie on the unit 
circle, there is no stabilizing optimal periodic predictor. 

(d) If no zero lies on the unit circle, compute the stabilizing 
.solution Pf) of the periodic Riccati equation (19) and 
the corresponding Kalman gain r( ). Then, the optimal 
predictor is given by (23) 

Note that, if the original PARMA is stable, this last step 
can be interpreted as applying the standard prediction rule 
(22) lo its canonical minimum-phase representation. 
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Abstract —We address discrete-time control systems contain¬ 
ing a memoryless nonlinear element and characterize the 
error response to certain inputs when the circle condition for 
stability is met. 


where the operator L is defined by 

t 

(Lx),= "Z S, r’f,. 

t - I) 


1. Introduction 

Thf conclpt of steady-state errors often plays an important 
role in the design of control systems (see, for example, Kuo 
(1991)). For the discrete-lime case, altcrilion is frequently 
focused on the linear unity-feedback system of Fig. 1, and on 
Its error response e to certain .standard inputs r such as step 
functions, ramps, etc. For such systems, it is a classical result 
that the number of poles of G at the ^x)int (1 +/()), often 
referred to as the "system type", determines whether the 
system error is unbounded, approaches a nonzero constant, 
or approLches zero. 

2. Results for nonlinear systems 

In this paper we report on an extension of the theory for 
linear systems described above. The extension concerns 
control systems of the form shown in Fig 2, where N takes 
into account a memoryless nonlinear element of the sector 
type. We show that much more general results along the 
lines of those for the classical linear case hold. This involves 
a hypothesis analogous to the circle condition (Sandberg, 
1964) for the stability of conimuou-s time systems. Our 
results arc along the lines of those in Sandberg and John.son 
(1993) where only continuous-time systems arc treated. 
Throughout the paper when we say that a limit exists, we 
mean that it exists as a real number. 

For the systems we consider, an additional degree ol 
complexity arises concerning the evaluation of the limit ol 
the steady-slate response when it exists, because the systems 
are nonlinear. We give a convcigeni algorithm lor computing 
the limit whenever it exists. 

It will become clear that our inputs are more general than 
merely steps, ramps, etc. We consider more general inputs to 
establish that under our conditions the system response 
possesses a degree of robustness with respect to inputs. 
Additional comments arc given in Section 3.1. 

2.1. The nonlinear system and assumptions. Let 'Jy be the 
set of nonnegative integers, and let denote the set of 
real-valued functions defined on . 

Referring to Fig. 2, it is assumed throughout that r, e, n, 
and y belong to ,5, and that y and are related by 


for X E S. Here ^ e S is the inverse 2 -lransform of 


0(z) 


(z - \Y‘q{.z) ' 


where p f is the number of poles of G at z - 1 -I- /() and p 
and (z - arc relatively prime real polynomials such that 
deg (p) ^ deg ((z - 1)^V/). As usual, (z - 1)" means 1. The 
function deS takes into account initial conditions, and it is 
assumed that d is the inverse z-transform of 


0(z) —, 

{z-\Yq{z)' 


where m is a real pnilynomial such that D is proper. With 
regard to the nonlinear pmrtion of the system, wc suppose 
that (Nx), = t for x where p maps ^ into 
such that 7/(0) = 0 and there are two real constants a and ft 
such that 


a- - 

a — b 

for a ^ h. Define c,, e by 

and assume throughout that 

rhe system under consideration is governed by 

r,-d, =-e-, + 2 Sr-r^krl. (0 


We assume that the following three conditions arc met, 
and will refer to this assumption as B.l: 


(') • +'■iiSii‘I. 

(ii) 1 c,|(7(z) ^ 0 for |zI 1, and 

(iii) J(^-Qr) sup |//(z)|^'l where H denotes the rational 

|z| - 1 

function defined by 


H{z)^ 


1 ^ Ci^G{z) ’ 


y,^{Lv), + d,, 


* Received 28 August 1991; revised 23 April 1992; 
received in final form 14 May 1992. The original version of 
this paper was not presented at any IFAC meeting. This 
paper was recommended for publication in revised form by 
Associate Editor R. V. Patel under the direction of Editor 
H. Kwakernaak. 

t Department of Electrical and Computer Engineering, 
University of Texas at Austin, Austin, TX 78712, U.S.A. 


at all points z at which G is regular. (By (i) and (ii) this 
function is proper and has no poles on or outside the 
unit circle. We will use the fact that H is the z-transform 
of an ^i(^+) function h.t) 

Hypothesis B.l is analogous to the key condition A,1 of 
Sandberg and Johnson (1990) which is stated in circle- 
criterion form; B.l too can be expressed in that form. 


t Of course, e .'i: 5] 1 .p„1 < 
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G(z) 


Fig. 1. Linear control system. 





N 

V 

G(i) 

y 

J ^ 










Fig. 2. Nonlinear control system. 


If gu = 0, e, in (1) is defined explicitly for each r. and so it 
is clear that for each r e S there is a unique e e S such that (1) 
is satisfied. It can be shown that this existence and 
uniqueness holds even if go 0. * 

Define fj: 9?—» 9? by rj{a) = T}(a) — a g 9?. The follow¬ 
ing proposition plays a central role in the statement of 
Theorem 1 our main result which apf>cars below. 


3. Proof of Theorem 1 

Assume that the hypotheses of the theorem are met, and 
let / denote the identity map on .V. We will use the following 
lemma. 

Lemma 1. The inverse of (/ + Ci^L) :S-*S exists and has the 
representation 

f 

[(/ + C|,/,) 'j:], = -q, 2 tX-,, reatf. 

T-O 

for X 6 S (where as indicated earlier h is the inverse 
z-transform of H). 

Proof of Lemma 1 

It is clear that g and h are related by 

t 

Si-\ = q)Ss. (2) 

T-n 

Using (2) one finds that y e 5 defined by 

I 

y,^x,-c„ 2) h, (3) 

T-n 

satisfies 

I 

v. + foS K. liJ',. (4) 

T-n 


Proposition 1. Suppose that the assumptions in Section 2.1 
are met. Then for each y g there exists a unique ^ g .9ft 
such that 

r = | + w(i)fj(5), 

and one has 

?= lim ?*. 

if—« 

where f* ♦ i = V “ ^nd is arbitrary. Also, 

c=i(/3-a)|//(l)|<l and 

If - if*i I?.. - y + «a)»j(5n)i. * 21. 

This follows from the contraction-mapping fixed-point 
theorem and its proof (Kolmogorov and Fomin, 19.57) using 
the inequality (see B.l) [(fi - a) |//(1)| < 1 Referring to the 
proposition, let 0 ; 9?—be the map that takes y into $• 
Our result, which concerns the relationship between the 
nature of c and the system type, is the following. 

Theorem t. Assuming that the assumptions described in 
Section 2.1 are satisfied: 


for jr G^. On the other hand, using (2), wc find that if v e .V 
satisfies (4) for some jr g .S’, then y is given by (3) Thus 
(7 -f- c„L) IS one-to-one and onto, which completes the proof 
By Lemma 1. (I) is equivalent to 

(l + c„l.) '(r-d) = f+ (/ + r„/.) 'LNe, (5) 

for r e 5, where (/Ve), = fj\e,\ for a\\ i e , and 

t - I) 

for X G 5. 

The z-transform of b, = (/+ c,,/.) 'd, which appears on 
the left side of (5), exists, is proper, and is 


(z ~ 

By B.l, (6) has no poles on or outside the unit circle So, 
►O as /—►'». There lore 

I 

-I-[(/+r„L) = + ,r;(£\), tF "/,. (7) 

r (I 

for r G 5, where (^, is as described 


(i) If r approaches a limit I as t—>^, then lim e, exists. 
Moreover, a 0 if and only if / ^ 0 and p = 0, and then 

a = e{/[l + r„C(l)| 


(ii) Assuming that r - £ a/ for t e , where the a, are 

;^rj ^ 


real, v is a positive integer, and =^0, 

(a) e IS unfounded if v > p, and 

(b) if vs=p, then e approaches a limit as r—This 
limit is 0(fl^v! q(l)/c„p(l)] if v-p and zero 
otherwise, t 


• In this connection, gn = 0 in the important case in which 
G(z) represents a .sampled system consisting of a zero-order 
hold followed by a strictly-proper rational transfer function 
Q. This follows from gf, = lim G(z), the initial-value 

|.C|—CO 


theorem for Laplace transforms, and the standard relation 
C(2) = (l-Z where Z(.t 'e(r)} denotes 

the z-lransform of the inverse Laplace transform of i' 'Q(s). 

t Referring to the argument of 0 in (ii) of Theorem 1, for 
pedagogical reasons some readers may wish to observe that 
<^u[7^(l)/^(l)l 's simply the product of the “average gain” 
2 (a + ^) of N times the coefficient of the (z -- 1) ^ term in 
the partial-fraction expansion of G. 


Proof of Part (i) 

We need two lemmas. 

Lemma 2. If r,f e 9ft a.s 1-*^, then [(/-ft„L) V], 
approaches 

/(i-coi/.,). 

^ T-(l ' 

This follows from Lemma 1 and a version ol the linal-valuc 
theorem for z-transforms. 

Lemma 3. Let yG.9ft, and let 6eS be such that »>0 as 
r—►oc If 


■5, + r-q+X ^ W 

T-O 

then e,—►0(y) as / — 

Proof of Lemma 3 l^t ^ = 0(y). Then 

T-0 

and so 

y- i M(?)=£+X*, .m). W 

1 T -“-0 
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Since the difference of the left sides of (9) and (8) 
approaches zero as and B.l holds, by Theorem 2' of 

Sandberg (1965), one has e,—►0(y) as as claimed. 

We shall use Proposition 1 and the observation that 
^(0) = 0 to prove the remainder of Part (i). 

Clearly I = 0 implies a-i). U p>0, p{\)¥=Q and 

u(i)^ _ Bill __i 

(z - \yq(z) + C(^p{z) c,/ 

So, then \ ~ CoH{\) = 0, yielding 0 = 0. Therefore, 
implies I # 0 and p = 0. 

Now suppose l^i) and p = 0. Then I - L(,//(1) = [ 1+ 

' and/[I + C(,G(1)] ^ 0, showing that a#0. 

Proof of Part (ii). We begin with two definitions: let w, e 5' 
be defined by ’ 

I 

-‘n X XT'. ie%. 

T--() 

for ; = 1, 2, . . . , V. For every positive integer k, let x ^ r* 

mean that x e S and there is a nonzero constant p such that 

xjt'^-*p as /-♦ 00 . 

We will u.se the following lemma. 

Lemma 4. 

(a) j> 

(b) q(\)lc„p(\) as / —», and 

(c) / <" 0 as f —► 00 . 

Proof of Lemma 4. Choose j, and let W, denote the 
z-transform of w^. which clearly exists. One has 

Wfz)^Z[n\\-c,H(z)], 

where Z{t^), the z-transform ol is of the form 

7J(z)/(z - ly '' With Tj{z) a polynomial m z ol degree ; with 

no zeros at z ^ I. Thus, we have 




_ T/z)q(z) 

•iy*'-"|(z-ljV2) + r„p(^)r 


Since p and (z - \)^*q arc relatively prime, by B.l, we have 
(z - ly'qiz) -f Cj,p(z) for |z| P.- 1. 


Our results are, of course, not restricted to the 
configuration of Fig. 2. As is well known, more general 
systems are governed by an equation of the form (1). A 
related paper is Miller e( al. (1989) which considers the 
effects of quantizers on the error performance of control 
systems. 

Cases in which a linear subsystem /C(zl precedes N are of 
interest. Two results for K(z) ~ (z - I) , along the general 
lines of the proof of Theorem 1, are given in the next 
section. In addition, using a discrete-time analog of Wiener’s 
tauberian theorem it is shown that, under a natural extension 
of the assumptions in Section 2.1, it is not true that the 
system error e approaches a limit whenever e when r) - 0 
approaches a limit. This shows that certain specific 
cxten.sions of the proof of I'hcorem 1 can be carried out, and 
that others cannot. 

4. Systems with an integrator preceding N 

In this seciion we consider the system of Fig. 3, in which 
the blocks labeled N and G{z) are as described in Section 
2.1, and r, e, x, and y are members of .S' with r the inverse 
z-transform of a proper rational function. The block labeled 
(z - 1) ' represents an integrator. It introduces the relation 

r I 

J(, = -tn+ E ‘■f 2, . . .)• (11) 

r- (I 

Let B.2 stand for the condition that B.l is met with G(z) 
replaced with (z - 1) 'fj(z) Under B.2 it is stiaightforward 
to show that x in Fig. 3 satisfies an equation ol the form 

( 12 ) 

I = 0 

where w and u are inverse z-tran.sforms of proper rational 
functions, that w t and that we have 

- fi) sup |VT(z)|' 1, 

where W is the z-transform ol w. Assume that B.2 is 
satisfied. 

I'he set S contains a unique solution x of (12). By (11), one 
has 


Thus, part (a) follows Irom the fact that Wj has > + l-p 
poles at z ~ 1, and part (c) from the fact that has no poles 
at z - 1 when j - - p. If / = p, then has a simple pole at 
z = 1 and the associated residue is 7;(l)r/(l)/c„p(l) Since 
7^(1) - j\, we have also proved part (b). 

Continuing with the proof of the theorem, let r be as 
de.scribed in part (ii). Referring to the left side of (7), 

(/4 £■„/.) (10) 

I <' 

II V > p, 

V P V 

E “/**) ^ E + E 

/ 0 / -x. 0 / — 4- 1 

in which, by (a), (b), and (c) of Lemma 4, the first sum on 
the right side is bounded and 

E 

/-e +1 

Since h e /|(^< ). e bounded implies that the right side of (7) 
is uniformly bounded in t. Therefore, e must be unbounded if 

V > p. 

When v<p, the right side of (10) approaches zero as 
/^Qo, by (c) of Lemma 4. Thu.s, using Lemma 3 and 
Proposition 1, e,—> 0 as r—► 

Finally, using (b) and (c) of Lemma 4, if v = p, the right 
side of (10) approaches li^v! c/(l)/c„p(l) and, by Lemma 3, 
e, approaches the limit stated in the theorem. This completes 
the proof of the theorem. 

3.1. Comments. Referring to Part (ii) of Theorem 1, in 
classical linear theory attention is focused only on the case in 
which V e {1, 2) (and r is a monomial). We have considered 

V 6 fl, 2, . . .) to illustrate the general pattern that results. 


I 1 “ t E 7 ^ , 

and using (12), 

I 

, 1 - M, = (-<,11 -“ ».) + E xl'l(.*l .l)- 

t II 

4 , ,7)(.»„), It 7,. 

Hence, we have 

I 

E .l^(-f., i)-^(4,)l 4 K',, (e;/,, 

T-O 

(13) 

where e^„ =-u,, ^ - u„ r e ^ . By - fi{h)\ ^ \{ft- 

or) |fl -- /)( for real a and h, 

'"((jr,, i) - = </',(•»,, I - Jt,). 1 e , 

with \</>,\ -- i(^ - ix), and usinp (13), one has 

<’i„ + J, ^ + E 'f 'lr, , (14) 

I* II 

where d eS is given by d, = - w, ^ i^(-ri)) hir each t. 

Below results arc given related to (hose of Section 2. 
Notice that in (13) in the system error when the nonlinear 



Fkj. 3. Nonlinear control system. 
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part ff of f) vanishes * We show in Theorem 2 below that 
(i.e. Pfj,* >0 as if and only if e-^0. Theorem 3 
asserts that e is bounded if and only if is bounded. 
Observe that these results are along the lines of the proofs in 
Section 2. On the other hand, in mntrast with the results in 
Section 2, Theorem 4 shows that it is not true that e„ 
approaches a nonzero Limit implies that c approaches a limit. 

Theorem 2. Under the condition indicated, ►() if and only 
if 

Proof of Theorem 2, If Lbcn (Pj, 4 t/)—>0 and, by (14) 

and Tlieorem 2' of Sandberg (l%5), p->(). 

Conversely, let p-^0 and consider (14). Since w e/,(J,). 

( 

e-^0, and qp is bounded, it follows that w, —►O. 

Thus (using 0) d- 

Theorem 3. Under the conditions indicated is bounded if 
and only if e is bounded. 

Proof of Theorem 3. Consider (14). Since we/,(5,) and 
W(z) is rational, we have t"w e /|(3^0 (notation of Sandberg 
(1965)). In addition, d is bounded and as indicated above 
^ O') all t. Thus, p„ bounded implies that e is 
bounded, by Theorem 2' of Sandberg (1965). 

On the other hand, suppose that is unbounded. Then by 
(14) the hypothesis that e is bounded leads to the 
contradiction that e,, is bounded. Therefore c is unbounded. 
This completes the proof 

Comment. With regard to explicit corresponding results in 
the context of Section 2, we see that if c„ denotes the left 
side of (7), then e in (7) is bounded if and only if e^^ is 
bounded, and similarly e in (7) approaches a limit if and only 
if approaches a limit. + 

Returning now to the system ot Fig. 3, we prove the 
following 

Theorem 4. Under the conditions indicated, the existence ot 
lim docs not imply that lim e, exists. 

Proof of Theorem 4. We use Lemma 5, below. 

Lemma 5. A.ssume that the conditions indicated are met, 
that p(e for e | -.tt, .7r)(p(2) is the numerator of 

C(z)), and that the limit ot i) " f/(*/)l (^/1 r ' 

fails to exist fur any strictly monotone sequence jr„, jt,. 

in [¥l such that \x,^ ^ - j:,| y. t e .^J ^ , for some y fc 
Assume also that lim exists and is nonzero. T hen lim c, 
fails to exist. ' 

Proof of Lemma 5. Let the hypotheses hold and consider 
(14). Suppose (to obtain a contradiction) that /^lime, 
exists. Then I ^ (), by Theorem 2. ' 

/ 

By (14), lim 5] w, exists Since q) is bounded, and, 

r 0 

by Theorem 3, e is bounded, is uniformly bounded in t. 
In addition, the z-transform of w is y?(z)[(z'- l)^'^'qf(z) + 
^upi^)] ^ showing that W{e for By a 

general form Rudin (1962) of Wiener’s lauberian theorem, 

lim qy^p,.. (15) 

exists. 

* Notice that wc use p,, for two different purposes. Here it 
is a function: earlier il denotes p at / = 0. 

t Theorem 2 holds also in the selling of Section 2 (sec (7) 
and Theorem 2' of Sandberg (1965)). 


By the existence of (15) and / #0, q\ must have a limit at 
00 . But since /=^(), X is unbounded, eventually monotone, 
and there is a such that |x, +, - jr,| s |/| + 1 and 

- jr, ^ 0 for / ^ T, Since 

[^(jr,n)-‘ = ?’r + to. (16) 

the limit of the left side of (16) must exist. This contradiction 
completes the proof of the lemma. 

Now choose f/(z) ~ { and T}(a) - a fj{a), where fj(a) is 
odd symmetric, continuou.s, piecewise atfine and such that 

I as(oi2k-’\),o(2k)), Ae{L2, ...} 

-i, a^ioflk), a{2k + \)), keJ,, 

n 

where a(n) - H A:, n e In this case, 2 

A=() 

r}(b)](a~h) ‘ ^ J for a ^ 6, r„ - I, 1r „(2 - l)~'G(z) ^ 0 
for |z|^L and l+t„g„=^(), where is the value at 
/-O of the inverse z-tran.sform of (z-l) 'f;(z). Also, 
W(z) (i.e. (2-1) 'G(z)[l^( 2 - 1 ) 'G( 2 )] ')-M2z-l) 
sup |W( 2 )| = 1, and so \(P - a) sup |W(z)| < 1, which 

Izl^'l “ U’l 1 

shows that B.2 is met. Since rj and p here meet the 
conditions of Lemma 5, the proof of Theorem 4 is complete. 

5. Conclusions 

The concept of steady-slate errors often plays an important 
role in the design of control systems. Attention is frequently 
focused on the response of systems to certain standard inputs 
such as step functions, ramps, etc. For stable discrete-time 
linear systems, it is a classical result that the number of 
z-dornain poles of the loop gain al the point (1 -H/O), often 
referred to a.s the ‘ system type”, determines whether the 
system error is unbounded, approaches a nonzero constant, 
or approaches zero. 

In this paper, we address nonlinear control systems of the 
form shown in Fig. 2 and show that, when a hypothesis 
analagous to the circle condition for the stability of 
conimuous-time systems is met, corresponding proposilions 
hold concerning the relationship between the nature of the 
error response and the system type. A convergent algorithni 
is given for computing the limit of the slcady-.stale response 
whenever it exists. Results are also given concerning cases in 
which an integrator precedes A. 

Our approach can be used to obtain related propositions 
concerning the error response of systems with both inputs 
and disturbances. 
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On the Generic Controllability of Continuous 
Generalized State-space Systems* 
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Abstract —In this paper using a duality properly between 
regular and generalized state-space systems, the necessary 
and sufficient conditions for the generic controllability of 
continuous generalized stale-space systems arc established, 
the necessary and sufficient conditions for the generic 
observability of such systems arc also derived. 

1. Introduction 

This papier is devoted to the problem ol generic 
controllability of continuous generalized stale-space (g.s.s.) 
systems, i.e of systems described by Verghose et al (19KI). 
Lewis (1986) and Dai (1989), etc. 

£i(0 “ >D(0 + ^'^(0. (Lla) 

y(rN- Cx{t), (l.lb) 

where E, ziER"’"', /JeIH"*"' and C’eR'’"". The g.s.s 
system (i.l) for the special case whcie f ^ / is called regular 
and for the special case where dot /•' - 0 is called singular 
Systems of the form (1.1) cover a variety ol cases in many 
areas of science and technology such as electrical circuits 
(Newcomb, 1982) network theory (sStott, H)79), power 
systems (Campbell and Rose, 19H2), nuclear reactors (Reddy 
and Sannuti, 1975), robotics (Mills and Cioldenbcrg, 1989), 
aircraft systems (Stevens, 1984), economical systems 
(Luenherger and Arbel, 1977), neurological events (Zeeman, 
1976), catastrophic behavior (Sastry and Desoer, 1981), 
demography (L.csiic model) (C ampbell, 1^80), etc. 

Generic controllability, as well as all other problems which 
refer to generic system properties, refer to physical systems 
whose true values ol their parameters are characterized by 
small deviations from their nominal values. Hence, the 
parameters of the system may he either fixed zeros or 
indeterminate (arbitrary) entries. Thus, the coefficients of 
the system matrices may be considered either fixed al zero or 
mutually independent free parameters. A matrix in such a 
form is called a structured matrix (Wonham, 1974; Shields 
and Pearson, 1976). 

The form of a structured matrix is directly related to the 
physical topology of a system. In other words, even though 
there arc many equivalent rnathemaliCiil de.scriplions of a 
system, there must be one in which the elements of the 
system matrices arc in structured form Most of the rest ol 
the mathematical descriptions may have system matrices in 
which the free parameters are bound among themselves. 

For the case of g.s.s. systems the property of generic 
controllability has been studied only lor discrete, casual 
systems. For this type of systems the following form is argued 

* Received 31 May 1991; revised 28 June 1992; received in 
final form 23 August 1992. The oiiginal version of this paper 
was not presented at any IFAC meeting This paper was 
recommended for publication in revised form by Associate 
Editor V. KuCera under the direction of Editor H. 
Kwakernaak. 

t National Technical University ol Athens, Division of 
Computer Science, Department of Elcclrical Engineering, 
15773 Zographou, Athens, Greece. 


to be structured (Yamada and Luenherger, 1985) 

\ ^ k (L2) 

For the ease ol continuous g.s.s systems no results have as 
yet been reported. Ihis paper is devoted to establishing the 
first results on the subject and in particular the necessary and 
sufficient conditions for generic ccjnlrollubihty of continuous 
g.s s. .systems. I hcsc results arc analogous lo those reported 
in Shields and Pearson (1976) lor regular systems. Note that 
the causality property for the discrete system (1.2) lacihtated 
the solution of the problem in Yamada and I.uenbcrger 
(1985, 1986) For the continuous case, the analogous 

assumption can not he made. For this reason, the generic 
controllability problem for the continuous case is inherently 
more difficult lo solve than that for the discrete case 

The proposed structured dcsenpiion lot g s s. systems is ol 
the following lorm 

r(0-F-3(0 4 /!»(/)-l riw(0. v(0- C‘x(/). (1.3) 

Clearly, (J.3) is mathemaliealy equivalent lo (1.1). Note that 
description (1.3) is often u.scd lor describing many practical 
.syslems (large scale production (Luenherger and Arbel, 
1977; Campbell, 1980)). 

10 study the generic controllability of g.s s syslems, a 
duality property is proposed relating regular and g.s.s. 
systems. 4’his duality property is based on a transformation 
which gives an alternate description of the g.s.s, syslems. 
This is an i.soniorphism between regular and g.s.s. systems. 
This isomorphism makes it possible lo take advantage of the 
knowledge wc have tor regular sy.stcms, to study the generic 
controllability of g.s s systems Based on our apprtiach for 
the study ol the generic controllability of g.s.s. wc readily 
establish the necessary and sufficient conditions for the 
generic observability o( g.s.s. systems. 

11 is menlioncd that the results ol the present papci is part 
of the material reported in Koumboulis (1991) 

2. Duality property 

The g.s.s. system (1,1) in the frequency domain is 

sEXU)^ AX{,s) + £T(U- ) -F BU{s), Y{s) =- rX{s). (2.1) 

The system is assumed solvable, i.e it is assumed that 
del [.rE - A] ^ 0. For this case, let p be a real number such 
that del + A| ^ 0. Such a real number always exi.sts and 
may take any value except the real roots {p ^ X.) ol 
del |(“A)E — A] “ 0. The number of these roots is less than 
or equal to n. If wc add lo both sides of (2.1a) the 
quantity pEX{s) and after prcmultiplying the resulting 
relation by {pF T A) ' then (2,1) becomes 

(.V +/i)/:A'(3) - A"(,s) -F £>(()-) + BU{s), T(,v) - LA(.v), 

( 2 . 2 ) 

where 

E = {pEEA) 'E, B^ipEYA) C - C. 
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Next, consider the regular system 


to hold Vj e C - {/i}. The matrix given by 


or equivalently in the Laplace domain w 



1 

s 


( 2 . 11 ) 


wZ(w) = EZ(w) -I- r(O-) -I- BV(w), 'PCiv) = CZ{w), (2.3) 

where Z(w), V(w) and 4 ^(h’) are the Laplace transforms of 
z(t), u(f) and i/^(r), respectively. The solution of system (2.2) 
for ^(.r) and y(^) is 

X{s) = l(s + n)E - /] '£x(0-) + [(s + /i)£ - /] ‘£t/(5), 

(2.4a) 

VCO = c|(j + ^)£: -/] '£;((0-) + r[(j + M)E-/| 'BU(s), 

(2.4b) 

and the solution of system (2.3) for Z(w) and ^(w) is 

Z(w)-=(w/-E) 'z(0-)-»-(w/-£) '£V(w), (2.5a) 

V(^v) = C(w/-£)'’z( 0 -)-^ C(w/-£) ^BV(w). ( 2 . 5 b) 

It is remarked that the two systems (2.2) and (2.3) involve 
three matrices, namely, the matrices E, B and C which are 
identical for both systems. Due to this characteristic, these 
two systems are strongly interrelated. If wc let 

w — —5-. r = —-ii, V(iv) L/(—~ li) , 
j 4 u w \w / 

2(0-) = £>(()-), (2.6) 

then, according to (2.4) and (2.5), we derive the following 
relations 

X{s)^-wZ(w), Z{w) =-{sfA)X{s), (2.7a) 

y( 5 ) - -w'V(w), M'(w) = -is 4 ^i)Yis). (2.7h) 

Also, the transfer function matrices //^(a) and H^s) of 
systems (2.2) and (2.3), respectively, are related as follows 

Hgis) - -wH,iw), H^(w) = -(s 4 n)Mgis). (2.H) 

From (2.7) and (2.8) it is observed that, for every g.s.s. 
system ( 2 . 1 ), which can always be written in the form ( 2 . 2 ), 
there exists a regular system of the form (2.3) having the 
characteristic in that it behaves similarly to the g.s.s. system 
( 2 . 2 ) after a frequency reversion. TTicse observations reveal 
that there exists a strong interrelationship between systems 
(2.2) and (2.3). This interrelationship is called here duality 
property and the two systems arc called dual. It is noted that 
description ( 2 . 2 ) has the advantage over that reported in 
Zhou et al. (1987), Pandolh (1980) and Chrislodoulou and 
Paraskevopoulos (1985) in that for its derivation no stale 
vector transformations arc required. Finally, it i.s remarked 
that description (1.3J is a special form of description (2.1) 
(let E = E, A - I - A, B ~ ~B and C = C). 

Before closing this section, the equivalence between the 
controllability of the g.s.s. system ( 2 . 2 ) and its dual regular 
system (2.3) will be proven using the duality property 
presented above. This result will be useful for the study of 
the generic controllability that follows. To this end, we refer 
to the results reported in Cobb (1984), where it is shown that 
the g.s.s. system ( 2 . 1 ) is controllable if and only if 

Im (A£ - A) 4 Im £ R", VAeC and Im £ 4 Im ^ R”. 

The above conditions may obviously be written as 

rank [A£ - A ; B] = n, VAel and rank[£;fl]-n. (2.9) 

If we let j = A - /i and upon premultiplying the matrices in 
the rank operators in (2.9) by the matrix (nE ^ A) \ 
conditions (2.9) may be expressed equivalently by the 
following two conditions 

rank [(a 4 ^)£ - I [ B] = n, Va e C, (2.10a) 
rank [£ i £] = fi. ( 2 . 10 b) 

Since, for a -/i, condition (2.10a) is an identity, it suffices 


is well defined for every s^-fi. If wc postmultiply the 
matrix in the rank operator in ( 2 . 10 a) by ^^^.(h/), relation 
( 2 . 10 a) may be written as 

rank [w/-£;£) = Al, Vw 6 C-{ 0 }, w= (2.12) 

Combining relations (2.10b) and (2.12) wc conclude that 
the criterion for the controllability of system ( 2 . 1 ) is 

rank [w/- £ I £) = n, Vw e C. (2.13) 

Relation (2.13) is also the controllability criterion for the 
dual regular system (2.3) (Kailath, 1980). We have thus 
proven that system ( 2 . 1 ) (or equivalently ( 2 . 2 )) is 
controllable if and only if its dual regular system (2.3) is 
controllable, i.c. if and only if 

rank[B : £R| - !£" ^B\^n. (2.14) 

Using the duality property we may also prove (he 
equivalence between the observability (Cobb, 1984), of the 
g.s.s. system (2.1) and its dual regular system (2.3), i.c. the 
equivalence between the observability of the g.s.s. system 
( 2 . 1 ) and the criterion 

rank [C^; £^C^ i - ;(£^)" 'C^l - n. (2. I.S) 

It is mentioned that criteria (2,14) and (2.15) have also been 
derived in Shayman and Zhou 1987) using a different 
approach. 

3. Generic controllability 

This section is devoted to deriving a generic criterion 
(Wonham, 1974) for the controllability of g.s.s. sy.slcms, 
under the assumption that system (1.3) is structured. 

Definition 3.1. The g.s^s. system (1.3) is called structured il 
its system matrix [E \ A ' B ’ C] is a structured matrix. Note 
that a matrix is structured if its elements arc cither fixed al 
zero or mutually independent free parameters. 

Lei N be the number of the free parameters that appear in 
the matrix [£ ; A ; £ ; C] and let 5 denote the yV-dimensional 
vector of these parameters. Partition ^ as follows 

t'-l 0 : r : ;r ; rj], 4 N, 4/V, 4 =/V, (3.1) 

N, N. ^4 

where 0 , r, tt and o are the free independent vectors of the 
parameters appearing in E, A, B and C, respectively, i.e. 
£ - £(0), A = Air), B - Bin) and C = Cio). 

In order to facilitate the generic manipulation of (1.3) we 
present three definitions, one proposition (Proposition 3.1) 
and prove one lemma (Lemma 3.1). 

Definition 3.2 (Wonham, 1974). A variety l^crR" is defined 
to be the locus of common zeros of a finite number of 
polynomials ^ 1 . ^2 • - 

= .= -- 1.2. k), (3.2) 

V is proper if V R^. 

Definition 3.3 (Shields and Pearson, 1976). We will say that 
a property FI, (11(5) :R^-^ 1)) 'i* generic if there cxi.st a 

proper variety V such that; If 

n(5) = 0. then 56 K (3.3) 

Definition 3.4 (Yamada and Luenberger, 1985). An mxn 
matrix A/( 5 ) is said to be a column structured matrix 
(C.S.M) if: 

(i) each clement of A/(5) is either fixed at zero, or a rational 
function of the free parameters 5 = (li. . Sa/) ^ ‘‘nd; 
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(ii) there exists a set of rational functions , .): 
X R i = ]^ , _ ^ n, such that, for i = 1, 2, . . . , n. 

Af(g)diag(l, . . . , 1, fl, 1, . . . , 1) 

= M{q,{la)) and ^^(5,1)=^. (3.4) 

Proposition 3.1 (Yamada and Luenbergcr, 1985). A 
structured matrix is always a C S.M. 


functions of r]. Hence, the first of the conditions, 
appearing in Definition 3.4, is satisfied. 

in order to investigate if the matrix E* satisfies the .second 
condition of Definition 3.4, we .start by noting that since 
£(0) is a structured matrix, then according to Proposition 
3.1 it is also a C.S.M. Hence, there exists u set of rational 

functions fl)(^,(. , .) x RR^'), /-I . n 

such that 


Lemma 3.1 The matrix / - A is generically non.singular. 

Proof. Since the matrix A has been a.ssumed structured, it is 
obvious according to Definition 3.2 that it is al.so a C.S.M 
Hence, the matrix A has v(v4) generically nonzero 
eigenvalues A,, ^ 2 . , A^^^^ which are mutually distinct 

(Yamada and Luenbergcr, 1985). Here v(A) denotes the 
generic order of A, and it is defined by 

v(/i) = max|A 2 del (y4|i,, i,.(S-")) 

where >![*,, 12 * . . . , i*] is the submatrix of A consisting of 
the columns and rows i|, 12 * » ^nd the summations 

range over all possible combinations of integers satisfying 
1 < i, < L < ‘ <ih Hence we have 

del [/ - 4] = (1 - A 1 ~ A-,) ■ (1 - A, (3.6) 

Define 


qir)^6cill - A(r)\. 

Clearly, since A(r) has been assumed structured, the 
function q{r) is a polynomial m the elements of r. Thus the 
values of r, for which i/(r) = 0, constitute a variety V in R^^ 
'This variety is proper (V' ^ R^*), since for r = 0 we have that 
/4(r)=^0 and hence from (3.6) we have 1. Define the 

property 


nM(r)) [i’’ 


If ,,(r) = () 
it </(r)#0 


£(0) diag (1, . . . , 1. fl, 1_1) = E(q,(4>. a)). (3.9a) 

q,(0. 1)=0. (3.9b) 

Defining the rational (unctions 

9.*(10.'■!.«) = [9,(0. 

for i - 1, .... n and using (3.K)-(3.10), one may derive that 

£•([0, r])diag(l.l.fl. 1.1) 

= Z) '('•)£(0)diag(l,. . . , l,fl. 1,_1) 

= D '(r)£(4,(0, fl)) 

= £'*([4,(0. o). '■]) 

= E*{q:(\<t>.r].a)). (3.11) 


where a is in the ith position. Finally, it is remarked that 
from the definitions of q* appearing in (3 10), and using 
(3.9b), we have that q*(\(p, r\, 1) - [q^ip, 1), r] - \<p, r]. 
Thus, Lemma 3.2 has been proven with regard to E*. The 
proof for B* is directly analogous to that for E* ■ 


Further, we present the following two definitions (Shields 
and Pearson, 1976). 


Definition 3.5. The g-rank( ) denotes the generic rank of a 
matrix and we say that the generic rank of [£ j 6] equals to n 
if the properly 


11(|£(0); fl(.'r)l)= 


if rank IE((p) ; B{7i)\ < n, 
if rank \E((I>) ; B(^)) “ n, 


(3.12) 


From this property it follows that n(/i(r)) = (), Vr e V and 
ri(/i(r)) = L Thus |/-A(r)|#0 Vr; r e V'\ From 

this last result, together with Definition 3.3, wc conclude that 
the matrix I ~ A(r) is generically nonsingular. ■ 

Lemma 3.1 is important since it allows the manipulation of 
system (1.3) in the generic sen.se. Indeed, using Lemma 3.1, 
system (1.3) is expressed equivalently as 

+ y = C\. £*-/)'£. 

'B, D^l-A (3.7) 

In the sequel, instead of .studying the original matrices E, A 
and B of system (1.3), wc will he studying the matrices E* 
and B* of system (3.7). The matrices E* and B* have the 
property that they are C.S.M., a fact which will be useful in 
proving the necessary and sufficient condition.s for generic 
controllability. This property is proven in the following 
lemma. 


Lemma 3.2. The matrices E* and B* are C.S.M. 

Proof According to definition (3 1) and relation (3.7) we 
may write 

£* = f:*([0,rl) = D '(r)£(0). (3,B) 

From the definition D = l-A, we readily ob.scrvc that the 
elements of the matrix D(r) arc structural zeros or 
polynomials of r. Thus, the elements of D ‘(r) structural 
zerOvS or ratios having in their numerators and denominators 
polynomials of r Using this last observation, together with 
the assumption that the matrix E(<p) is a structured matrix 
(Definition 3.1), we readily conclude that the element.s o( the 
matrix £*([^, r]) = D \r)E(<p) are .structural zeros or sums 
of products involving rational functions of elements of r and 
0. Thus, the elements of £* arc structural zeros or rational 


is generic. 


Definition 3.6. An n x (n -f m) matrix [£* ; B*\ is reducible 
if for .some permutation matrix P the following holds 


P 

r-S 




(3.13) 

It is irreducible if it is not reducible. 

Using all above results, wc next establish the following 
theorem which constitutes the main contribution of the 
present paper. 



Theorem 3.1. The structured g.s.s. system (1.3) is generically 
controllable if and only if 

(i) g-rank [£ ; B) =/i(full), (3.14a) 

(ii) [D ’£;D is irreducible. (3.14b) 

Proof. As it has been indicated in Section 2 a g.s.s. system is 
controllable if and only if its dual regular system is 
controllable. Thus, system (3.7) (or equivalently .system 
(1.3)) is controllable if and only if its dual regular system 


z{t)^ E*z(t)^ B*v(t), (3.15) 


is controllable. Thercfoie, investigating the generic control¬ 
lability of system (1.3) is equivalent to investigating the 
generic rank of the matrix |B* ■ £*B* 1 . .. [ (£*)” '£*]. To 
inve.stigate the generic controllability of (3.15), wc start by 
noting that in Lemma 3.2 it was proven that Ihe matrices E* 
and B* of the regular system (3 15) are C.S.M. Applying the 
results in Yamada and Luenberger (1985) for a regular 
system with C.S.M. system matrices to system (3.15), we 
have that the necessary and sufficient conditions for the 
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generic controllability of (3.15) are 

g-rank[£* ;5*] = n(full), (3.16a) 

[£*; B*] is irreducible. (3.16b) 

Substituting the deBnitions E* and B* appearing in (3.7) wc 
observe that (3,16b) reduces to (3.14b), whereas (3.16a) 
takes on the form 

g-rank [D ; D = (3.17) 

or equivalently 

g-rank {D''|E ; B]] ^ n. (3.18) 

Clearly, since D is generically nonsingular (Lemma 3.1) and 
its parameters r,, . . . , are all different than 0^. . , 0^, 
and ;t,, . . . , of the structured matrices E and B, 
C>)ndition (3.18) reduces to that in (3.14a). Thus, Theorem 
3.1 has been proven. ■ 

Before closing this section, we focus our attention to the 
solvability of .system (1.3). From Lemma 3.1 we have that 
the matrix / - y4 is generically nonsingular. This means that 
the characteristic polynomial det [sE + /t - /] is not identi¬ 
cally equal to zero since, for i ^ 0, it reduces to det [/ - /t]. 
Thus description (1.3) with E, A and B structured matrices is 
always generically solvable. 

4. Generic observability 

Based on our results for the generic controllability of g.s.s. 
systems wc will now establish the necessary and sufficient 
conditions for the generic observability of g.s.s. .systems. 
Description (1.3) is generically observable if and only if 
description (3.7) is. As it has been noted in Section 2, a g.s.s. 
system is observable if and only if its dual regular system is 
also observable. Therefore investigation of the generic 
observability of system (1.3) is equivalent to that of 
investigating the generic rank of the matrix 

[C^ : (£'*)^C^ ;...; ((£'*)^]” ‘r^|. Following the steps of the 
proof of 'Dieorem 3.1 we readily conclude that g-rank 
[<?’■: {E'fC '... [(£*)^r-'C' | = n if and only if 

!r-rank !(£•)’■ ;C^) = n(full) (4.1) 

|(£*)^ ; cl] i.s irreducible. (4.2) 

Since D is generically nonsingular, then substitution of (3.8) 
in (4.1 and 2) yields 

g-rank [^j = n(rull), (4.3) 

[D 'EV. . , 

- I** irreducible. (4.4) 

We have thus established the following corollary. 

Corollary 4.1. T3ie structured g.s.s. system (1.3) i.s 
generically observable if and only if Conditions (4.3) and 
(4.4) are satisfied. 

5. Conclusions 

In this paper the necessary and sufficient conditions for the 
generic controllability and observability of continuous g.s.s. 
systems are established (Theorem 3.1 and Corollary 4.1, 
respectively). The proposed approach appears to be a very 
useful tool for studying other related problems such as the 
generic stabilizability, detectability and stability of g.s.s. 
systems, as well as the generic controllability and 
observability of implicit systems. These problem.s arc 
currently under investigation. 
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Reachable, Controllable Sets and Stabilizing 
Control of Constrained Linear Systems* 
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Abstract—We consider discrete-time linear systems with 
constrained controls. We present an algorithm for computing 
an external representation of the N-step reachahlc set and 
controllable set. It is an improvement of previous methods. 
In particular, it is shown that for eon.struciing the set ol 
initial states that can be driven to the origin asymptotically, 
one can restrict the search to a polyhedral set in the unstable 
subspacc of the autonomous system, which yields a drastic 
reduction in the computational burden. Application to the 
synthesis of stabilizing leedbacls controllers discussed. 

1. Introduction 

In this paper, we are concerned with discrelc-timc liiieai 
systems with con.\trained controls, ^ 

Uji, ^ u. Polyhedral constraints on the stale can also be 
incorporated. Such discrete-time models can be inherently 
discrete systems or sainpled-dala systems or even ap¬ 
proximations of continuous systems Most real life systems 
have physical limitations on the controls and surprisingly 
enough, apart from papers related to optimal control issues, 
few studies directly incorporate this feature. For example, 
the design ol feedback eontrnllcrs oltcn ignores this feature 
and when a control saturates its bound, the linear system 
exhibits a nonlinear behavior. Among possible con.se- 
qucnccs, stability properties are difficult to analyse and linear 
properties (for state estimation for instance) are lost 

In the design of feedback corilrollers that take into account 
control constraints, some pnlyhetlral set (necessarily included 
in the controllable set) mu.st be positively invariant (see for 
instance Dcnzaouia and Bin gal (l^K8), Biisoris (10S8), 
Hennel and Bezial (1991), Hennct and C'astclan (1991), 
Pajunen and Lrdol (1990), Vassilaki and Bitsoris (1989) and 
Vassildki ct al. (198H)). Choosing such a set is not trivial 
(Blanchini, 199()h). Ideally, one would try to build a 
controller such lhal the controllable set is positively 
invariant. Therefore, comparison n( this invariant set and the 
controllable set may yield a measure of the quality of this 
controller (the larger this set (he larger the number of slates 
that can be driven to the origin). Also, knowledge ol the 
reachable or controllable set helps to solve minimum time 
problems. 

Concerning reachable and controllable sets, previous 
rc,sults in the literature provide numeiical procedures to test 
whether some given stale is controllable or not (see Barmish 
and Schmitendorf (1980) and Van Til and Schmiicndorf 
(1986) lot example) (and sometimes in a more general 
framewoik) while others try to explicitly characterize the ,sel 
of controllable stales (see lor example Desocr and Wing 
(1961), Fisher and Gayek (1990), Gutman and Cwikel 
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(1987), Hamza and Rasmy (1971), Lasserre (1987), Lasserre 
(1991a), Rumchev and James (1989) and Wilsenhausen 
(1972)). Our contribution falls into the latter category of 
papers. 

We first determine an exact external representation (i.c. 
bounding hyperplanes) of the A-step reachable set from any 
given initial slate and the controllable set, i.e. the set of 
initial states from which the system can be driven to zero in 
A-steps. It is worth noting that if zero is an admissible 
control, the A-step controllable set is an inner approximation 
(the larger A the better the approximation) of the set of all 
controllable states, which is nol polyhedral in general. Our 
method is different from others in the literature (Hamza and 
Rasmy (1971); Lin (1970); Ciutman and Cwikel (1987)) and 
is less demanding in terms of computational burden, a critical 
issue in this ease. An interesting exception concerns the 
two-dimensional case where an exact characterization with a 
closed-forjj|i expression has been p^rovided with an algorithm 
of polynomial complexity, (Sec Lasserre (1991a) for a 
lepie.sentation in terms of bounding hyp>erplanes (exiernal 
representation) and Fisher and Gayek (BM)) in terms of 
vertices (internal representation).) 

More importantly, after some analysis, wc show lhal it 
suffices to restrict attention fo the unstable subspace of the 
autonomous system. Once a controllable set C’ is determined 
in this subspace, it is shown that all the stales in a cylinder 
(whose projection in the unstable subspace is C) can be 
driven to the origin asymptotically. This technique can yield 
substanlial computational savings since the dimension of the 
problem is decreased. For cxampilc, it there arc only two 
unstable eigenvalues, the problem reduces to finding a 
controllable set in dimension two for which a closed-form 
expression and a computational procedure in polynomial 
lime exist (see again Lasserre (1991)). 

Finally, in the last section we present a technique to build 
a tccdback controller that satisfies the control constraints. 
Given an initial state whose projection in the unstable 
subspacc is in C, wc first drive this projection to a smaller set 
C' by using a simple feedback control. Then, using simple 
linear algebra techniques as in Hennel and Bczial (1991), a 
stabilizing linear slate-lecdback coniroller is buill such that a 
set (whose pio|cciion m Ihc unstable subspacc is the set C') 
is pKisilively invariant for the closed-loop system. 

2. Preliminary results 

In this section we present preliminary results that will be 
extensively used in the computational procedure of the 
A-step reachable and controllable sets, presented in the next 
section. We show how to compute in a simple manner the 
Minkowski sum of a convex polytopc and a line segment in 
R”. T he bounding hyperplancs of the resulting polytope are 
deduced from the original polytope by simple formula. 

Consider an (m, n) matrix A and a vector b g such that 
(ji e y?” I >4.*: ir 6} defines a nonempty (bounded) convex 
polytope Q Let /i be some vector in R^. Let us recall that 
the Minkowski sum + Q 2 convex polylopes Q, and 

Qy IS the convex polytope (x | 3y e z e x = y + z). 
Let /(C2) be the set of integers (1,2, . . . , r). which are the 
row indices of the matrix A. Let y ' be the vector with /Th 
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component max|0,>'J. Moreover let vertex (O) be the set 
defined as: 

I,. I(Q); (»i. 12 , ■ • . e vertex (fl) 

»3jcei2, V;t = l_p. 

where stands for the i* row of matrix A and b,^ for the 4 
component of h. 

Proposition 2.1. The Minkowski sum S{Q, h) of the convex 
pfolytope n and the convex polytope {y | y = A e (0, 1)}, 
is the convex polytope defined by 

Ax^h + {Ah)\ (1) 

{v,A, + v^^)x < v^b, + v^bj, V(i, j) e P(S2), (2) 

where P(n) c vertex(fi) is defined as 
P(Q) = ((i, j) 6 vertex(fi) | {A,h ). < 0, 

i.e. 3v,, Vj ^ 0, {v,A, + = 0). (3) 

Proof First we prove that /i) is the polytopc defined by 
(1) and (2) where in (2), instead of P{Q) we consider the set 
A/(n) of all the pairs {i, j) such that {A,h). (Ajh) <0. Then 
we show that only those pairs in vertex (Q) arc worth 
considering. 

Consider an x which satisfies Ax^h. Then /l(x + Ah):s 
b + (Ah) * and {v^A^ + VjA^)x ^ v,h, + v^b^ for all (/, ;) which 
satisfy (A^h). A^h) < 0. 

Conversely, consider a vertex x* of the polytope defined 
by (8) and (9). It must satisfy: 

Ay = f). + (A,h)\ Vi e J(x*) c: /(fl), 

{v^A^ + VpAp)i = + Vpbp, V(^. p) E M'(fi) CL M(U), 

for some subsets I(x*), M'(Q). Actually, either A,h -^i) or 
A,h^i) holds for all / e/(x*) otherwise if A,h>i) and 
A^h <0 for some i, j e f{x*) the constraint i^A^x + i^^A,x ^ 
v,h, + Vjhf is violated for the scalars v,, which satisfy 
ViA,h 4- VfAjh ~ 0. 

(i) Consider the case where A,h >0, V/ 6 /(x*). 

Then, A^x* - h) = b,, Vi6/(x*). Moreover, A^{x* - h)^ 
for all k $ 7(x*)y A^h ^6. For those k such that A^h < 0 
wc cannot have A^(x* —since otherwise the 
constraint t;,A,x 4 v^A^ s v,h, 4 is violated at x* for the 
scalars v,, which satisfy v,A^hv^A^h = i), /e/(x’). 
Hence A(x* - h)^h. 

(ii) Consider now the case where A,h sO for all i e i{x*). 

A,x*^b,, Vie/(x*) and for all k i I{x*), s.i. A^/i^-0. 
Moreover, Ay>b, for some i such that A,h '0 is 
impossible, since otherwise the constraint {v^A, ■¥ v^At,)x ^ 
i»,6,4 which the scalars u,, satisfy 4 

Vi^Ai^h = 0, k eHx*) \s violated at x*. Hence Ax* ~h. 

In summary, any vertex x of b) is the sum of an 
element y of Q and kh with A being either 0 or 1. Therefore 
h) is the Minkowski sum Q 4 A/i. However, many 
constraints of the type (2) are in fact redundant. 

As before, consider a vertex x* of 4 A/i. 

(i) If A^x* = f?, 4 Ajh, A,/i 2:0, Vi e /(x*) then wc have seen 

that X* - h is a vertex of fi. A constraint of the type (2) is 
not strictly redundant if and only if it is binding at some 
vertex of 5(Q, h). Consider a binding constraint 4 

u A^)x at X*. Since x* ~h is a vertex of £2, the constraint is 
also binding at x* - /i. But since x* - h is a vertex of this 
is possible if and only if Ai,(x* -h) = b^ and A^(x* - h) = bp 
which means that (fe, p) e vertcx(Q) and therefore {k,p)e 

P(n), 

(ii) If A,x" = ^, Vi6/(x*) (and A^h < 0), then we know that 
X* is also a vertex of Q. Therefore, the only binding 
constraints (v^A, 4 i 4 A^)x binding at x* arc those for which 
necessarily Ay — b, and A^x*~ b/,. But again this means 
that (i, A) e veriex(ri) and therefore (i, A)eP(Q), so that 
the proof is complete. □ 

An immediate consequence is the following: 

Proposition 2.2. 'Hie Minkowski sum |^|) ol the convex 
polytope Q and the convex polytope [y \ y = kh, A e (-1, 4 


1)} is the convex poly tope defined by 

Ax :£ 4 |AAi|, (4) 

(v,A, + v,Ai)x^v,h, + v,h,, V((,/) e/’(Q). (5) 

Remark 2.1. Let us call A^ the matrix of the constraints 
which define 5(£2, |/i|). Let be the index of the 

constraint (t>/A, 4 VjA^)x s tf^b, 4 From the demonstra¬ 

tion of Proposition 2.1 we also conclude that 

• (i,y) e vertex (,V(Q, |A !))<=> (i,/) e vertex (£2) and 

• (i, n(i,/)) e vertex (lS(£ 2, |/i|)) and (/, n(/, y)) 6 vertex 
(S{Q, \h\)) 

• (i, n(p, ^)) 6 vertex (.V(£2 |/i|))C^>((i, p) or {i,q)) and 
(p, q) fc P(H), and (i, p, q) E vertex (Q) 

- n(/. p))g vertex (.V(£2. |^|))0(i,y), (f. p)fcP(£2) 

and (i, y. p) e vertex (H) 

• (n(i.j), n(p, ly) 6 vertex (5(£2. |/i|))0(i, y). (p, ty) e P(Q), 
(i, p) and (y, q) 6 PfQ) or (i, q) and (y, p) e P(£2), and 
(*» y. Pf 9) f vertex (£2). 

Therefore, to determine all the pairs of rows of A,, which arc 
in vertex (.*>(£2, |/t|)) it is not necessary to check all pairs. 
This property will be used to save computations in the 
procedure below. 

3. Reachable and controllable sets 

In this .section wc present a computational procedure to 
determine the boundary hyperplanes ol the N-step reachable 
and controllable sets. No controllability assumption is 
required, and the matrix A is not required to he 
non-singular. If the system is not controllable, then the 
reachable and controllable .sets will be degenerate, i.e. they 
will be contained in an affine variety of dimension strictly less 
than n. We will just observe that some constraints are ol the 
form gx^r and i.e. they define the hyperplane 

gx — c. Wc also compare our procedure with others m the 
literature. For simplicity of notation we restrict to discrete 
time-invariant linear systems with bounded controls, but m 
the sequel, it will become obvious to the reader that the 
same methodology applies to non-slationary linear systems. 

Let us consider a discrete time-invariant linear system with 
hounded controls 

x^^ — Ax^ 4 , uTiUf^^-u, k - , N, (fi) 

where x is the state vector (eR''), u is the control vector 
( E /?"*), A is the matrix of the dynamics of the uncontrolled 
system, and y, u are the (vectors) bounds on the control 
variables. 

For any matrix M, Af, will stand for its /th row vector and 
for any vector u, |u| will stand for the vcctoi ol absolute 
values. 

3.1. A computational procedure. Let us recall that the 
N-.step reachable set is the set of v e R'‘ which satisfy 

N I 

y-A\,= A‘‘Bu,. |u*|£u, VA, 

k -d 

for some control sequence m,,. . . . , ,, or equivalently 

.V - A\, = X X e 

A — 0 p - \ 

where stands for the yjth column of B and where, to 
simplify notation, the index of control at period k has been 
switched from k io N — {k -E \). 

The last equation simply means that y - A^X(, is the sum of 
Nm polytnpes (line segments), each one of them being 
defined a.s ku where v is some vector of K” and A a scalar in 
some interval (-a, 4a). I’hesc polytopes are in tact zonoids 
(Bolker (1969); Witsenhausen (1972)). Therefore, since the 
Minkowski .sum of piolytopcs is associative, the idea is to use 
Proposition 2.2 iteratively to compute the bounding 
hypcrplancs of the reachable set. 

Let R(k,p)k^N-\. p ^ m, be the convex polytopc 
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defined by 

/J(l, p) = jjr 6 R" I jf = e -(u,, u^), V;j, 

{k 1 m 

xeW" |jt= 2 

1-0 i = l 

^ ^ i/» w, ^ € ( —iiy, u^), Vf, y ^ 2. 

Therefore assume that 

/?(/c, /?) = {jf e I n(/:, /j)jr s b(k, p)}, 

where Q(A,p) is some (r(/c, p), n) matrix and b{k,p) some 
r(fc, p) vector. 

Then, according to Proposition 2.2, if p < m. 
R{k,p-^])=^S(R(k,p),\A'^ 
and if p -- m, 

R(k^]. \) = S(Rik,m).\A^B^'^\r4^), 

where 5(. , .) is the Minkow.ski sum defined in Proposition 
2.2. 

One way to initialize the procedure is to compute R(\, 1) 
which is the convex polyhedron 

(jr I - I_ri - I}, 

where w,, i = 1, . . . , n - 1 arc n - \ linearly independent 
vectors orthogonal to i.e. which sati.sfy uJB^^^-A), 

i ~ , n - Therefore we have. 

Theorem 3.1. If p>l, R{k,p) is the convex polytope 
(deduced from R{k,f)- I)) delined as the set of all jt e/?" 
such that 

r2(A.p- {)x^h{k,p- \ )-\-\iMk,p - (H) 

p ™ 1) + VjQ,{k, p - l))jr -I v,h,{k, p - 1) 

4 u,/7,(A:,P - 1). (fJ)F_PiRik,p 1)). (9) 

and similarly if p “ 1, R{k + 1, 1) is defined as the set of all 
X f ff" such that 

i2(k. m)x^b(k. m) + \Q{k. w,. (10) 

(i\i2,(/c. m) + v^Q^ik, m))x " i\b,{k, m) 

+ m), {i.j)eP{R(k,m)l (11) 

At each step, the number of bounding hyperplancs increases. 
This number depends on the cardinality of the set P{Rik, p)) 
which in turn highly depends on the dimension n of the state 
.space. It is also immediate to check by induction that the 
right-hand-side h{k, m) is always positive for all k, m. 
Moreover, since all the constraints on the controls arc 
symmetrical, R{k, p) is a symmetrical convex polyiope for all 
k, p. i.e. the constraints are of the form ±cx - a. Therefore, 
we immediately have the following. 

Lemma 3.1. The /V-step reachable set from inilial state x^^ is 
the set of vectors ye/?” which satisfy: 

|fi(N, m)(y - HbJ. ni). (12) 

and the N-siep controllable set is the set ^^f vectors y e R"' 
which satisfy; 

|Q(/V, m)A^y\ h(N, m), (13) 

where m) has twice as few rows as in Theorem 3.1. 
Note that the /V-step controllable step is an inner 
approximation of the set of all controllable states Uyv,-i ^ n 
since if x g then x f for all p > A/. 

Remark 3.1. In the computational procedure, to obtain the 
hyperplanes in (9), a large number of pairs (/, /) need not be 
considered. It suffices to apply the rules described in Remark 
2.1 to characterize vertex (R{k,p)), i.e. the pairs (t,y) of 
interest. Following thc.se rules yields great computational 
savings. 


Remark 3.2, Note that C,^ is the .set of all the states that can 
be driven to the origin in finitely many steps. When the 
system is not controllable, some states not included in can 
be driven to the origin asymptotically (sec Gutman and 
Cwikel (19K7) for a simple illustrating example). 

3.2. Comparison with other procedures. For linear systems 

with general linear constraints, a complete characterization 
of both reachable and controllable sets is presented in 
Lasserre (1987). The associated computational procedure, 
based on Farkas’ lemma (Schrijver (1986)), is not realistic 
even for moderate size problems. For two-dimensional 
systems, computation is tiactable and easy and even a 
closed-form expression of both sets is given in La.sserre 
(1991a). In the case treated here, Hamza and Rasmy (1971) 
have proposed a procedure to compute the bounding 
hyperplanes of the reachable set in the single input case with 
obvious changes for the multi-input case. Their procedure is 
an improvement of Lin's procedure in Lin (1970), where 
computing the extra hyperplancs of the type (9), (11) 
requires solving („"’%) (a large number even for reasonable 
N) linear systems of n - 1 equations in n - 1 unknowns. In 
Hamza and Rasmy (1971) still Imear systems need be 

considered but alter a simple lest, the new hyperplane 
eventually built is also a linear combination of two vectors 
like in our procedure. However, m our procedure; we need 
not consider those („'”%) linear .systems. The hyperplancs are 
obtained directly from those of R{k, p ■ 1). The pairs of 
rows candidate to build a new hyperplane, arc obtained bv 
using Remark 2.3 without calculations. 

In a more general setting, Gutman and C'wikel (1987) have 
proposed a procedure to compute controllable sets and 
reachahles sels. They consider the case where the state is 
constrained to belong to some prescribed set X and the 
control IS con.slraincd to belong to some set £2. However, 
their procedure requires knowledge <ol the vertices of and 
at each step the set of vertices of R{k, p) The vertices of 
k{k + 1, m) are deduced from those of R{k, m) by adding m 
all possible ways a vertex of R{k, m) and a vertex of £2 and 
then taking the convex hull of all these points, a tedious 
procedure. Moreover, the intersection with the set X must be 
computed and finally the vertices ol this new polytope mu.st 
al.so be computed which requires a highly combinatorial 
procedure with some linear algebra. 

In the ca.se where the control set is defined as 
(m eI M =1 t2w w) where £2 is a nonsingular square 
matrix, then by an obvious change of vHriablc^, one is back 
to the case ij ^ u li. Al.so, our procedure can incorporate 
state constraints of the type Mx^ ^ a. Indeed, once Rik, m) 
has been computed it suffices to intersect it with 
{x\Mx'^a}, I e. the new R{k, m) slM is now ilchned 
through the inequalities; 

£2(A:. 1 )a - 6(A:, m - 1) + |A" 

(i;,£2,(fc, m ~ 1) + u^£2^(A', m ~ l)).v v,b[k, m - 1) 

+ ni ^ 1), 

V(i.7)GT(/<(it. m- 1)). 

Mx a. 

The only extra work required is to determine at each step 
(A, m) the couples (/,/) of rows of this matrix which arc in 
vertex {R(k, m)) 

3.3. A particular rarr. An interesting particular ca.se is 
when A' - fil for some integer r and some scalar /). Then it 
suffices to compute the set R{r, m), and the /Vr-step 
controllable set is the convex poly tope defined by 

I “ 

|SJ(r, m)/('^Jt| s l>{r, m), 

and when |j9| > 1, the closure of (he sel ol all the controllable 
stales, i.e. the set C = Un -i t’/v. 'ho convex polytopc 

|iJ(r. m)jc|£—m) 
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When \p\ < 1 (A stable matrix) and the system is 
controllable, C is the whole space 

4. Asymptotically controllable set 
In this section we refine some results of the previous 
section. In most of the practical applications, one wants to 
design a stabilizing control so that we only need to know the 
set of states that can be driven to the origin asymptotically 
and not necessarily in finitely many steps. With this in mind, 
we show that we only need to focus on the unstable part of 
the matrix A. We first informally discuss the underlying idea. 
Without loss of generality, supp>osc that A is already in the 
block Jordan decomposition form: 



where A, is a {s,s) matrix with (unstable) eigenvalues of 
modulus larger than 1 and Aj is a n — s) matrix with 
(stable) eigenvalues with modulus less than 1 . A vector 
xeR" will be written [jfj.jrJ where x^eR' and ’ 

According to (13), the N-sicp controllable set is the set (\ of 
vectors x e R" which satisfy 

|Qi(A/, rn)Aj^x, + f^)A 2 X 2 \ ^ b{N, m), 

where Qi(N, m) and Q 2 (^''”) submatriccs (of 

appropropriate dimension) of Q{N, m). 

As the set of asymptotically controllable slates is 

unbounded in the subspace spanned by the last n — s columns 
of A. 

To sec this, consider a vector x e f?” which satisfies 

|Q,(/V, frTM^*,|£h(Af, m) (14) 

By definition there is a sequence of controls u^^, . . . , , 

such that the initial state [jc,, 0 ] is driven to zero in N steps. 
Therefore, given the initial state and this 

sequence of controls, the slate Z/^ after N steps is 
^ \ 

2^ = 0|+ 2 +/4~[0.^j] = 0 + /1?'|0,jt,|, 

A=-() 

and thus, 

Therefore, any such stale can be driven to the origin 
asymptotically if one applies the control w - 0 at periods 
k ^ N. Tlius, we observe that if one focuses attention on 
asymptotic controllability only, it makes sense to restrict the 
analysis to the A-unstable subspace. The .set of asymptoti¬ 
cally controllable slates should be a cylinder in R'’ and the 
directions of the cylinder should span the >l-stable .subspace. 

However, to compute the matrix Qj, we still need consider 
the whole system in R". The question is, “Can we compute 
the asymptotically controllable sel by just reasoning in the 
(n - j)-dimensional A-unstable subspacc? ’ The answer is yes 
and therefore great computational savings can be expected. 
L^t us consider the linear system in R' 

i + * = f,- N-\ (15) 

- Uf^ ^ ^ Uf^, k - 0, . . . , N " \ , ( 16 ) 

where L is the (.v, n) matrix [l^ | 0] and /, the (.v, .v) identity 
matrix. 

This system is the original system where the stale y e is 
just the firsl s components of the state x e Suppose that 
we compute the N-step controllable sel for (15)-(16) defined 
as 

C%-^(ytR' I |Q.(/V,m)/4^>-|fifc,(N,m)}. (17) 

We have the following result: 

Theorem 4.1. For all N^], any initial state jr = [jr,,X 2 l 6 
C^x /?" * can be driven to the origin asymptotically. 

Proof. By definition of C%, jt, 6 there exists a 

sequence of controls Uf,, . . . , u^y , such that after N steps 
one reaches a state of the form [0,z], z e R'' \ Now at 
periods ^ ^ -H 1, apply the control ~ 0, V/c. Since A^ has 
all its eigenvalues with modulus less than 1 , starting from a 


state [0, z| and applying the sequence of controls 
0,. . . , 0,. . , , the system is driven asymptotically to the 
origin. Therefore, all the states x e x J?" * can be driven 
asymptotically to the origin. □ 

Remark 4.1. This set C% is easier to compute than the one 
described by (14) since it only involves vectors in R^ instead 
of R", It is also larger, because the vectors x, satisfying (14) 
have also the property that the slate [x,, 0| can be driven to 
the origin in N steps, which is not true in general for a vector 
jr, € rjy. Of course, since a C% for all N' ^ N, the larger 
N is the large! is C^. _ 

Ideally one would like to compute C* C'lv T"his set 

is convex but not a polytope in general. Each C)y is an 
inner approximation of T"". As N increases, more and more 
bounding hyperplanes are created. is a convex polytope if 
and only if A^l - A7, for some p and some scalar A. However, 
if the smallest eigenvalue (in modulus) is not too close to 1, 
one can expect that C^y is a good inner approximation of C' 
for rea.sonable values ol N. 

Remark 4.2. If A, is a (2,2) matrix, i.e. if s=2, then a 
closed-form expression exists for C)y and a polynomial time 
procedure as well (see Lasserre (1991a)). 

5. Stahilizinff state-feedback controllers 

In the pa.st few years, some rc.scarchers have focused on 
methods to compute stabilizing state-feedback controllers in 
the presence of constraints on the controls. In Gutman and 
Cwikel (19K7) for example, some sophisticated nonlinear 
controllers have been proposed and are computationally 
involved. However, all the controllable .stales can be driven 
to the origin. The linear state-feedback controllers such as 
those described m Blanchini (1990), Benzaouia and Burgal 
(1988), Hennet and Be/.iat (1991), Vassilaki and Bilsoris 
(1989) and Pajunen and Erdol (1990), are easier to build but 
with a smaller controllable region. Some polyhedral sel M is 
chosen and one computes a linear state-feedback controller 
u - Fx such that M is positively invariant for the closed-loop 
system + i (A + BF)xj^ and is included in the polyhedron 
(jc I |Fjr| u). Therefore, the sets C\ built in the previous 
section could be used to evaluate the quality of the controller 
F. The set M should be as close as possible (in the Hausdorff 
metric for example if they are bounded) to reasonable 

values of N. Ideally, one would like to compute a 
state-feedback controller such Ihat is positively invariant. 

If the set M is too large, a linear state-feedback controller 
may not exist. On the other hand, one would like to control 
states not necessarily clo.se to the origin. 

As a possible tradeoff, wc propose a simple feedback 
control law. The idea is to first drive the system to a 
prescribed relatively small set around the origin and then 
build a linear state-feedback controller such that this set is 
po.sitivciy invariant. 

Assume that wc have computed off-line the set lor 
some N. Suppose also that the initial slate x ^ (jr,, X 2 \ is such 
that ^ 

ISJJA', m)/(^'jr||sbJ/V, m), 

i.e. If can he driven to the origin asymptotically (and its 
projection x^ in the A-unstable subspace can be driven to the 
origin in N steps). Driving an initial stale to the origin would 
be achieved in two steps. First, by using a simple feedback 
controller we drive the projection x^ of the initial slate 
fx,, ^ smaller set around the origin, C\ for example (or 

aC\ for some positive scalar cr<l). At this time the state 
z = |z,, Zj] is such that 

2|ec; and \\z^\\^K=\\a’: ||.t 2 || + d.5, (18) 

where 6 = max, | H,u and S = E;io IMzIl- 
Then, by using Linear Algebra techniques, as in Hennet and 
Beziat (1991), wc build a linear state-feedback controller 
u^Fx such that the set F — C] X [-7C, AC]" " is positively 
invariant by the linear mapping A + BF. A nonlinear 
feedback controller driving an initial slate [jr,, jr 2 | from C)y to 
say C’j is described in Lasserre (1991b). 

5.1. A stabilizinf^ linear state-feedback controller. Now wc 
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assume that the initial stale X 2 \ has been driven to a state 
[zi, Zj] 6 r = C'j X [-A', A']” \ We want to build a linear 
state-feedback controller u - Fx such that F is positively 
invariant for the linear mapping A F BF and is included in 
the set {x\Fx:^u}. Such a controller is proved to be 
stabilizing (Bitsoris (1988); Hennet and Beziai (1991)). 
Various techniques have been developed to compute such 
controllers (Hennet and Beziat (1991); Vassilaki et ai 
(1988); Pajunen and Erdol (1990)). Let us rewrite (18) as 

r~ { V e/^" I |Cy| s. w), 

where 


c = 

"fi.(L m) 

_ 

1 

0 ~ 

, (0 = 

6,(1. m) 


0 1 



K 


Then by Parkas' Lemma (Shriver, 1986), L is positively 
invariant by ^4 HF if and only if there exists some matrix 
/y, such that 

C(A + BF) - //, r and |//, | (o •' w, 

i.e. |//1 “ / is an Af-matrix (Ciantmatcher (1959)). 

Moreover, to be consistent, the control u - Fx should 
satisfy |m|5;:u, i.e. xeV^^Fx^ u. Therefore it suffices to 
find two matrices /Y,, H 2 such that 

C(A T BH.C)- H,C 

|H,1 oj'- (1 - e)tij 
|f/j| 10 U. 

for some 1, and Ihc linear strile-lcedbaLk controller is 
u = H^C'x. This feedback controller can be computed by 
simple Linear Programming techniques 


5.2 Example. Let us consider the following system 
already considered in Hcnncl and Laslelan (1991), but now 




-0.219 

1). ^83 

0 529 

0 

0 526' 

0.047 

0.519 

-0 671 

0 687 

-l.(K)3 

0 

0 831 

0 

0.589 

-0 654 

-0.679 

0 

0.383 

0 930 

0.416 

.-0,934 

0.053 

0.067 

- 0.1.39 

0. 


r 

0 910 

-0 328 





0.762 

0.633 

0 




0.262 

0 756 

-0.723 

"a. ^ *■ 



0.047 

0 

0 753 




0.736 

0.365 

().651_ 


a change 

ol 1 

nasis to 

get the 

block 


■ ( 1 . 


After 

decomposition of A we finally consider the new system 



■-0 

3025 1. 

0279 0 

0 


0 


-1 

.0279 ‘ 0. 

3025 0 

0 


0 


0 

0 

0.3455 0 


0 


0 

0 

0 

0.7452 

0.1402 


_ 0 

0 

n 

-0.1402 

0.7452 



' 0.5130 

-0.0789 

0.5639- 





-0.5286 

0.3307 

0 0184 



X 

k + 

-0.2760 

-0.3264 

1.2683 


k -0, 



0.1382 

0.(K)95 

0.‘1189 





_ 0.4160 

0.7176 

-0.7071. 



Since there 

are only two unstable complex 

eigenvalues, 

restrict the analysi.s to the two-dimensional sysiei 

m 


- 

-0.3025 

1.02791 





1 “ 

-1.0279 






0.5130 

-0.5286 

-0.07K9 

0.3307 

0..S639] 

O.OI 84 J"*' 

k 

= 0, . . . 


We have a two-dimensional system with a (2,3) control 
matrix B. In (Lasserre (1991a), a closed-form expression for 
reachable and cvmtrollabic sets was derived for two- 
dimensional systems but with (2, 1) or (2, 2) control matrices. 
It can be adapted to this case by separating the (2, 3) matrix 
into a (2,2) and (2,1) matrix. Two controllable sets are 
computed separately for each control matrix and finally the 
controllable set is the Minkowski sum of the two sets. After 
some calculations, we get 




-1.4158 2.4703' 
-5.3713 3.0339 


'2.469 

3.403 




4.4869 1.4812_ 

L -1 

_3.8725_ 

J 




X 6 I ± 


2.5848 
4.1.315 
-0.1289 
2..5(HI6 
-2.1828 
-4.9173 
1.2641 
0.4173 
1.7734 
-0.0326 
-0.4671 
0.9037 
- 1.2977 
-0.4485 
1.0916 
-0.5503 


0.6167' 

4.0096 

-2.4765 

-4.7577 

0.7699 

-0.9809 

1.7503 

4.660b 

0 

1 

-1.5873 
-0.2351 
0.8376 
-0.7467 
0.9402 
0.5988 _ 


-0.3025 
-1.0279 


1.02791^ 
-0 3025 J 


X-., 


12.67' 
25 60 
10.23 
24 62 
11.52 
23 92 
9.29 
19.18 
8.82 
4.2 ' 

7.08 
4.63 
7.78 
3.70 
6,55 
3.90 . 


and a stabilizing linear slale-leedback of the form 


u - Fx 




0.2137 

0.7072 

0.2979 


-0.8722 0 0 
0.2335 0 0 
-0.5198 0 0 


0 -*2 

-0.0390 r, 

-0.0481 . 


L ^ s 


Starting from an initial stale y,, - [0.6925, 3.5521, 10, 10, 10], 
where x.,- |0.6925. 3,5521] is in Q and applying successively 
the controls [-1, 1, -1], [-1, I, 1] and [1; -1; 1] one reaches 
the state [0.4483, 1.2564, 1.9945, 6.6314, 0.6939] where 
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2 - 1.5 - 1.0 - 0.5 0 0.5 1.0 1.5 2.0 2.5 

Fig. 1 Example of state trajectory. 

jc, = [0.4483, 1.2564| is a vertex of C\. Then, one applies the 
linear state-feedback control u = Fx. In Fig. 1 the 
corresponding trajectory is displayed in the /4-unstable 
subspace only. The solid line is the contour of C’, and the 
dashed line is the trajectory. 

6. Conclusions 

In this paper wc have presented a simple new method for 
aimputing the reachable and controllable sets for input- 
constrained linear systems. In particular, to characterize the 
set of initial states that can be driven to the origin 
asymptotically, it suffices to restrict attention the unstable 
subspace of the dynamical matrix, thus yielding .significant 
computational .savings. A stabilizing feedback controller has 
also been presented. 
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Necessary and Sufficient Conditions for Global 
Optimality for Linear Discrete Time Systems* 
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Abstract —This paper extends the results introduced by 
fluong and Li (1989, /m /. Control, 50, 2341- 2347) to 
discrete lime systems. Necessary and sufhcienl globally 
optimal conditions, in the form of a matrix eL|Udlion and a 
matrix ineL|ualily, are presented lor the existence ol the 
optimal constant output teedback gain ol discrete time 
invariant system. Furthermore, it is shown that it the optimal 
output gain /.„ exists, it must salisly where is 

the optimal stale feedback gain An example is given to show 
that d globally optimal output law may not he found even if 
the system is stabilizable by output feedback (i.e. 

c/))i(C ^)). These results shed some light on the 
fundamental issues ol output feedback and justify suboplimal 
solutions. 

I. Introduction 

CoNSiDiK till rrii.iowiNfi discrete time invariant control 
system. 

X, ,, F Hu^, (1) 

( 2 ) 

where e is the slate vector; e Jl("' is the control 
vector; and A c H c and C e ))V are constant 

matrices. Assume the system given above is stabilizable and 
the matrix C has a lull-row rank (i e rank (f ) - /) l.ct the 
performance criterion have the quadratic form 

J ^ ^ ( 3 ) 

where R e symmetric positive delinite matrix, 

Q ~ h 'b and (A, F) is detectable. 

Concerning the above optimal control problem, il is 
known that there are two kinds of optimal constant Iccdback 
problems- 

Problem 1. Let 7(x,). Kx^) denote the cost (3) when the 
control lakes the slate variable feedback lorm and 

the initial state is a,,. Consider finding the optimal constant 
leedback gam e IK so that 

A„, rj'^y(a'n. J. Var e ! VA e IK. 

where IK ^ {K\a{A f/fA)c UJ),} in which o( ) denotes the 
spectrum of a square matrix and HI), the unit ilisk m the 
complex plane. 

Problem 2. I.et ifa:,,, Ava) denotes the cost (3) when the 
control takes the output feedback form = A>\ and the 
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initial state is Consider finding the optimal constant 
feedback gam A„ e IL so that 

7(a„. AyJ, Va f VL e H, 

where IL = (L. a{A 4 BLC) c" O,}. 

Il IS well known that ul ~ A„jr^ represents the optimal 
control law for Problem 1 where A,, is the solution of an 
algebraic Riecali equation. Due to the impressive approaches 
to the matrix Riccati equation, the existence and uniqueness 
problems have been solved completely (Dorato and Levis, 
1971; Franklin and Powell, 1980). 

However, il is often the case that a complete set of state 
variables is not directly available for feedback purposes. 
Therelorc, the output leedback problems such as stabi¬ 
lization, pole assignment, and optimal control have greater 
value than complete slate variable feedback problems Irom 
the practical viewpoints. 

rhe optimal output feedback problem has received great 
attention. Direct minimization of cnlerion (3) would produce 
feedback gams that are dependent upon the initial slate a„ 
In order to eliminate this dcf>cndency, it was proposed 
(O’Reilly, 1980) that the expccled value of criterion (3) be 
minimized assuming that the initial stale is normally 
distributed with AJa,,}--!). Therefore already obtained 
results are suboplimal instead of globally optimal. Until now 
the fundamental problems on optimal output leedback have 
not been answered and still occupy control system 
researchers. These fundamental issues are as loilows. 

(a) The existence and uniqueness of an optimal control 
solution when the system is stabilizable by oulpul feedback. 

(b) The relationship between the optimal law of Problem 2 
and that of Problem 1. 

This paper studies the above issues which relate to 
Problem 2. I’he contribution can be summanzed as 
answering the existence and uniqueness problem, showing 
the relationship between the optimal oulpul feedback gain 
and the optimal stale variable feedback gain A,,; that is, if 
exists, there must be Ly^C - K^y I'hc technique in the 
present paper more closely follows that ol the eontinuous- 
lime system counterpart ol Huang and Li (1989). 

This paper is organized as follows. In Section 2 the global 
optimality conditions to Problem 2 are applied and the 
necessary and .suflieient conditions of existence are derived. 
The mam Theorem 2 is introduced m Section 3. An example 
is presented in Section 4. It shows that the system, in some 
cases, may not have an optimal output feedback law even if 
it IS stabilizable by output feedback. 

2. A necessary and sujficicnt c ondition of existence 

Propcfsitiori. Consider the system (1). (2) and (3). If the 
control Uf, - LC\y such that A + BLC is stable, then the 
performance index (3) can be rewritten as follows; 

y=jr, V xU{A + HLC)'r(A + BI.C) 

A II 

+ Q + {L(YR{LC)~P]x,. (4) 
where P e is a positive definite matrix. 
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Proof. Noting that for a stable system = 0, observe that 

We can therefore add zero, in the form of the left-hand side 
of the above equation minus its right-hand side, to the 
performance index (3). Also, by substituting - LCx^. we 
get (4). □ 

The equation can be written as the perfect square of a 
norm with respect to (B^PR + R) (McReynolds, 1%6): 

(5) 

Notice that the minimum cost (5) is reached when the 
summand is zero (assuming L - L,, is the optimal output 
gain). In other words, when the matrix equation 

{A + BL„CyP{A + BLt,C) + Q + (L„C)^«(Z.„C) - P = 0. 

(6) 

Theorem 1. 7 = jr„/’o^r) ~ ^-uyk minimum cost 

and optimal control, respectively for Problem 2 if and only if 
the following hold; 

(a) (A + BL„CfP,,(A + BL„C) + Q 

■¥(L„CVR(L„C)-P„ = y). (7) 

(b) {A + BLCyP„(A + BLC) + Q 

+ (LCfR(LC)-P„i:ti. V/.eL (8) 
The proof of Theorem 1 follows from equations (4) and (5). 

3. Main result 

In order to derive our main ITicorem 2, we first need to 
introduce a lemma. This lemma has been proven by Huang 
and Li (1989). 

Lemma 1. For any matrix S e S 9^0. and any full row 

rank matrix there exists a matrix X e 

” 1, a vector v-e JH'’, |(u|l:,, and or^^O such that 

v' [SXC + iSXC) '']v < -- fr. (9) 


following holds: 

E\>^l(XCflR + B^P^,B](XC)]v - ear<(), (12) 

for € small enough. This contradicts equation (11) which 
shows that S must be a zero matrix, i.e. 

S = [(f? + B^P^,B)L^,C + B^Pf^Af - 0. 

Furthermore, 

+ = a:,,. (13) 

□ 

From the above main theorem, we can see that finding a 
globally optimal output feedback law reduces to the 
following procedure. 

(a) Obtain P^^ and ATj, by solving the discrete Riccati 
equation; if JH(A'([) c: iH(r^), then there exists so that 

ATm, i.e. Problem 2 is solvable, otherwise unsolvable. 

(b) 'K(/f,,) cr JH(C^) if and only if 

f^(r) e l^(A„) - f^((/? 4 B ^P^,B) 'B^P^,A) 

^NiB^P,^)=A 'f^(/?'), 

or equivalently 

/’,/tf^(C')c=I^(B'), (14) 

This means lhal if /^|/1M(C') cr ^). Problem 2 is solvable 
Otherwise unsolvable. 

As we know. P,, determined by the discrete Riccati 
equation depends on the given A, B, Q, and R. In general 
cases condition (14) can hardly be satisfied. In the particular 
case when rank(P)^m -/-rankfC), for instance, (14) is 
never satisfied. I'o show that Problem 2 is nearly unsolvable 
even if the system is stabilizable by output feedback, we 
present the following example. 

4. An example 

Consider the following di.scretc linear system: 

Jir. I - M - Wi'K, 

where 



where H U,, is the Frobenius norm of the matrix. 

Definition. Tlic optimal output feedback gam is said to be 
a derivative solution of the corresponding optimal state 
variable feedback problem if L^C = /Ci,. where A"„ is the 
optimal feedback gain for the latter problem. 

Theorem 2. The optimal output feedback gains are all 
derivative solutions of the corresponding optimal stale 
variable feedback gam. 

Proof. Let P„ and L,, Ihe optimal solutions of Problem 2, 
so that Pf, and L,, satisfy (a) and (b) of Theorem 1. 

Consider Ln(e, X) = eX, ||3f||, = 1. Then there exists 
e„>U such that Ln(c, X) e IL(A, B, C), for |r| < This is 
true because 1{A, B, C) is an open .set and L„ e L(>I, B, C). 
Therefore 

1^ 4 B{Lu + (X)CYPuIA 4 B(L„ 4 eX)C] 4 Q 

+ [(t„+ eX)CYR\(I.,,+ €X)C\- P„--i). (10) 

Using (7), (10), can be rewritten as 
f\XCf[R + «'/'„fl)(A'r)+ t||(« + B''P„B)L„C 
+ B^P„aY{XC) + (A-Q^KW + B'P,,B)L„C + arO. 

(II) 

If S = HR&B^P„B)L„C + B'P,,aY =^0. from Lemma 1, 
there cxi.sti AC 6 IK"’"” with HACH,. - I and e Jit" with 
||u||> - 1 so that 

fi'^(SACr+(,S’A'C’)']t;< -ftr, a ^0. 

Since v^UXCY {R + B'P„B](XC)]v is finite, then the 


The system is controllable and observable. The characteristic 
polynomial P(A) can be written as 

P(A) = |A/-(/\-6/c)|-A* -A4/. 

we notice that if I = ,1, the roots oJ P(A) will be within the 
unit circle. This means the system is stabilizable by output 
feedback. 

Consider the performance criterion 




Brief Paper 


539 


The optimal state variable teedback gain is 

+ [p„ p,4 

Obviously, the globally optimal output feedback problem is 
unsolvablc even though the system is stabilizable by output 
feedback. This example shows that JK(ACcf) or there 

exists no output gain L such that LC = 

5. Conclusions 

ITie fundamental theorem on the optimal output feedback 
problem with quadratic performance criterion is given. The 
theorem shows that if there exists a globally optimal output 
feedback law Lo, then it must satisfy ~ /C,,. where /C„ is 
the unique solution of the discrete Riccati equation. 
Moreover, an example to show that ihe condition (14) is 
hardly satisfied even if the system is stabilizable. These give 
the reasons for finding suboptimal laws that stabilize the 


system. As fur as output feedback problems such as 
stabilization, pole assignment, and optimal control arc 
concerned, there utc still many questions awaiting 
investigation. 
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The Safety of Process Automation* 
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Abstract —The effect of automation on process safety is not 
clear. On the one hand, automation is blamed for posing risk 
and for increasing the chance of human error in situations 
involving disturbances; on the other hand, it is admitted that 
automation enables sophisticated process control and 
handling of disturbance situations without human inter¬ 
ference. The methods of safety analysis can be applied 
during the designing stages of safe process automation. I'he 
hazard and operability study makes it possible to lake into 
account the potential process disturbances and to develop 
countermeasures for them. Action error analysis studies the 
consequences of potential human errors in task execution. 
Fault tree analysis can be u.sed to study the causes of 
potential accidents and to examine the control actions 
suitable for providing protection against them thereby 
reducing the probability of accidents. Event tree analysis is a 
method for considering the consequences of potential 
hazardous situations and for developing countermeasures to 
reduce such consequences. Failure mode and eflecl analysis 
is a method for checking that the potential failures of the 
control and automation system are not overlooked. 
Reliability assessment can be used with safety analysis 
methods to study the bottlenecks m the design and to 
prioritize ihe countermeasures whereby the risk can be 
reduced to attain an acceptable level 

Introduction 

Thlrl art methodologies and procedures available for 
analysing hazard.s in process industry, hut the safety of 
computer-controlled systems is not as well established. This 
paper examines whether the methods of conventional safety 
analysis can be applied to the safety design of a computer 
controlled system. To start from the beginning, the 
definitions for "safety” and related concepts given by various 
sources arc studied, and shown in Tabic I 

Safety describes the non-existence of risk posed by a 
system, or to the system. The basic elements of .safety arc the 
causes and the consequences of accidents. In the definitions 
of safely, the consequences are defined to be the effects on 
human life, property or the environmenl. 

According to the Shen’s definition (Shen, 1986), safely is a 
measurable concept. In any case, it is possible to say whether 
the system is in a safe state or an unsafe state. Anothei 
question is whether, or not, it is possible to define if a system 
will be safe enough throughout its whole life cycle. 

In some of the definitions safe means zero-risk (under 
defined conditions): safe enough is a condition where there is 
no possibility of an accident. In many cases this would mean 
that the intended function ol the process plant is impossible. 
For example, ihe risk posed by a chemical plant will in most 
of the cases be at its minimum when the plant is not 
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operating, but this would not be reasonable for the 
production target of the plant. A more realistic safety policy 
is risk minimization or a risk optimizalion target, under 
certain constraints. This includes the aspect of concentrating 
on the weak points of the proce.ss m order lo make them 
safer. An even more sophisticated approach would he to 
minimize thi total cost of the plant during its life-cycle. This 
lotal cost con.sists of, e g. the costs of design, start-up, 
operation and maintenance (both preventive and corrective 
maintenance), unavailability costs and costs of accidents. 

When the causes of accidents are discussed, the concepts 
of failure, fault, error and mistake are often mentioned. 
System failure is defined as ihc inability of a system to fulfil 
its operational requirements. Systematic failures cause the 
system to fail under some particular combinations of inputs 
or under some particular combinations of inputs or undei 
some particular environmental condition. Systematic failures 
could arise at any part of the safety life-cycle. Other types of 
failure arc those whose occurrence follows a stochastic 
model. Examples arc failures due to ageing ol mechanical 
components, and random failures ol electronic componenls. 
Fault and error arc defined as reasons lor failure. A mistake 
is defined as a human error or fault 

The concept ol hazard comes up Ircqiiently m accident 
research, ll describes the potential of accidents. T he concept 
of an accident or a mishap is rather vague. It lies between the 
causes and consequences, but the limit where the accident 
event begins and where il ends, i.e. which arc the causes and 
which the consequences, is not exact. 

In conclusion, the above definitions make it possible to 
derive the following' causes ol accidents can be divided into 
systematic failures and stochastic failures in systems. A 
software failure is always a systematic failure. Examples ol 
other systematic failures are, c.g. those caused by errors and 
mistakes m specification, design, construction, operation or 
maintenance. Ihc occurrence of failure does not cause the 
accident directly l ypically, a specific stale ot the process, or 
a combinalion of failures, are needed lor a hazard to occur 
and to develop In be an accidcnl (Fig. 1). A viewpoint is that 
all Ihe accidents arc caused by a human being, cither by the 
design organization which has not designed the system 1(3 be 
safe enough, or by the operational organization which has 
nol been able to handle the dislurbunee situations. Human 
aspects in accidents are studied, for example, in Rasmussen 
(1986, 19^X)), Rasmussen ('( al. (1987) and Reason (1990a,b). 

An automation system itself is rather safe. No great 
amounl of energy is bound lo it. The hazards arise in the 
proce.ss controlled by the automation. Ehe safely of a process 
plant IS based on the process design, which defines the 
number of possible unsafe states and their probability. 
However, responsibility for many safety features is allocated 
to the automation design. In a hazardous situation, the target 
of the control system is lo prevent an accident from 
happening by keeping the process in a safe state or by 
bringing the process back to a sale stale before any serious 
consequences have occurred. Sometimes il may not be 
possible lo maintain a sale state; m such case, the control 
system should minimize the consequences. Another impor¬ 
tant safety-related target of the automation design is to 
prevent hazardous situations from being caused by 
automation (Fig. 2). 
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Term 

Safety 


System failure 

Systematic failure 

Fault, error 

Mistake 

Hazard 

Accident, mishap 

Unsafe-stale 

Safe-state 


Table 1. Definitions for saffty and related c'oncepts 

Definition 


Freedom from those conditions that can cause death, injury, occupational illness, or damage to or loss 
of equipment or property (MIL, 1984; lEC, 1989a,b). 

Safety is a measure of the degree of freedom from risk in any environment (Leveson, 1986), 

Safety of a system is the probability that, when operating and/or residing under stated conditions, the 
system will not be injured significantly for a specified interval of time (Shen, 1986). 

A system failure occurs when the delivered service deviates from the specified service, where the 
service specification is an agreed description of the expected service. A failure, in short, is the 
manifc.station of an error on the sy.stem or software (lEC, 1989a,b). 

Failures due to errors (including mistakes and acts of omissions) in design, construction or use of a 
system which cause il to fail under some particular combinations of inputs or under some particular 
environmental condition (TEC, 19896). 

The cause of an error is a fault which resides, temporarily or permanently, in the system. An error is 
that part of the system stale which is liable to lead to failure (lEC, 1989a,b). 

A human error or fault. A human action (in carrying out any system life-cycle activity) that may re.sult 
in failure (lEC, 1989a,b) 

A condition that is prerequisite to a mishap (MIL, 1984). A physical situation with a potential for 
human injury, damage to property, damage to the environment of some combination of these 
(Anon., 1983). 

An unplanted event or series of events that results in death, injury, occupational illness, or damage to 
or loss of equipment or properly (MIL, 1984). 

A state that have been determined to be unacceptable in that their risk exceeds a specified threshold 
(Leveson, 1984). 

A state when the specified hazard no longer exists, or, a state of a defined system in which there is no 
danger to human life, economics or environment under certain as.sumptions and specified conditions 
(lEC, 1989a). 



Fig. 1. The accident is calised by unintended events in human activity, in the technical equipment or m the environment 


In practice, safety design consists ol identifying possible 
priM^ess disturbances and potential accidents, whereafter 
preventive measures are designed for them. Some rules of 
thumb arc used to define what is safe enough and what is 
not. In some cases, and for restricted problems, more 
.sophisticated methods and safety measures are used for 
decision-making. 




Fig. 2. The salety of the process automation is closely linked 
with the safety features of the process itself. 


When designing the process automation, the designer has 
the following option.^ for reducing the risk related to a 
specific accident (sec Fig. 3). 

Firstly, potential causes ol an accident which exist in 
automation can he eliminated or Ihcir probability minimized 
This measure includes reduction of the number of potential 
sy.stematic failures. It can be accomplished, for instance, by 
designing procedures, through management actions, by 
having skilled and experienced designers and by thorough 
testing, Stochastic failures can be reduced by using 
components of adequate quality, by ageing procedures and 
by testing procedures for stand-by components. Redundancy, 
both functional and architectural, can be employed on 
different levels, e.g on component, circuit-board, station 
and system levels. Secondly, il is possible to prevent the 
unsafe state from occurring by designing countermeasures for 
hazards by which the system can be brought back into a safe 
state. This can be done, for example, by designing alarms 
and proper operator actions for them, by designing 
automatic control actions, by designing fail-safe systems, 
interlocks and irip-sy.slems, and through building redundancy 
and stand-by systems. Thirdly, ii may be possible to control 
the unsafe slate or the consequences of the hazard, and to 
minimize them, i.e. to keep the process in a slate which, 
though perhaps unsafe, still prevents any more harm from 
occurring. This can be done, e.g. by means of safety 
protection equipment or by keeping people and materials out 
of the hazardous area. 

Accidents caused hy automation systerm 

This chapter represents some accident .studies of process 
industries, and interprets them I rum the automation point ol 
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causes 


hazard 


accident 


f 


consequences 


I 


Minimizing Control actions 
the probability to prevent 
of the causes the hazard 


Control actions and 
operator support 
to prevent the accident 


Control actions 

to restrict 

the consequences 


Fig. 3, The auloiTialion engineer can affect the probability and severity of accidents in many ways. 


view. The purpose is to get an overview of the problems and 
of the safety studies which are needed to detect the potential 
causes of mishaps before their occurrence. 

The VTT’s safety engineering laboratory has studied 31 
accidents or disturbances in the process industry. Most of 
the accidents were selected from the VARO-register, which 
is comprised of accidents which occurred in Finland. In 
addition, .some accidents reported in the literature (Kletz, 
1983; Lees, 1980) and one from the FALl^S database were 
studied. The first criterion for inclusion in the study was that 
the automation or control system had contributed to the 
accident. Another criterion was that either the process was a 
batch process or the accident took place in a sequential 
operation m a non-batch process. Such operations arc, for 
example, the start-up or shut-down of continuous processes. 

The purpose of the study on accident.s was to identify ways 
to improve the safely design of sequence logic functions. 
Owing to the assumption that discontinuous lunctions arc 
both more complex to control than continuous ones 
(Rosenof and Ghosh, 1987) and more critical to safety 
(Rasmussen, 1986), the results of the study on accidents can 
be used to gel an idea of the accident causes and, thus, 
clarify requirements for safety analysis and design of process 
automation. 

7'he accidents were classified according to the status of the 
process at the moment when the disturbance look place, the 
cause of the failure, and according to the role of an interlock 
system in the accident (Tabic 2) In most cases there were at 
least two causes. For example, if in the design ihe possibility 
of an equipment failure had not been considered, both a 
design error and an cquipmeiil failure were thought to be the 
causes of the accident. Thirteen of the cases had no interlock 
system, but after the accident it was concluded that such a 
system would have been easy to realize, feasible and would 
have prevented the accident. In three of the cases there was 
an interlock system which, however, had been overridden. 
Furthermore, in three of the cases the interlock had failed to 
function. 

Tabll 2. Thf rfsults of ihf siudy on 31 acx idfnt.s by 

VTT AND THh STUDY ON 17 INCIDI NIS BY PlTBlADf> tl AL 

(1989). Thf; in( idrnts wfrf c ausfj) by failurfs in 

AUTOMATION AN INUIDF-NI I.S HAVING MANY ( AUSF:S WIIK’II 
MAKES THf . TOTAL NUMBER OF CAUSES EXCEI D HIF NUMBFR OI 
SAMPLFS 


Slate of the unit: 


Commissioning 

1 

Start-up 

7 

Shut-down 

0 

Maintenance 

5 

Normal operation 

18 

Cause of the accident; 

Design error 

23 

Installation error 

1 

Operator error 

11 

Equipment failure 

12 

Role of the interlock during the accident 

Did not exist 

13 

Was overridden 

3 

Failed to function 

3 


One result of the study was the conclusion lhal possible 
equipment failures and their consequences should be 
considered systematically when the control or automation 
system is being designed. In this way the control system will 
be able to handle the situation in case of failures in process 
or control system equipment. Correspondingly, the pos¬ 
sibility of operator errors should be taken into account. 
Especially human errors in disturbance and failure situations 
should be studied. The trips and interlocks should be 
developed so that they do not disturb the operation of the 
unit in any situation which could be considered to be normal 
and safe. When a solution has been found that is acceptable 
with regard to both availability and salety. operators need 
not disable the protective function. 

Pitblado et ai (1989) studied 17 hazards which occurred 
over a four year period in computer controlled plants in The 
Netherlands. The hazards were related to the computer 
system or to human interaction. Hardware failure was 
involved in nine of the hazards Software failure was 
recognized in three of the hazards, one failure in system 
software and two in site software implementation. Human 
errors were classified according to the life-cycle of the 
process. Five design errors were found, two installation 
errors, one in the commissioning and testing phase, 10 in 
operations and two in maintenance. Lhe most common type 
of error (accounting for 10 errors in all) was that due to 
inadequate, insufficient or incorrect information supplied to 
the person(s) involved. Recognition failure, despite an 
adequate supply of information, took place in the ca.se of two 
hazards. Failure to follow procedures correctly was another 
type of error. Eight such errors were involved in the hazards. 

Several other studies have investigated accident causes in 
process industry in general, c.g. the works of Rasmussen 
(1986) and Pipatii (1989). In the following a brief overview is 
given about what they found with respect to the automation 
or control system (Tabic 3), Rasmussen studied the accidents 
which involved hazardous chemical reactions. The study was 
comprised of 190 accidents, and the factors which had caused 
the accidents were classified on many levels. Tlie causes 
related to automation or control were as follows: poor 
instrumentation was identified in 19% of the incidents; 
operator error was likewise involved in 19% of the hazards. 
Poor instrumentation included inadequate alarms, interlock 
systems, control systems, etc. Such incidents may have 
resulted from a variety of reasons: some failure of 
equipment, incorrect or unsatisfactory design of the control 
or protective system, inadequate inspection, or some human 
error by the operator. According to Rasmu.ssen, poor 
instrumentation, insufficient design and operator error were 
noted very often in combination with batch operations. 

In the accident study of Pipatti (1989), failures in 
instrumentation were present in about 5-15% of all accidents 
(depending on the data sources). Human errors (about 
17-26% of all the accidents) had been caused mainly by 
negligence or ignorance. For example, notification of 
changes which had been made in the proce.ss was not given to 
everyone who needed that information. Typically, the human 
errors were made during the start-up of the process. In some 
cases, the consequences of the human errors might have 
been smaller had the degree of automation been higher. 

Tlie accident reports and statistics reveal that the 
automation or control system as a cause of accidents has not 
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Table 3. Summary of the ac c ident siudies ok Rasmussen and Pipati i 


Reference Sample Automation related 

causes of the 
accidents. 





Poor 

instrument¬ 

ation. 

Human 

error. 

Rasmussen 

190 accidents which 


19% 

19% (only operator 

(1986) 

Pipatti (1989) 

involved hazardous 
chemical reactions. 

84 accidents which 


5%. 

errors) 

26%; 

happened in Finland. 
184 accidents which 


15%. 

17% 


happened outside 
Finland. 


been studied in detail. It is hard to find statistics on the effect 
of automation failures or automation deficiencies on the 
availability or safety of process plants. One reason for this 
lack may be that accident reports do not consider automation 
as a "unit” itself, but rather a part of the procc.ss unit 
controlled by the automation. 

On the basis of the accident studies mentioned, safety 
design of process automation should take the following 
aspects into account. Firstly, on the basis of Rasmussen and 
Pipatti, the co-operation between automation and human 
operator is important to avoid human errors during 
operation, especially in case of discontinuous process 
functions. vSccondly, the occurrence of equipment failures as 
causes of accidents bring up the requirement that potential 


failures, both in measurement and control equipment, and in 
process equipment, should be studied, and the design should 
prepare lor them, an equipment failure of the system should 
not lead to an accident. In the same way, potential human 
errors should be recognized and prepared for. A key feature 
here is the quality of the process information provided to the 
operator. 

Design procedures for a safe automation system 

Roughly, the safety design should involve the following 
steps. 

Firstly, the elicitation of requirements is performed. The 
safety targets of the automation should be derived from the 
safety features of the process, and the requircmcni elicitation 


Tabi I 4. Saf i iy anai ysis mi thuds s( opf and princ iim f 


Analysis Purpose, scope Principle References 

method 


Hazard and 
operabiliiy study 
(HazOP) 


Action error analysis 
(AEA) 


Fault tree analyses 
(FTA) 


Event tree analysis 
(ETA) 


Failure mode and 
effect analysis 
(FMEA) 


Reliability 

assessment 


HazOp IS widely used for hazard 
identification in process industry 
in order to discover potential 
hazards and operability problems. 


AEA considers the operational, 
maintenance, control and 
supervision actions performed by 
a human being The pi^lential 
mistakes in individual actions arc 
studied. 

FI A models the cause sequence 
leading to the I'OP-cvcnt. FI A 
can be used as a quantitative 
method. 

ETA models the potential 
consequence sequences of a 
hazardous situation or event. El A 
can be used as a quantitative 
method. 

In FMEA the possible failures of ihe 
system components or subsystems 
and the consequences arc analy.sed 
systematically. FMEA is 
commonly used for mechanical, 
electrical and electronic 
components. 

Reliability assessment means 
quantitative studies on potential 
compiinent and equipment 
failures, their causes and 
consequences. 


HazOp studies the potenlial 
deviations Irom the intended 
operation conditions. The studies 
are carried out by a 
multidisciplinary teani, and key 
words are used to guide the 
analysis. 

A short checklist is used. The effects 
of each potential mistake on safety 
and on system performance, 
recognition of the mistake, and 
potential countermeasures are 
planned 

The causes arc modelled backwards, 
and the probability of the POP- 
cvenl is assessed on the basis of 
this model and reliability figures of 
the system components. 

ETA reasons forwards starting from 
the hazardous situation, and the 
potential consequences are 
modelled. 

The components of the system and 
their failures and failure modes 
arc listed on a tabular sheet. 
Checklists can be used to support 
the analysis. 


Block diagram or FTA is used as a 
basis. The most important 
measures in reliability assessment 
are failure rale and lime concepts. 


Anon. {\^77) 

Lees {mO) 

Klctz (l9H3a) 
Nimmo el al. (IMK?) 
Pitbladn (198^) 
Wells (1980) 

Taylor(1981) 


lEC 1025 (1990) 
Wells (1980) 
Lees(1980) 
Anon, (1989) 
Malasky (1982) 
Wells (1980) 
Anon. (1989) 
Suokas(1988) 


Wells (1980) 
Malasky (1982) 
lEC 812 (1985) 
Uhtcia (1990) 


MIL 217 (1986) 
Lccs(1980) 
Anon. (1989) 
Dhillon and 
^Singh (1981) 
O’Connor(1981) 
Toola (1988) 
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should be based on studies of the safety aspects of the 
process. The questions of “safe enough’' and acceptable risk 
must be answered. The safety requirements may be either 
qualitative or quantitative. Safety may be measured in terms 
of the probability and severity of accidents, or it may be 
agreed to be achieved by using certain standards and design 
practices. In specifying the requirements, it should be 
remembered to specify them in a way that makes verification 
possible. 

Secondly, the design of the automation takes place. 
During this planning the safety aspects arc designed. It is 
much easier to design them in the early phase than to make 
extensive modifications to the design later (Watson, 1989). 


Finally, the safety of the automation is verified. The 
verification includes both the functional and non-functional 
features. The verification methods depend on how the 
specification is expressed. 

The design of safely critical control systems has been 
illustrated and guidance given, e.g. in Anon. (1986, 1987a, b, 
c) and Bloomfield and Brazendalc (1990). 

This chapter studies some safety analyvSis methods in 
automation design. The analyses are introduced in Table 4 
where their scope and principle are shown. References for 
further information are also given. Table 5 introduces the 
methods from an automation design point of view: the 
advantages of the methods and their usefulness in control 


Table 5. Safety analysis methods, advani ages. restrictions and deficiencies 


Analysis Advantages in 

method automation design 


Hazard and operability 
study (HazOp) 


Action error 
analysis 
(AEA) 


Fault tree 
analysis (FTA) 


Event tree analysis 
(ETA) 


Failure mode and effect 
analysis (FMEA) 


Reliability 

assessment 


The outcome is a list of action, e.g. design changes 
for consideration, cases identified for more 
detailed study, etc. The contribution for 
automation de.sign are the countermeasures 
related to the instrumentation or control 
actions, e.g. interlocks, alarms, measurements, 
trips or redundancy. On the other hand, a 
HazOp session is a way to enhance the 
information exchange between the automation 
designers, process designers and operational 
personnel 

AEA gives the automation designer proposals for 
instrumentation to help the personnel to 
monitor the process during the operation 
sequences and lo prevent the operators from 
making mistakes. 

In Ff A, the needs for control or protective 
actions to diminish the risk can be seen, and the 
effects of different control and protective actions 
on safety can be quantified (Hill, 1988). FTA 
also makes it possible lo verify those 
requirements, which arc expressed as 
quantitative risk values. An advantage of FI’A 
is that it can handle multiple failures. Software 
fault tree analysis (wSFT'A) attempts to verify 
that the program will not in any environment 
allow a particular unsafe output to occur. 

Tbc ETA makes it possible lo analy.se sysiems 
where the chronology of the events is stable and 
the events are independent. For instance, 
protective actions performed by automation arc 
often organized in such a chronology and are 
indepK'ndent of each other. 

FMEA studies the potential failures of automation 
system components and their effects on the 
function of system. It is also possible to ensure 
that potential failures will be taken into 
account, and countermeasures designed for the 
fKJlential process disturbances that they would 
cause. An advantage of FMEA is that it is 
simple and easy to learn. 

One purpose of reliability assessment is to check, 
before the system is commissioned, that it is 
reliable enough. The weak points of the system 
are identified, and the efforts can be made to 
improve them or to diminish the consequences 
of them. Furthermore, the need for spare parts 
can be forecast on the basis of the failure rale 
figures. Reliability assessment is used in 
connection with safety analy.sis methods to 
judge the conformance with the quantitative 
safely requirements. 


Restrictions and 
deficiencies 
in automation 
design 

HazOp is often a time-consuming and thus 
expensive method owing to the systematic 
procedure of the method and the number of 
people involved. HazOp is most often applied lo 
continuous processes or process situations. Only 
minor emphasis is given to the non-conlinuous 
processes and procc.ss situations like start-up, 
maintenance. 


AEA is an analysis of the technical system, and 
does not analyse the operator behaviour. The 
thoughts and intents of the human being, i.e. the 
reasons for mistakes, are not examined. 

The major risks have to be well known before 
FTA, for instance, on the basis of a PHA, and 
specialists arc needed for the quantitative 
analyses. A basic assumption of FFA is that all 
failures in a system arc binary in nature. I'hat is, 
a component or human being either performs 
successfully or fails completely. Similarly, the 
failures arc assumed to be instantaneous. In 
addition, the system is assumed lo be capable of 
performing its task if all sub-components are 
working.t 

A limitation of El A is that the model consists of 
the intended actions, and the effects of the 
failures lo conclude the intended actions arc 
modelled. No attention is paid lo the possible 
extra actions, or incomplete actions, including 
those taken too early or too late. 

Like most of the safety analysis methods, FMEA 
docs not study multiple, simultaneous failures 
without tremendous increase of required labour 
for studying all the different failure 
combinations. 

When failures in automation arc analysed, 
there should be a systematic way to take account 
of all process situations. This might be difficult.t 

Quantitative reliability analyses require specialized 
reliability engineers and are quite laborious. An 
uncertainty is to find the data, i.c. the failure 
rate figures, which are the basis of quantitative 
reliability analyses.t 


t FTA, FMEA and reliability assessment are possible to be done both on the controlled system and on the automation system. 
When analysing the controlled systems, analyses define requirements for the automation; when analysing the automation the 
requirements arc verified. 
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design, as well as some of the restrictions and deficiencies are 
listed. The analyses which arc covered are hazard and 
operability study (HazOp), action error analysis (AEA), 
fault tree analysis (FTA), event tree analysis (ETA), failure 
mode and effect analysis (FMEA) and reliability assessment. 

Different safety analysis methods cover different aspects of 
automation design. According to accident studies, dn 
important feature in automation safety is the information on 
process states given to the operator, but safety analysis 
methods do not consider this aspect explicitly. Only AEA 
has a comment on it; asking by what way the operator 
notices his or her mistakes. AEA is an analysis of the 
technical system, rather than that of operator behaviour. If a 
more thorough understanding of action errors is needed, 
other types of studies are required (Rasmussen, 1986, 1990; 
Rasmussen et ai, 1987; Reason, 19iK)a,b). However, safety 
analyses provide information on the process disturbances 
which might lead to unsafe states, and, in this way the 
analyses can be used as an aid when designing operator 
interface. Direct use of the results of safety analyses which 
are made during the design phases for operator support have 
been studied, e.g. in Suokas et ai (1989). 

Another major deficiency of the safety analysis methods is 
that most of them have been developed for studying 
continuous process functions. An exception is AEA which 
studies changing priKCss conditions. However, it is limited to 
those functions, where human interfere is needed. Another 
exception is a modified HazOp which is introduced for the 
non-continuous proce.ss functions (Anon., 1977). However, it 
is not used very widely. 

Most of the safety analysis methods study hardware 
aspects. Some attempts to apply safety analysis to software 
exists. Software fault tree analysis (SFTA) is an extension of 
fault tree where the TOP event is a critical software fault 
determined, e.g. by the .system fault tree; and the software 
is studied backwards through the program to the software 
inputs That is, SFTA attempts to prove that the program 
will not in any environment allow a particular unsafe output 
to be occurred, (I^vcson and Harvey, 1983; Cha ei al., 
1988). The concept of software reliability differs to some 
extent from the reliability of hardware. This is mainly 
because of the different nature of failure or fault. There are a 
number of views as to how software reliability should be 
quantified. A commonly used approach is to use an analytical 
model whose parameters are generally estimated from 
available data on .software failures (Goel, 1985; Dale, 1991; 
Laprie and Littlewood, 1991). The mca.sure of stiftware 
reliability is suppo.sed to be useful in planning and controlling 
resources during the development process so that high 
quality software can be developed. It is also supposed to be 


useful as a measure for giving confidence in software 
correctness. However, it is not used in software production 
in industries. The concept of software reliability, and the 
reliability modci.s are also discussed in Rook (1990) and 
Musa et al. (1987). 

Most of the safety analyses study the possibility of 
stochastic failures. Because software failures are always 
systematic, SFTA and software reliability assessment can be 
considered to cover systematic failures. However, other 
methods, like HazOp, FMEA and FTA reveal deficiencies in 
the system, e.g. needs for redundancy, checks, alarms; and 
these deficiencies are often some kind of systematic faults. 

A drawback of most of the methods is that multiple 
failures are not studied systematically. Especially, simul¬ 
taneous feailurcs of different type, such as a human error in 
connection with a component failure, arc difficult to study 
with a single method. However, human errors, especially, 
are more probable during abnormal situations than during 
the normal course of actions. Methods which include a more 
formal modelling of the hazards may reveal these multiple 
failure situations. Such methods for hazard identification and 
modelling are FI'A, ETA and reliability assessment which 
can, in principle, also include modelling of human mistakes 
and omissions. However, none of the safety analysis methods 
mentioned here systematically study human aspects. Aspects 
which arc covered by different safety analysis methods are 
shown in Table 6. 

It is obvious that no safety analysis method covers all 
aspects of safety design. Each method has its own target and 
is applicable to specific problems, though in principle, it is 
possible to find a combination of methods which is optimal 
for each design problem. 

Conclusions 

The safely and availability performance ot process 
automation affects the flexibility and profitability of 
production. Accidents and injuries not only cau.se economic 
losses, they may cause human suffering and environmental 
damage. To judge whether the design is safe enough or 
.should be improved, the designer must have a concept of 
what kind of safety and availability performance is required. 

Automation designers do not get data on the safety 
requirements for automation from the process engineers in a 
form which enables direct verification. The automation 
engineer has to analy.se the hazards associated with the 
process, safety analysis methods can be used here. I'hc 
hazard and operability study is one of the most common 
methods applied to analyse the safety of process engineering 
Other methods adaptable to automation engineering, and 
which could be used in co-operation with process engineers, 


Table 6. The outlined safety analysis methods cover differen'i safety RELATiiD aspects of 

AUrOMATlON DESKiN 




HazOp 
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FTA 

ETA 

FMEA 

Reliability 
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Identification 


■ 
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of accidents. 
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■ 
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of the consequences 








of failures. 
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■ 
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■ 

Software failures. 




■ 
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Quantitative aspects. 




■ 

■ 

0 
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■ 
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■ 

O 
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are action error analysis, fault tree analysis and event tree 
analysis. The reliability and safety features of the automation 
or control system can be further studied by means of failure 
mode and effect analysis. Quantitative reliability assessment 
can be combined with other methods to gel comparative 
values for the hazards. 

This paper has discu.s.scd safety of automation and the use 
of safety analysis methods in automation design. The safety 
of computer controlled processes includes the question of 
software safety. Fault tree analysis and reliability assessment 
have modifications to analyse software. Even if the other 
methods which are mentioned here do not study software 
safety explicitly, they support the application software 
development process by providing methods for requirement 
specification. According to accident studies, an important 
feature in automation safety is the information on process 
stales given to the operator. I he safety analysis methods do 
not consider this aspect explicitly. However, safety analyses 
provide information on the process disturbances which might 
lead l() unsafe states, and, thus the analyses can be used as a 
standpoint when designing operator interface. Another 
major deficiency of the safety analysis methods is that most 
of them have been developed for studying continuous 
process (unctions. Furthermore, a drawback of some of the 
methods is that multiple failures arc not studied systemati¬ 
cally. Especially simultaneous failures of different type, e g. 
a human error in connection with a component failure, are 
difficult to study with a single methodology. Only methods 
which include a more formal modelling of the hazards will 
reveal these multiple failure situations. However, even if 
safely analyses do not guarantee that all potential hazards 
will be revealed and countermeasures planned, they provide 
a practical way fur automation engineers to discu.ss 
systematically with process engineers and operators the 
intended and unintended functions and stales ol the process. 
Thus, safely analyses enhance information transfer between 
process engineers, plant operators and automation engineers. 
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Abstmcl —Controller design is considered for system 
specifications which are not handled naturally by analytical 
methods. Using fuzzy sets and related theory, system 
specifications are translated into preference functions which 
are readily combined with search methods to determine 
adequate controller parameters. I'his contribution integrates 
the discussion of the theory and its step by-slep application 
to aircraft control during the flare-out phase of landing. 


controller parameters to be determined. Section 4 is devoted 
to the selection of preference functions which truly 
incorporate the control specifications. In Section 5 the 
preference function optimization problem over the controller 
parameter space is treated. Preference function optimization 
yields adequate parameter values which will satisfy the 
specifications. Results for the case .study problem and final 
comments are found in Sections 6 and 7, respectively. 


1. Introduction 

In many I'NGiNhi-RiNfi application.s controller dc.sign, tuning 
and adjustment have been necessary to guarantee the 
attainment of a variety of application specific performance 
and stability requirements. 

The control engineer, in general, is able to formulate 
control specifications informally, u.sing some sentences and, 
eventually, some sketches, Nevcrlhcle.ss a methodic ap¬ 
proach to the design, adjustment or tuning problem itself 
may be intricate, depending on these specification.s. As long 
as quadriilic performance indices or eigenvalue specifications 
are irscd, for example, controller design is almost 
straightforward. Often, however, such specifications do not 
arise naturally and have to be “designed” by themselves to 
be hopefully equivalent to the original specifications. 

In this context it is highly desirable to have a tool for 
generating a preference function which translates the 
engineers’ “informal” specifications into mathematical 
language. Such preference function should allow' for 
quantitative control system quality evaluation and be readily 
combinable with search methods to generate an adequately 
tuned or adjusted controller. 

The main concern of this contribution is to show how fuzzy 
sets and the related theory may be used to translate relatively 
informal controller dc.sign specifications into a preference 
function, which is then used lo determine adequate 
controller parameter values using an optimization algorithm 
such as that due to Nclder-Mead (see references and 
comments by Himmelblau, 1972). Thus fuzzy sets and the 
related theory arc used as a design decision tool. In the fuzzy 
set literature several decision applications were reported, 
such as those by Zimmermann cl al. (1984). However, a 
specific application lo control engineering is not known to 
this author. 

For better comprehension and illustration ol the simplicity 
and potential i>f the proposal, this contribution integrates ^he 
discussion of the theory and its step-by-step application to a 
well-known, nontrivial aircraft landing sy.stem design 
problem (Ellerl and Merriam, 1963). 

The contribution is set up as follows. In Section 2 the case 
study problem is described. Section 3 presents the chosen 
controller structure. This choice defines the number of 
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2. Aircraft landing problem definition 

This section closely follows Ellert and Merriam (1963) and 
Tou (1964). 

This ca,se study is concerned with the final phase of aircraft 
landing, also called the flarc-out phase. In this phase the 
aircraft must be guided along the desired flare-path from a 
given altitude until it touches the runway. It is assumed that 
the aircraft is guided to the proper initial conditions for 
flarc-out begin, which in this example range from KO to 120 0 
altitude, and 16-24 0 sec ’ descent rale. Variation of 
descent rate is zero al flarc-out begin. The aircraft is waved 
off for value,s outside these ranges. Al flare-out begin in any 
case the angle of attack value is sup|Hiscd to be 12.6“, which 
is 70% of its stall value. Through adequate control (not 
discussed herein) aircraft velocity is maintained constant 
equal to 256 ft sec ’ during flarc-out. Only longitudinal 
motion is considered. 

The linearized slate equations for the initial angle of 
attack, adopted henceforth as the nominal value, read; 

[de/dr 

dx/dl = Ax Hu, V - Cx, 

dh/dt 

L h . 

where x is the slate vector, y is the output and measurement 
vector, d6/dt is the pitch rate (rad sec '), is variation in 
pitch relatively lo the nominal value (rad), dh/di is rate of 
ascent (ft sec ') and h is the altitude (ft). System input u is 
the elevator deflection (rad). System matrices are given as. 

-0.6 -0.76 2.96875 x10 ' o" 

1 0 0 0 

0 102.4 -0.4 0 

0 0 1 0 _ 

"-2.375"! fl 0 0 0 

0 0 1 0 0 

0 ’ 0 0 10 ' 0 
^ 0 J Lu 0 0 10 ' 



In addition lo the stale variables, the angle of attack rx is of 
primary importance since for a - 18“ the aircraft reaches stall 
conditions its geometric relation to the pilch angle and the 
rate of a.scent is well known (see Etkin, 1982); 


a - 0 - sin 


, dh/dt 
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The following specifications and control value limitations are 
given for the flare-out phase of landing. 

(a) For reasons of safety and passenger comfort, the desired 
altitude ot the aircraft during flare-out is 

flOOe OsfslS 
i 20-1 15srs20' 

Thus the desired duration of the flare-out is 20 sec, including 
5 sec over the runway. 

(b) As a consequence of (a) the desired rate of ascent is the 
time derivative of h^(t). 

(c) The aircraft must touch the runway with a slightly 
negative rate of ascent to ensure proper landing. 

(d) During landing the angle of attack must remain below 
the stall value. The aircraft enters the flare-out phase in 
equilibrium with an angle of attack 0.7 times this value. 

(e) The pilch angle 0(T) at real touchdown time T (not 
desired touchdown time!) must lie between 0” and 15° to 
prevent either the nose or the aircraft tail from touching the 
runway first. 

(f) The elevator, which is the only actuator for this problem, 
has its mechanical stops at -35° and 15°. 

3. Selecting the control structure 

To satisfy requisites (a)-(e) under restriction (f), all the 
foregoing section, a linear state feedback controller structure 
is adopted. The choice of the controller structure depends on 
control engineering considerations which primarily involve 
realization simplicity and measure availability. In the case of 
this aircraft landing problem, measures of all state variables 
are available, which suggests the use of static state feedback. 
The use of two mutually reinforcing references h^^(t) and 
dhjdt is natural, .since the controlled system is expected to 
track them. Thus the control function becomes; 

u = - K 2 X 2 - - dhjdt) - 

where X,, K 2 , and arc the controller parameters. In 
the light of these considerations, the controller design 
process reduces to finding proper values for these four 
parameters. 

At this point it should be noticed that the design 
specifications will never be satisfied exactly because of the 
most likely existence of non-nominal permissible initial 
conditions of altitude and rate of ascent. Hence the 
formulation of the problem is uncertain in nature and luzzy 
set theory will provide the best means o! formulating 
mathematically which controller performs “best” in the most 
adequate sense. 

4. Generating a preference function 

Design specifications were formulated in a quite complete 
fashion in Section 2. In this section the specifications arc- 
translated into a preference function, a function that allows 
for the ranking of sets of parameter values as well as for the 
determination of the acceptability or not of a parameter 
value set. 

For the sake of generality it is assumed that the desired 
prop>erties of a successfully controlled system are formulated 
linguistically in terms of n linguistic variables A',, . . , X„. 
(see Kandel, 1986) as follows 

HX^ is A,,) AND (X. is A.,) AND . . . AND 

isA„,)lOR 

{{X, is A,2) AND (X. is A^J AND. . . AND 

(X„ isA„,)]OR 

[(A', is A,AND {Xj isA,,J AND. . AND 


( 1 ) 

The A,j are linguistic values characterized as fuzzy sets on the 
support set of X,. For the .sample problem a description of 
de.sirable landing behavior could read (the choice is not 


unique): 

(Pitch angle at touchdown is positive less than 15°) 

AND (Maximum angle of attack is below stall value) 

AND (Error in h(t) is low) 

AND (Error in dhjdt is low) 

AND (Time at touchdown is about 20sec) 

AND (Rate of ascent at touchdown is slightly negative) 

AND (Maximum rate of ascent is negative or about zero) 

How was this sentence generated from specifications (a)-(e)? 
As a general characterization of an acceptable landing 
performance in the light of the specifications. ' Pitch angle at 
touchdown is positive less than 15“” and "Maximum angle of 
attack is below stall value" are specifications (d) and (e), 
re. 5 pectively. The terms "Error in h(t) is low" and "Error in 
dh jdt is low" account for the requirements that altitude and 
rate of ascent should remain reasonably clo.se to those 
specified in (a) and (b). "Time at touchdown is about 20 
seconds'" enforces that touchdown should really r>ccur near 
the time .specified in (a). This is necessary to hinder the 
aircraft either from touching the ground before reaching the 
runway or from overrunning it. "Rate of ascent at touchdown 
is slightly negative" is specification (c). "Maximum rate of 
ascent is negative or about zero" simply points out that, in 
order to correct initial conditions and follow the flare-path, 
the aircraft should not ascend in a perceptible manner. The 
rea.son for this is passenger comfort, which was already 
implicitly stated in the formulation of the desired flare-path 
h^(t). Restriction (I) on elevator excursion is not included in 
the de.scription of landing behavior since it is supposed that 
the elevator deflection commanded by the controller will not 
reach saturation values. This supposition is confirmed later in 
Section 6. 

In the next step the linguistic formulation (1) of the 
desirable situation has to be translated into a preference 
function. As mentioned before, a preference function allows 
for the ranking of different sets of controller parameters; the 
set which produces system behavior with the highest 
preference function value being regarded as the most 
satisfactory one. Thus the membership function ol the set of 
desirable (i.e. satisfactory) controlled .systems would lend 
itself as preference function. To derive this membership 
function, recall that in the specific case of control systems, 
the linguistic values taken by X, in (1) generally will be 
singletons, which means that only one point x, of the support 
set will have membership 1 whereas at all other points 
membership will assume its minimum value. Hence the 
membership of a certain controlled .system in the .set of 
desirable (i e satisfactory) systems is the membership 
function induced by (1), which, according to general 
definitions of fuzzy logic (Kandel, 1986), is: 

H(x, .■*„) = max min (2) 

where x, is the actual scalar value on the support set of A",. 

There obviously remains the choice of the fuzzy sets which 
characterize the A,^ For computational efficiency the 
membership functions of these sets should be chosen as 
simple as possible, for example as piecewise continuous 
functions. 

In the case of the airplane landing system the X, in (1) arc 
defined as follows: 

Time of touchdown - T such that h{T) ^ 0 

Error in fi(i) = max |/irf(() - /i(r)| 

/e|0. / ] 

Error in dhjdt = max \dhjdt ~ dhjdt] 

f^\il 7 I 

Descent rate at touchdown ~ {dhjdt) (T) 

Maximum rate of ascent == max dhjdt 

T] 

Pitch angle at touchdown - H(T) 

Maximum angle of attack - max erfr) 

/fell). T\ 
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Lime al (ou hdown [s] 
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rale or aaieiit al inucliilosn 
(0 


B 

B 


0 


I 


\ 


mnximum rale of aiLsiil *fl/«] 

(g) 

Fig 1 Membership functions for the aircraft landing problem (a) Angles positive less than 15“ (b) Angles 
of attack below stall value (c) Low altitude error (d) Low rale of ascent error (e) Touchdown at about 
20jec (f) Slightly negative rales of ascent at touchdown (g) Negative or about zero maximum rates of 

ascent 
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1 ABi E 2 Summary oh results 


Table! Initial conditions 


dfl/df 

A6I 

d/i/d( 
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0 

-0 078! 

-20 
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-24 

80 



so 





20 - - - 

0 10 


2ft 


bine [a] 

(a) 


elpvalD^dcnprliiin [dpfrpga] 
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14 
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12 *-- 
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10 


Order 

Initial 

Determined 

Preference 

Number of 

No 

guess 

parameters 

function 

Iterations 

1 

0 

-0 4311 

0 2837 

34 


-1 

-1 0304 




-2 

-2 4636 




-1 

-1 2422 



2 

0 

-0 1171 

0 1519 

25 


-0 s 

-0 6675 




-1 5 

-1 7064 




-1 

- 0 7630 



3 

-0 7158 

- 1 mi 

0 3333 

28 


-0 7663 

-1 1194 




-3 S85S 

-7 6266 




-0 8224 

-1 8916 
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of KgcDtil ffl/s] 
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(d) 


limr [a] 

(c) 

FiCi Landing pertormance for parameter ^ct 1 


The membership functions for the A, (low, ilightly negative, 
etc ) are given in Fig 1 Membership value 1 corresponds to 
fully satisfactory performance with respect to X, Negative 
membership characterizes completely unacceptable situa¬ 
tions Membership m the range [0, 1] is therefore acceptable 
and membership 1 is ideal Even the casual reader is 
generally used to membership values in the interval [0, 1] 
only Flowever, it is easily seen that for our purpose the 
exact membership function image is not relevant, as long as 
It clearly expresses realistic membership ranking Those who 
dislike negative membership values may map the proposed 


function onto the interval [0, 1] and continue thereafter to 
achieve the same results, with an obvious increase in 
computational expense 

For the landing pioblem, controlled system specifications 
are given independently of initial conditions, which may vary 
within the given ranges But system behavior strongly 
depends on the initial conditions Thus for performance 
evaluation purposes the nominal and the two worst case 
initial conditions are considered The preference function is 
evaluated for those three cases, and the worst value (the 
lowest) then taken This procedure is adequate since the two 
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worst case initial conditions are easily identified as those 
corresponding to the situation in which the aircraft is 20% 
above or below the desired flare-path at initial time and 
diverges at the largest permissible rate. The three initial 
conditions are given in Table 1. 

5. Comments on the optimization process 

The controller selection procedure consists of finding a set 
of controller parameters which gives a good, possibly the 
best, preference function value. Starting at an initial guess, a 
meth(Klic optimization procedure is used to reach a local 
optimum. Such optimization procedure should not rely on 
gradient values. The chosen algorithm should be started at 
several different initial guesses to ensure that a good value of 
the performance function is reached. 

The Ncldcr-Mead procedure was adopted to determine 
the controller parameters for the aircraft landing system. At 
each step system performance was obtained through 
simulation for the three initial conditions of Table 1. Based 
on these results, the preference function was then evaluated 
as described in Section 4. 

6. Results 

The Nelder-Mead algorithm was started at three initial 
guesses. These initial gue.sses were determined from 
root-locus and pole-placement considerations in order to 
ensure at least system stability. For all three guesses the 
algorithm located satisfactory controller parameters The 
initial guesses, determined parameters, UKally optimal values 
of the preference function and the number of iterations 
needed by the algorithm to reach the solution are found in 
Table 2. 

Figures 2 and 3 show landing performance for the initial 
conditions of Table I and the calculated parameter sets 2 and 
3 of Table 2. Solid curves depict landing performance lor the 
first initial condition of Table 1, dashed curves depict it for 
the second and dot-dashed curve.s for the third. Dotted 
curves are desired performance. Although the system 
performances in Figs 2 and 3 are .similar, the controller gams 
differ considerably, which clearly influences elevator 


deflection. Nevertheless it is seen that restriction (f) (Section 
2) is not critical. 

As indicated by the performance function value, 
parameter set 3 really is the be.st since it keeps the aircraft 
the furthest from stall. 

It should be noted that the solution for the landing 
problem in this contribution is considerably simpler than that 
determined by Ellcrt and Merriam (1963) and Tou (1964) 
and yields better (qualitative) performance. 

7. Conclusions 

It was shown how fuzzy sets and related concepts may help 
to solve controller design problems via optimization. The 
illustrated approach is particularly useful when design 
specifications do nut relate directly to quadratic performance 
indices, system eigenvalues, singular values or other system 
parameters manipulated by analytic design methods. 

If in a particular application certainty factors arc involved 
and/or the X, in (1), (Section 4) take values which arc not 
singletons, the approach of this contribution needs 
refinements and extension using the concept of plausible 
approximate reasoning (see Kienilz, 1990). 
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MIMO Linear Systems Under Decentralized 

Control* 
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Abstract—This note deseribes an interaction measure which 
enables the design of decentralized control for 2x2 linear 
systems, A sufficient condition is presented which accounts 
for phase information and therefore provides a less 
conservative approach than those based on the Small Gain 
Theorem. A method relying on this new measure permits 
tighter, independent design of the two controllers, which also 
allows performance tradc-olT between the two loops to be 
accounted for. 


1. Introduction 

Dect.ntralizcd ctjNTRGi is probably the most commonly 
implemented scheme in MIMO systems, because of its 
relative simplicity and because it is potentially robust to sensor 
and control actuator failure The synthesis of such a scheme 
involves the solution of two problems: (a) the selection of 
control pairing; (b) the tuning of the feedback controllers. 
Conditions lor Decentralized Integral Controllability can be 
utilized to select adequate pairing (Morari and Zafiriou, 
1989), The issue of controller tuning recognizes that any 
interactions present in the process may cause the 
performance of the individual control loops to deteriorate. A 
sufficient criterion for the system stability can be based on 
the Rijnsdorp Interaction Measure (RIM) (Rijnsdorp, 1965). 
This convenient method permits the independent tuning of 
the controllers, where the tuning parameters arc constrained 
by a scalar function whose magnitude depends on the relative 
level of interaction in the process. One of the sources of 
conservativeness of the RIM test is due to the fact that it 
takes into an account only magnitudes of the system 
elements. Some other interaction measures arc described and 
compared by Grosdidier and Moiari (1986). Most of them 
also use only magnitudes of the matrix elements. For 2x2 
control systems we can make the stability test tighter by 
adding phase information. 


2. Description of the method 

The decentralized control structure illustrated in Fig. 1 can 
be transformed to the SISO closed loop system given in Fig. 
2, by simple block diagram manipulation, where: 


h,{s)- 


1 +P„(.v)c,(.v) ’ 


1 = 1,2, 


(I) 


arc the closed loop 
SISO loops and 


iransfei functions of the corresponding 
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FKi. 1. The decentralized control structure for a 2x2 
system. 



Fig. 2 Transformation of the structure in Fig. 1. 


Applying the Nyquisi test to this problem, we note that 
stability can be guaranteed for the positive feedback implied 
in Fig. 2, so long as the total openloop transfer function does 
not encircle the (1,0) point in the complex plane as 
frequency goes from zero to inlinity. Thus, a necessary and 
sufficient condition of the system’s stability can be stated as 
follows. If k(s) and both of the closed loops hfs) are stable, 
the closed loop MIMO control system is stable if and only if 
the plot 


W(ia}) = h 2 {i(v)K(i(o)h^{i(jj), (2) 

does not encircle (1,0) point on the complex plane. 
Unfortunately, this test does not enable independent tuning 
of the controllers because W((i;) depends on both of the 
controllers c,(a’) and C 2 (-y)- possible to make the criterion 
less tight (only sufficient) but comprising of two independent 
parts, each of which depends only on one of the controllers. 

Theorem 1. The Nyquist stability lest (2) can be transformed 
into three simultaneous necessary and sufficient conditions. 
Closed loop stability is guaranteed if the phase of IV(iui), 
(p(W), is bounded by. 

0 < 0(W(itu)) < 2jr. for (.1) 

the magnitude of W(/ra) is bounded by: 

|W(iuj)|<l, for aj>ai 2 . (4) 

and the critical frequencies are such that: (o* ui*. (5) 
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Proof. Condition (3) eliminates the possible intersection of 
the arc of the Nyquist plot with the positive real axis for 
while condition (4) ensures no encirclement of 
the (1,0) point for u> > taj. Thus, condition (5) implies that 
there is no encirclement of the (1,0) point for any frequency. 
Note, that if we neglect all phase information ((at = 0), the 
new theorem reverts to a statement of the Small Gain 
Theorem. O 

Because the total phase of the product of complex 
variables is the sum of their phases, (3) can be estimated as a 
consequence of the following two inequalities: 

0 < ^(ic(if4>)) + + 0(/i2(iw)) < 2jr, for 

(6) 

or 

-0(jr(w)) < 0(/i,(iw)) 0(/i2(ia))) < 2jr - 0(jr(w)), 

for (jj ^ tOp (7) 

The condition (7) can be transformed into two independent 
conditions, one for each loop, by introducing a parameter, 
f,(0< €y < 1), which defines the trade-ofif of performance in 
favor of the first loop. Then, the condition can be rewritten: 

--e,0(ic(ai))< 0(/i,(i(y))< e,(2jr- 0(ic-(aj)))| 

-(1 - Ey)(t>{K{a))) < 0(/i2(ift>)) < (1 - ei)(2jr - (#)(ji:(aj))) J 

for u) ^ 0 )*. (8) 

Similarly, inequality (4) follows from the following 
condition: 


|/?,(w)| < |x'(m)( ^ ^ 


(9) 


Each of the inequalities (8) and (9) contain only one of the 
controllers, therefore they form a suflicient stability test for 
independent tuning of decentralized controllers The 
proposed criterion is less conservative than the RIM test, 
which is equivalent to condition (9) alone when e, =0.5 and 
ai7==0. It 15 recommended that the value of be initially 
selected as c, = 0.5, indicating no preference to a particular 
loop performance. Increasing its value will bias the closed 
loop performance in favor qf the first loop 


3. Example 

Consider the following 2x2 system, where the time 
constants are given m minutes: 


P(s)- 


r 

(aTVY) 

-4e 

_(2av + \ ) 


_2.5e _- 

(\ 5 s +"l)(iN^~l) 

_I 

(35 + 1) 


( 10 ) 



FiG- 4. The Nyquist plot for the example problem with 
A, = 0.1 min. 

In this case, diagonal pairing with integral action cannot be 
guaranteed stable by the RIM test because |jf(0)|>l. The 
new stability criterion on the other hand, shown graphically 
in Fig. 3, indicates that closed loop stability can be guaranteed 
even with arbitrarily high gains. A value of f:, = 0.5 was 
selected, indicating no particular preference between the two 
loops. PI diagonal controller parameters were selected 
according to IMC tuning rules (Rivera et ai, 1986) with the 
gain chosen to be arbitrarily high (the IMC filter values used 
where A, = A2-0 1). As shown in the figure, w* = 
0.13 rad min ' and w* = 0.06 rad min ', thus satisfying 
equation (5) and therefore guaranteeing stability. In this case 
the complementary sensitivity functions of the loops 
coincide: 


/i,(.v) = 


\_ 

(A,s + 1)’ 


1-1.2 


( 11 ) 


The Nyquist plot for the diagonally paired system as tuned 
using the new method, is presented as the contour for W(ia>) 
m Fig. 4, showing that the (1,0) point is not encircled (as 
predicted). 

In fact, it is difiicult to u.se the norm-based RIM lest 
successfully even for the off-diagonal pairing Shown m Fig 
5, are magnitude and phase Bode plots for the new stability 
test applied to off-diagonally tuned PID controllers, with the 
controller gains for the two loops arbitrarily set to , =0.7 
and K^ 2 ~ "0.2, for which it can be shown that each SISO 
loop IS stable Again, the PID parameters arc IMC-based 
(Loop I; reset = 0.086 mm ', latc = 1.1)8 min, Loop 2. 
reset ~ 0 044 min rate = 2 22 mm. The controller gams are 
tunable). The resulting analysis clearly shows that on the 
basis of the KIM lest alone (Fig. .5(u)), nothing can be said 
about the closed loop stability in the frequency range 
0.05 < (ij <'0.35 rad mm ', since here, the sufficient stability 
condition is violated. On the other hand, the new approach 
shows that w* ' 1 rad min ', while uj* ~ 0.35 rad min ', 
indicating that the system is stable. I'his result can be 
confirmed by simulation. 



Fig. 3. The new .stability test performed on the eT^mple problem with t, =0.5. This analysis is for diagonal 
pairing. Shown are: (a) magnitude, with: - 1 /Vi*r| l/tj and 162 ! (coincident in this ca.se), (b) phase (in 
degrees): with: — upper and lower bounds (From equation (8)); <p(hy) and (pih^), again coincident. 
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Fig. 5. The new stability test performed on the example problem with This analysis is for 

off-diagonal pairing. Shown are: (a) magnitude, with: ~1/V|jc| |/i,l - (b) phase (in degrees) Bode 

plots The upper and lower bounds on the phase plot are those from equation (8). 


for ORICINOL p«trln«. FID pArAMtars; 
LOOF<l>: NC = B.BBM, RE = .2900, m = .0000 



KmbuI (• Por RCUERSE p«tpln«. PID p*T'*nPtap^ 
LfX)P(I>; KC = .7000. RE - .0061, RA = 1.0790 
LOOPC2> : KC c - .2000. RE = -0444, RA = 2.2220 



KmbuI (• Por RCUERSE p«tpln«. PID p*T'*nPtap^ 
LfX)P(I>; KC = .7000. RE - .0061, RA = 1.0790 
LOOPC2> : KC c - .2000. RE = -0444, RA = 2.2220 


Time (min) 


Fig. 6. Comparing the closed loop response for diagonal (left) and ofl-diagonal pairing (right) Both control 
schemes were tuned using the new stability criterion with e, - 0.5. 


Figure 6 compares the closed loop responses achieved 
using the diagonal pairing (1-1,2-2) with the ofT-diagonal 
(1-2,2-1) pairing, both tuned as described above The 
performance of the off-diagonal scheme is severely limited by 
the significant process delay times, while the diagonal .scheme 
can respond almost instantaneously to set point changes. 


4. Conclusions 

This paper has outlined a proposed sufficient and necessary 
stability condition which can be iKsed to design dcccnt^all^ed 
control for a 2x2 MIMO system. The method has been 
shown to be less conservative than alternative norm-based 
methods, because unlike them, it conserves phase informa¬ 
tion. The sufficient and necessary condition derived has been 
modified to provide a .sufficiency condition, still utilizing 


phase information, which allows the independent tuning of 
the parameters of the two controllers A irade-ofl parameter 
must be set, which delineates the relative importance of the 
performance of one loop over the other. 
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Comments on ‘On Absolute Stability and the 
Aizerman Conjecture’* 

XIONG KAIQlt 

Key Word§ Absolute stability; Aizerman conjecture; control systems; Lyapunov function. 


Abstract—This paper discusses the absolute stability of 
control system. We point out that Theorem 2 in the paper by 
Grujic is not correct. 


Explanation and correction 

In theorem 2 of the paper (Grujic, 1981) the author 
obtained a necessary and sufficient condition for system ( 1 ) 
to be absolutely stable. This short paper will show that this 
result is invalid, For this purpose, we are going to give a 
counterexample. 

Consider Example 1 of the paper (Grujic, 1981) 


djr^/-IL 

dt V -1 


0 

-10 




I-Jt. 



Ox 


e| 0 . 2 |, 


02 


[ 0 . 21 . 




For Jt, = < 0, X 2 = - UX) < 0, we have 

D(*,)i/(jc) = 23.66,02 - 0.86202 

+ 4626,(-JC,) -L 62 (-x,) 

= 33.6X200- lOx 100 + 0.8x200- 1 = 5879>0. 

I’his means that by the set f'n(L) of (7) we cannot ascertain if 
system ( 1 ) is absolutely stable in Tlieorem 2 
A mistake of Theorem 2 results from an error in (A. 6 ) 
(Grujic, 1981). In fact, (A. 6 ) must be corrected to the 
following 

0;,y(x) = b^{I^ + TNi\x\)D^\,\xl 

But, we can correct Theorem 2 to the following form 
Theorem 1*. If C is the identity matrix and L is a compact set 
then for F^^{L) 


and take that Lyapunov function as follows 

u(j;/) = 2|jt,| + |.r2l+2o[' ''^-^dr,+ 10[' — ^tr., 

J() T, X 2 

In particular, let 






02 (jt) = 


0, Jt. » 0 

2x2, 


Then, v = 42 ljt,| + IX 2 I, (x) = 426,(- llx, + 0.40, + 

0 . 802 ) + ~ 0 . 802 ), where 

«5,= 6 ,(x„i-) 

( L X, >0 or X, = 0 and x, > 0 , 

0 , X, = 0 and x^ = 0 (i = 1 . 2 ) 

-1, X, < 0 or X, = 0 and x, < 0. 


where 


A^*(x) = diag 


0 l( 2 ^) 02 ( 2 ^) 

a, sgnx, OiSgnxp 




To be a Lyapunov functional family of the system (1) on 
JV(f(L) it is necessary and sufficient that 

(i) the vector [(/„ + TT)h] be positive for every L c L, and 

(ii) the vector \RiL, T)b] be negative for every L 6 L. 

As Ls similar to the proof of Theorem 2 in Grujic’s paper, 
we easily prove that Theorem 2* holds. 
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Reply to ‘‘Comments on ‘On Absolute Stability 
and the Aizerman Conjecture’ ”* 

LJUBOMIR T. GRUJiCt 


Key Words —The Lyapunov method; ihe Lyapunov functions; Lurie systems; absolute stability; 
Aizerman conjecture; stability domains. 


Abstract —The note presents the refinements of some results 
of Gruji^ (1981) and of comments by Kaiqi (1992). 


Comments by Kaiqi (1992) on Grujic (1981) are correct 
under the following rcfmmcnls. 

Kaiqi (1992) contributes by introducing a new tentative 
Lyapunov functional family that should he denoted by F*y(L) 
rather than by where 

= = t zj A^'(v)dv)./e/V„(Z)]. 

With N*(ji) defined by 

N*(x) ^ diag {nt(j:,), 


n;(X,) = 


0, jr, - 0, 


1 = 1 , 2 . . 


, m. 


Theorem 2* by Kaiqi (1992) is a new development of 
Theorem 2 by Grujif (1981). The latter is correct provided 
Nf,(L) in it, as well as in its proof. Corollaries 1 and 2 and 
Examples 1 and 2, is everywhere replaced by N,y(L), where 
/V^(L) is the family of all odd / e N„(Z.). 
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Relay Auto-tuning in the Presence of Static 
Load Disturbance* 


c. C. HANG.t K. J. ASTROMi and W. K. HOt 


Key Words Disturbance rejection; limit cycles; PID control; process control; relay control. 


Abstract —Static load disturbances during the relay tuning 
experiment introduce errors in the estimates of the ultimate 
gain and ultimate period. This paper shows how an 
automatic bias can be used to overcome the problem. 

1. Introduction 

Auto TUNING of PID controllers has recently received much 
attention in the literature because of its potential applications 
in reducing system start-up time, and in lightening process 
control through regular rc-luning (Drislnl, 1977; Kraus and 
Myron, 1984; Iligham, 1984; Hess ct ai, 1987; Radke 
and Iserrnann, 1987; Astrbm and Hiigglund, 1988a; Hang 
ct al., 1991). Many commerical products for auto-tuning 
have appeared in the market since the mid 198()s.^ 

The relay feedback aulo-tuncr (Aslrdm, 1982; Astrdm and 
Hiigglund, l9K4a, b, 1988a) is one such product. It is based 
on the automatic measurement of the ultimate gain and 
ultimate’ period (the point of the Nyquisi curve that first 
intersects the negative real axis) from which the PID and PI 
controller paramelers arc computed. Its greatest merit is that 
a prion information about the process time scale and 
dynamic structure is not needed. Furthermore, it can 
automatically produce an excitation signal with frequencies 
around the ultimate frequency ol the process. Therefore it is 
also suitable for initializing other more sophisticated 
auto-tuning or adaptive control algorithms (Lundh and 
Astrdm, 1992). 

An enhancement of the relay feedback auto-tuner is 
introduced m this paper. It attempts to overcome the 
problem of static load disturbance during the relay 
experiment which results m inaccurate estimates of the 
ultimate gam and ultimate period. 


amplitude and the static load are 0.1 and 0.08, respectively. 
The static load and the relay amplitude are almost equal and 
the estimated ultimate gain and ultimate period are in error 
by -♦-14% and -\-2\%, respectively. 

2.2. Error detection and xelf-correction. Because of the 
periodic nature of the signals, the static load disturbance can 
be determined easily by considering the dc components of 
the process input and output. The dc component of the 
input, to the process is given by 






d f I, 


( 1 ) 


where d and I are the scl-poinl, process static gain, 

relay amplitude and static load dislurbancc, respectively; t^ 
and tj arc the positive and negative relay output intervals, 
respectively. The dc component of the process output, is 
given by 




e dt, 


( 2 ) 


where t is chosen such that the integration is performed over 
one period of the steady-slate oscillation. Since the dc gain of 
the process is hence 

and 


kJ}' + ’ d + /) - y, - -- • f 

'’V*p /,+fj / + 

Rearranging (3) gives 


e df. 




2. Static toad disturbance 

Static load disturbances arc common in process control. 
Their effects during Ihe relay exf>crimcnt can be illustrated 
using the following process 


and the step-up shown in Fig. 1. 

2.1- Errors in the estimates of ultimate f>ain and ultimate 
period. When the relay is connected in the feedback loop, 
for most common processes, oscillation with symmetrical 
ptisilivc and negative half-cycles would occur as shown in the 
first 10 sec m Fig. 2. When a static load disturbance of 
magnitude less than the amplitude of the relay occurs, the 
oscillation become asymmetrical (Hang and Astrdm, 1988). 
This is shown in the second 10 see in Fig. 2 where the relay 
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To restore symmetry, the effect of the static load must be 
cancelled. Therefore a bias, equals to the negative of the 
load should be added to the relay output. Thus 


Ji + r, *„((, +»j)J. 


( 5 ) 


where k is replaced by its estimate k^. 

This formula can be used fur retuning when the process 
gain is already known. For initial tuning, a guess of has to 
be made. In process control, a rule of thumb would be to set 
k = 1 as the process static gain is u.sually between 0.5 and 2. 
Il the bias calculated using (5) would not cancel the 

static load completely and the oscillation could still be 
asymmetrical. However, at this stage, the asymmetry could 
be slight and hence tolerable; if not, a second bias can be 
applied. With the bias u^,, (3) is modified to 




(ft) 


where t[ and from the new positive and negative relay 
output intervals, respectively. It now follows from (4), (5) 
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Fig. 2. Static load disturbance and corrective bias with 
^p = 0.5. 


and (6) that the process static gain can be obtained from 

+ ^2) I c “ (^1 “*■ ^2) I tr d/ 

L -- L -iL- (7) 

(r, -f t2)('i “ ' 2 )^ ^ (^1 ^ 2 ) j ^ cl/ 

With /tp known, / can be calculated from (4) and a second 
bias, Ub can be added to to completely cancel the static 
load. Hence 


“b + Wb= 

and 

Furthermore, once and / are known, a low order transfer 
function can be estimated from the wave-form of the 
oscillation (Astrbm and Hagglund, 1988b). 

The new feature of automatically biasing the relay once 
the asymmetry is detected can be easily incorporated into the 
existing relay auto-tuner. Its effectiveness is demonstrated in 
Fig. 2. At t = 22 sec, a bias was applied based on a initial 
guess of /[p = 0.5 (/Cp = 1) which did not result in symmetrical 
oscillation. However, notice that the asymmetry is slight and 
the amplitude and period of the oscillation are not too far 
from those obtained when the oscillation is symmetrical. The 
estimated ultimate gain and ultimate period arc out by only 
+2% and -2%, respectively. In practice, this could be an 
acceptable error and the relay experiment stopped at this 
point. Figure 3 shows for '^he same system, what could 
happen given a poor initial guess of k^. At f = 22scc a bia.s 
was applied based on the initial guess of Ap = 0.25. It is 
obvious from the figure that the bias was not effective. At 
r = 35 sec, kp was estimated using (7) and a second bias was 
applied. Symmetry in the oscillation was subsequently 
restored. 

The problem of asymmetrical oscillations also occurs for a 
relay with hysteresis. In this case, all the equations are still 
valid and the bias computed can likewise be added to the 
relay to restore symmetry and hence accuracy in the 
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Fkj. 3. Static load disturbance and correclivc bias with 
/c, =0.25. 


estimated parameters. The effects of a sinusoidal load 
disturbance ^during the relay experiment was discussed in 
Hang and Astrom (1988). A simple solution has not been 
found for this problem. 

3. Conclusions 

Auto-tuning of PID controllers using relay feedback is 
known to be a robust technique which requires little prioi 
knowledge of the process compared to other aulo-tuning 
approaches. To further enhance the existing relay feedback 
auto-tuner, automatic biasing of the relay to overcome static 
load di.sturbancc is recommended 
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Vector Lyapunov Functions and Stability 
Analysis of Nonlinear Systems* 
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sleenweg Noord 2, B-9f)52 Gent-Zwijnaardc, Belgium 

SiNCt THE BEGINNING of the last ccnlury mathematicians have 
sought and with considerable success been able to define 
physical systems and phenomena using differential equations 
for exploring the underlying harmony in an apparently 
diverse nature. Lyapunov’s gift to the scientific community, 
of a tool which is now widely accepted and fruitfully used for 
the analysis of differential equations, most of which are 
nonlinear in nature, gave the much required thrust for the 
development of modern stability theory. While Lyapunov’s 
first method investigates the local stability properties by 
linearization around a working point, his name is commonly 
associated with the second, which involves use of continuous 
functions of state variables. Besides Lyapunov’s second 
method is particularly useful as it does not demand the 
knowledge of solutions of differential equations but instead 
measures the domain in terms of distance which is the 
Euclidean length of the state vectoi and thus arc powerlul 
tools for practical applications. 

In this book the authors have sought to collect and present 
the main trends in the basic theory of ihe method of Vector 
Lyapunov functions. The authors begin with the generaliza¬ 
tion of Lyapunov functions by relaxing the conditions of 
positive definiteness and decresccntncss. Comparison prin¬ 
ciples arc used in order to bring together the concept of 
Lyapunov functions and the theory of differential ine¬ 
qualities, which in turn enables the transformation of a given 
complicated differential system to the study of relatively 
simpler scalar differential equations, thus facilitating the 
study of stability and other properties of solutions. A brief 
description of various forms of stability and the ensuing 
boundednc.ss is given in a very succinct manner and is 
followed by examples to drive home their relevance. The 
otherwise theoretical and mathematical treatment of the 
subject is given a practical touch when the authors set out to 
define the terms of practically stable .systems giving example 
of a missile which may oscillau around a mathematically 
unstable course even though its performance is acceptable. 

Having set a framework for simplifying the complex 
nonlinear system the authors then refine the basic Lyapunov 
theorems using stability results employing two Lyapunov 
functions thus relaxing the conditions further. Boundedness 
and practical stability are further proved using two Lyapunov 
functions, which set the framework for using several 
Lyapunov functions with a view to generalize the conditions 
further. I’his framework is later used to analyse results of 
global stability due to the fact that the use of several 
Lyapunov functions basically involve.s dividing the vicinity of 
some convenient set into suitable subsets. The authors then 
drive home the practical relevance of Vector Lyapunov 
functions by analysing the large scale systems in context of 
perturbation theory. Perturbed system analysis background is 

• Vector Lyapunov Functions aniJ Stability Analysis of 
Nonlinear Systems, Mathematics and its Applications, Vol. 
63 by V. Lakshmikantham, V. M. Matrosov and S 
Sivasundaram. Kluwer Academic Publications (1991). 


essential to the study of large scale systems since large scale 
systems constitute of subsystems some of which may be 
disconnected and reconnected during functioning, thus 
destroying stability and causing the system to fail. Ihe 
stability and boundedness properties of perturbed systems 
are studied using coupled comparison systems. Also the 
authors show that given good qiialilalivc properties there 
exists a Lyapunov function for each subsystem and a 
combination of these Lyapunov functions which constitute 
the Vector Lyapunov function may then be used lo study the 
whole large composite system. The theory involving 
perturbed and unperturbed systems is woven Ingclher using a 
new comparison theorem. The rcstrielioii of quusimonotonc 
property is taken care ol by introducing the concept ol 
quasisolulions, which leads to isolated subsystems thus 
simplifying analysis and is of immense practical importance. 

Having formulated the theory of Vector Lyapunov 
functions the authors have devoted a considerable part of the 
book to develop and extend the method of Vectoi Lyapunov 
functions to a variety of nonlinear systems. In parliculai the 
examples on control systems, where a convenient and unified 
method of specifying control sets m order lo keep the system 
within a desired practical stability behaviour of the controlled 
motion, is developed and is relevant to the control 
community. Also the example on decentralized control 
system has been treated in the context ol perturbation 
theory. Besides the various examples that arc evenly spread 
over the book, the aulhors have devoted a chapter to 
analy.sing various models of practical relevance. The practical 
fruitfulness of the method is discussed in wide langing 
applications with examples Irom the economic models based 
on the Walrasian approach ol supply and demand, the 
motion of aircraft, immunology, chemical kinetics and neural 
networks. Among the various models discussed in the 
framework of Vector Lyapunov function arc the neural 
networks and motion of aircraft which are of particular 
relevance to control theoreticians. 

It would he appropriate lo add that besides stability 
analysis the method can be exploited much further by 
control, adaptive control engineers and thcorclieians 
Besides global stability analysis a natural application ol the 
theory of Vector Lyapunov function would be lo analyse the 
convergence properties of the sy.stem parameters, that is, the 
extent to which a solution of nonlinear dynamical system 
may be required lo converge to ensure stability and 
acceptable performance of the adaptive control algorithm. 
There is little doubt that Lyapunov functions arc becoming 
increasingly popular in the adaptive control community. Also 
the fact that the theory of Vector Lyapunov function merges 
so well with neural networks models is an interesting and 
welcome feature. Since neural networks arc being widely 
used for modelling and identification of input-output space.s 
of dynamical systems and static identifiLation which include 
pattern recognition, makes this framework relevant lo wide 
ranging needs. The book though appealing to a wide 
ranging audience, is written in a mathematical style. While 
leaving little doubt about the practical and theoretical 
implications of Lyapunov’s second method, a corresponding 
geometrical or graphical interpretation of theorems would 
make the reading lucid and within easy grasp of those who 
are not mathematically inclined. 
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The book brings together wide ranging results of 
considerable relevance. It is particularly useful to theoretici> 
ans and theoretically inclined engineers who are looking for 
tools to analysing the physical systems in a lictter and unified 
manner. To sum up the book is a research monograph for 
those with relevant background in nonlinear systems theory. 
Nevertheless the book presents the results in a brief, to the 
point manner and is perhaps a fitting tribute to Lyapunov in 
a year which marks the century after publication of the 
original pioneering work of Lyapunov. 

About the reviewer 

P. C Ojha completed his graduation in engineering in 


1988 specializing in electronics and communications from 
Aligarh University, India where he was an active member in 
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hardware computer maintenance engineer with a computer 
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the Control Systems Centre of University of Manchester 
Institute of Science and Technology specializing in Control 
and Adaptive control. He completed his masters in 
engineering in 1991. At present he is associated with the 
Automatic Control Laboratory, University of Gent, and is 
involved with an industrial project. His research interesLs 
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Control Sensors and Actuators* 

Clarence W. de Silva 


Reviewer: M, MACHACEK 

APAX Computers Ltd., 10 Chartfield Road, Reigate, 
Surrey, RH2 7JZ, U.K. 

Thi: stated author's intention was to summari.se the material 
used in undergraduate and piostgraduatc courses in control 
system instrumentation. The book is aimed at both .students 
and practising instrumentation engineers. Enough material 
was to be assembled in the book from which selections could 
be made for particular courses. 

The sensors and actuators dealt with are generally those 
used in piece-part manufacturing rather than handling of 
fluids (proce.ss control). Even with this restricted scope, the 
book provides a wealth of information on over 4(X) pages of 
text and figure.s. Perhaps the title of the book may be a bit 
misleading. Readers might expect it to cover a wider range of 
mca.surcments, e g. temperature, level, flowrate, etc. which 
it does not. 

Sen.sors and actuators used in robots, machine tools and 
other devices subject to motion and force control arc 
considered in great detail. Transducers and signal proccs.sors 
are included where necessary. Both analog and digital 
sensors are de.scribcd. 

The sensors and actuators are always presented as integral 
parts of control systems, the system inlegraiion is an issue in 
each of the chapters on individual clement types. 

In Chapter 1, an introduction is given to control system 
instrumentation. Analog and digital control systems arc 
discussed and the advantages of the digital systems are 
clearly stated. An assumption is made of the reader’s good 
grasp of control theory. However, the knowledge of the 
theory is not essential for understanding the bulk of the 
book—only tho.se parts where integration into control 
systems is discussed. 

An overview of sensor and transducer models is given in 
Chapter 2. A very good general approach to the flow and effort 
variables, input/output rating and interface matching is 
presented and model schemes are .set out to which references 
are made throughout the text. Performance criteria, both in 
the time and frequency domains are listed. Most of the 
widely used methods for estimating errors and uncertainties 
are given in detail. Perhaps .some of the flow and effort 
variables could be explained in more detail in order to 
illustrate better the principles of analogy between mechanical 
and electrical variables. 

Chapter 3 deals with analog sensors of motion. Most 
widely used sensors of displacement, velocity and accelera¬ 
tion are described. Jerk is noted as a variable to be measured 
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but no further details are given of how it may be or why it 
.should be measured. The sensors described in detail are 
those based on variable resistance, reluctance, mutual 
inductance and capacitance, on permanent magnet.s, eddy 
currents and piezoelectricity. Others, noted but not 
described in detail, are fiber optic sensors, including an 
optical gyroscope, Doppler interferometers, ultrasonic 
.sensors and mechanical gyroscopes. Proximity sensors are 
included even though they do not actually provide 
quantitative ineasuremenls 

Force and torque analog sensors are dealt with in ('hapler 
4. Potential conceptual difricullies m determining causality in 
force control systems are well noted even though a more 
detailed explanation with additional examples could have 
been helpful. Good description of strain gauges is given, 
both resistive and semiconducting, including their transdu¬ 
cers. Deflection methods of measuring torque using motion 
sensors is described and analysed. Tactile sensors are noted 
as a new technology, with signal processing identified as the 
main topic of on-gomg research. 

Motion sensing using digital transducers is described in 
Chapter 5. Incremental and absolute shaft encoders based on 
optical, electrical, magnetic or proximity sensing arc handled 
in great detail, including their errors. Digital resolvers and 
tachometers arc described. Most of the chapter deals with 
rotary sensors, linear motion sensors and limit switches are 
noted. 

Permanent magnet and variable reluctance stepper motors 
are the subject of Chapter 6. Stepping sequences are 
explained perhaps in too much detail for various geometries 
of the motor construction—a single general method would 
have been more in tune with the style of the book. Static and 
dynamic behaviour analysis and motor models arc good 
enough for understanding of open and feedback loop 
controls. Damping requirements are analysed, mechanical 
and electrical damping principles arc clearly explained. 
Good, practical guidance is given for selecting a motor for a 
given application. 

Chapter 7 could easily have been split into .several shorter 
.sub-chapters for better clarity. As it is, this chapter deals 
with the whole range of continuous drives, including 
electrical DC and AC motors as well as hydraulic valves, 
motors and pumps. The DC motors, with wired armature 
and bru.shlcss, are handled in greatest detail. Models are 
given for various field winding connections, issues for open 
and closed loop control are well explained. Usage of SCRs 
arc well illustrated even though the SCR commutation would 
have perhaps deserved some more detail. I'orquc motors, 
with permanent magnets and geared, arc noted. AC 
induction motors with wound and cage rotors are discussed 
together with their models and ways of controlling the 
specd/tnrque. Of the control methods, the field feedback 
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method could have been explained a little better. AC 
synchronous motors are handled very briefly. Somewhat 
surprisingly, linear motors and solenoids are included under 
the heading of AC synchronous motors. Pressure and flow 
control of hydraulic actuators is handled thoroughly, 
particulars of the most common elements are noted. 
Pneumatic controls are noted but not handled in any detail. 
Guidance for actuator selection is given but only for DC 
motors—others could have been included for consistency. 

The text is full of well-chosen exampic.s that serve to 
illustrate the theory. There are adequate illustrations to 
make the text easy to follow and comprehend; m only very 
few cases one needs to turn a page in order to relate the 
illustration and its reference in the text. There arc some 
typographical errors of minor nature. The reviewer hud no 
time to examine the numerous equations but some mi.splaccd 
brackets in the section on statistics were easy to spot. 

The great value of the book is in the wealth of problems 
given at the end of each chapter. The problems generally 
require a great deal of creative thinking as well as good 
understanding of the engineering issues. The problems would 
be of very good value under a tutor’s guidance. 

References arc given at the end of each chapter. 

The author has managed lo assemble a great deal ol 
material on a wide range of subjects and yet managed lo go 
into suflicicnl level o! detail on all the issues covered. I 


believe that the aim set out for the book, i.e. lo be a 
collection of material from which individual courses can be 
assembled, has been well achieved. 

Perhaps in some parts the author’s expectation of 
theoretical knowledge of an average practising engineer is a 
bit high but understanding of all detail is not essential to get 
benefits from the book. The bulk of the book could be 
followed by those with a good knowledge of algebra and 
differential calculus. 

A bout the reviewer 

Milos Machacck received his Dipl. Ing. degree in electrical 
engineering in 1%6 from the Czech Technical University of 
Prague. He worked for four years as a tutor and lecturer at 
the same university, in the department of electrical 
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• Foxboro G.B. Ltd (process control instrumentation and 
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T’hl book Adaptive Filler Theory gives a very thorough and 
good treatment of a large and growing area. Adaptive filters 
are defined as filters where the eoefticients of the filters are 
changing with changing slalislies of the signals lo he filtered 
By using adaptive filters there can be significant improve¬ 
ments compared with conventional fillers with fixed 
characteristics. Adaptive fillers have successfully been 
applied in, for instance, communications, control, radars, 
sonars, seismology, and biomedical engineering. 

The author has a llirce-fold aim ol the book; 

(1) develop the theory ol diricrent classes ol linear liniie 
impulse response (FIR) adaptive Alters, 

(2) to illustrate the theory with examples and computer 
experiments, 

(3) introduce some ideas ol nonlinear adaptive filtering. 

We will look at the different goals and sec how they are 
accomplished. 

Before treating adaptation mechanisms the design of linear 
optimal filters is discussed. This gives the reader a good 
background and makes the hook sclt-contained. When using 
the book as a text book this can be a disadvantage, since the 
basic material may be covered in a different course based on 
a different text book. On the other hand it gives the 
background and the notations that are needed in the 
adaptive part of the book. 

The first goal is the main part of the book (even if it does 
not start until p. 273). The linear FIR adaptive fillers are 
divided into: gradient algorithms, recursive least-squares 
(RLS) methods, and fast RLS algorithms. 

The gradient algorithms cover the well-known least mean 
square (LMS) algorithm, its advantages and disadvantages 

* Adaptive Filter Theory, 2nd ed. by Simon Haykin. 
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are discussed and illustrated. The computational burden is 
low when using LMS and olher algorithms based on 
stochastic gradients. The drawback is the slow rate of 
convergence. The next logical step is then lo trade 
convergence rate against increased amount of compulations. 
Different ways to organize the computations lor RLS 
algorithms are discussed, e g, QR-decomposilion and Givens 
rotation fhe main part of the computiitions m the RLS 
algorithms is the updating of a matrix. This leads to a 
computational complexity that increases with M~ where M is 
the order of the filler. The LMS algorithm increases with M. 
T'he last algorithms have the property that the computational 
complexity also increases with M The fast algorithms arc 
more complex than the LMS algorithms, but have the same 
convergence properties as the RLS algorithms. 

The material on design of linear FIR adaptive fillers covers 
about 4lK) pages. The treatment is logical and thorough and 
leads the reader carefully towards more and more complex 
algorithms. The advantages and disadvantages of the 
different algorithms are discussed in great detail. The author 
also gives many hints and interpretations of the different 
algorithms. In my opinion the first goal of the aim is very 
well achieved. 

The properties of the different algorithms are illustrated by 
computer simulations. All the details of the simulations are 
not always given. This makes it difficult for the reader to 
recreate the simulations and lest the influence of different 
parameter choices. The examples are, however, well cho.sen 
and give good insight into the properties of the different 
algorithms. In my opinion there could have been more 
simulations of the different aspects of ihe algorithms. This 
could have been done at the expense of the description of 
some of the algorithms. 

Nonlinear filtering and especially nonlinear adaptive 
filtering is a difficult area. This is treated in one of the last 
chapters ol the book. The problem of blind deconvolutions is 
discussed. Lhe problem is to reconstruct an unknown signal 
after it has been filtered through an unknown system. The 
problem of blind deconvolution is important and have many 
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practical implications. To solve the problem it is necessary to 
assume that the desired signal is non-Gaussian and use the 
higher order statistics of the measured signal. This leads to 
nonlinear estimation. This part of the book is far outside the 
scope of the rest of the book and could have been omitted. 

Numerical properties such as finite precision effects and 
numerical stability arc discussed and illustrated in the end of 
the book. These are important problems that often are 
neglected in text book.s. 

Each chapter ends with a selection of problems. Working 
through the problems gives insight into the properties of the 
different algorithms. The problems often lead to new 
versions of the algorithms and help the student to get a 
better understanding of the underlying theory. According to 
the preface there is a solutions manual available through the 
publisher. The solutions manual pre.sents detailed solutions 
to all the problems in the book. Most chapters also have 
computer-oriented problems. These problems involve simu¬ 
lations of different algorithms and investigation of different 
choices of parameters. The use of computer simulations is 
important and will encourage the students to use their 
creativity and will increase the understanding of the 


methods. 

The book has an extensive bibliography and the references 
are up-to-date. The author also has good descriptions of the 
historical development in the area of adaptive filtering. 

In summary the book fulfills its goals with respect to the 
author’s three aims. The book can be used as a text book in a 
graduate course in adaptive signal processing. It can also be 
used as a reference book for anyone who wants to find 
details about a number of algorithms for linear FIR adaptive 
filters. The book is strongly recommended for its wide 
coverage and for its depth. 

About the reviewer 
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M.Sc. degree in Electrical Engineering in 1%6 and Ph.D. 
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I'Hfc AIM OF THIS b(X)k is “to give a systematic presentation of 
methods oriented towards the computer aided analysis and 
design of linear control systems”. Being the first book 
devoted exclusively to this topic, it fills an important gap in 
the existing control literature. The book deserves the 
important role of being a well documented guided 
introduction in the numerical aspects of solving .system 
analysis and synthesis problems, and therefore it will be 
certainly useful tor many control specialists unfamiliar with 
numerical techniques. Simultaneously, it represents a 
valuable reference for further research in the area of 
developing numerical algorithms for control systems. 

The book illustrates the close interaction in the lasi two 
decades between two field.s of applied mathematics: system 
theory and numerical analysis, and the emergence of a new, 
interdisciplinary field, which we can denote as the 
computational system theory. Specific demands for solving 
system analysis and design problems stimulated many 
researches in the area o( numerical linear algebra. The 
algorithms dcvelopied for solving linear matrix equations 
(e.g. Lyapunov or Sylvester), for computing matnx functions 
(for instance the matrix exponential), or for ordering 
standard or generalized Schur forms by means of orthogonal 
transformations, all have strong appeals to important control 
problems. On the other side, the development of 
computational methods for linear systems was strongly 
influenced by the progress in the field of numerical algebra 
and the availability of high-quality software for solving linear 
algebra problems. Many of the sophisticated algorithms of 

* Computational Methods for Linear Control Systems by P. 
Hr. Pelkov, N. D. Christov and M. M. Konstantinov. 
Prentice Hall International (U K.) Ltd. (1991). ISBN 
0-13-161803-2. 


linear algebra served as tools or as models for developing 
reliable algorithms for control problems. A notable example 
is the wide usage ol the reduction ol a square matnx lo the 
real Schur form as a preliminary preproce.ssing step in many 
proposed algorithms Virtually tor almost all mam 
computational problems of linear sy.sicms theory, algorithms 
labeled as “Schur” methods are available or can be derived 
(although they are nor always the best ol pos.sible 
approaches). 

The contents of the book may be outlined as follows 
Chapter 1 deserves the role of introducing the basic 
techniques of numerical matrix computations. Several 
introductory paragraphs are devoted lo basic numerical 
issues, rounding errors, conditioning of problems, numerical 
stability, performances of algorithms. The rest of the chapter 
is devoted lo the mam topics of numerical linear algebra: the 
solution of linear equations and the compulation of 
eigenvalues. Along with presenting solution techniques for 
these and related problems, algorithms for computing 
various matrix decomposition (LU, Choic.sky, OR, Hessen- 
berg, vSehur and singular value decomposition (vSVD)) are 
described. This chapter clearly demonstrates the necessary 
emphasis on using orthogonal transformations in developing 
numerically reliable algorithms. Orthogonal transformations 
produce small, easy to bound errors, when they arc applied 
to other matrices. Algorithms based exclusively on the use of 
such transformations have sometimes the highly desirable 
property of numerical stability, and therefore represent 
reliable numerical approaches for solving the respective 
computational problems. A more detailed presentation of 
the material presented in this chapter can be found in the 
book of Golub and Van Loan (1983). 

Chapter 2 introduces the main concepts and reviews 
several fundamental results (without proofs) of the 
.state-space analysis and design of continuous and discrete- 
time linear control systems. The computational problems to 
be addressed in the next chapters arc formulated here and, 
for some problems, simple solution methods arc given. The 
presented methods are, however, of purely theoretical 
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interest and arc not suitable for computer implementations. 
This aspect is illustrated several times later in the book.. 

Chapter 3 is devoted to the solution of state equations by 
UvSing methods based on matrix exponentials. After 
presenting the main results on the sensitivity of matrix 
exponential, three classes of numerical approaches for its 
evaluation are presented: power series methods, ordinary 
differential equation methods and methods based on matrix 
decompositions. Well chosen examples illustrate the merits 
and drawbacks of different approaches. The accompanying 
error analyses provide useful bounds for assessing the 
precision of results. The best general purpose method to 
compute the matrix cxpcmcniiai is (probably) a combination 
between the block-diagonalization technique (Algorithm 3.4) 
and the Padd approximation method (Algorithm 3.2). A 
word of caution concerning the presented series methods 
(Algorithms 3.1 and 3.2) is necessary here. Both methods 
use a balancing procedure (described vaguely in vSection 
1,10) intended to reduce the 1-norm of the given matrix A. 
Although no reference is cited in the book, usually the 
popular balancing method implemented in the Fortran 
subroutine BALANC form EISPAC'K is assumed to be 
used. However by using this routine, the l-norm of the given 
matrix may occasionally increase (sometimes drastically) 
preventing thus the applicability of methods due to 
unavoidable overflows. This aspect is apparently overlooked 
in several existing implementations of the Pade method and 
spectacular failures can occur even (or simple examples as 
the following one 


X = 


2 1 
0 1 
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I 

I 
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where e is a very small quantity (lor example, the machine 
relative precision). 

Chanter 4 di.scusses algorithms lor solving computational 
problems related to the analysis of linear control systems. 
Several methods are described for the analysis of .stability of 
linear systems and for studying the robustness of this 
properly in presence of parametric perturbations. I’hcn, the 
closely related problems of solving Lyapunov and Sylvester 
matrix equations arc considered. The perturbation analysis 
of the Sylvester equation precedes the presentation of the 
Barlcls-Stcwart algorithm for its .solution. The analysis of 
roundoff errors of this algorithm reveals ilN numerical 
stability. Variants of this algorithm for solving continuous 
and discrete Lyapunov equations or to compute directly the 
Cholcsky factor of the non-negative definite solution of a 
class of Lyapunov equations are also presented, A more 
efficient variant of the BarlcLs-Slewarl algorithm, known as 
the Hcssenberg-Schur method, is also presented. The 
methods for analysis of controllability and observability 
properties arc based on a numerically stable algorithm for 
computing the so-called controllability (or observability) 
staircase canonical form of a .system. A prool ol the 
numerical stability ot this algorithm is given. I he chapter 
ends with a discu.ssion of methods for system balancing and 
model reduction. 

Chapters 5 and 6 concern with the design of control 
systems by slate-space methods. Chapter presents 
computational methods for pole assignnienl of linear 
systems. 3 wo numerically stable Schur methods are 
described. Both methods arc based exclu.sivcly on the use of 
orthogonal transformations and produce, besidc.s the stale 
feedback matrix assigning the desired poles of the closed 
loop system, the Schur form of the closed-loop systems state 
matrix and the corresponding orthogonal transformation 
matrix. Both methods are suitable to design observers by 
\K)\c assignment. The observer state matrix can be directly 
determined in a Schur lorm, a usetui feature for increasing 
the efficiency of on-line computations when the observer 
operates in real-time. Robust pole assignment lechniques arc 
also considered. These methods try to a.ssign simultaneously 
with the poles also a well conditioned set of eigenvectors of 
the closed-loop systems state matrix. For discrete systems, a 
special purpose pole assignment algorithm is presented for 


designing minimum norm dead-beat controllers. Finally, 
several methods are described which place directly the 
closed-loop poles in the stable region of the complex plane, 
without however involving an explicit or exact assignment of 
these poles. In spite of their desire to present the best of 
existing methods, the authors apparently overlooked the 
stabilization algorithms proposed in Varga (19KJ) which, in 
my opinion, are more suitable to solve reliably the 
stabilization problem than the methods presented by the 
authors. 

Chapter 6 is devoted to computational niethod.s for solving 
continuous and discrete-time algebraic matrix Riccati 
equations. Both iterative (Newton’s and matrix sign 
function) methods as well as direct (Schur vectors, 
generalized Schur vectors and .sympleclic transformation) 
methods arc presented. Interesting examples are given which 
illustrate various aspects, as for example: the sensitivity of 
problems, numerical stability issues, or the accuracy of 
methods. Worth mentioning are Examples 6.7 and 6.9 
showing that generally the Schur vectors methods cannot be 
unconditionally considered as numerically stable methods. In 
comparing the relative efficiencies of Newton’s and Schur 
methods, the authors arrive to a .somewhat misleading 
conclusion that Schur methods arc generally much more 
efficient than Newton’s methods. This conclusion is based on 
a overly pessimistic operations count for (he Newton’.s 
method (2(K)n^ flops for 10 iterations) and too optirni.stic 
operations counts for the Schur method (75/i^ flops) and 
generalized Schur method flops). Actually, by using 

the authors' evaluations to compute the Schur form (about 
13/1^ flops) and the generalized Schur form (about 33/r^ 
flops), wc obtain the following figures: about \5i)n^ flops for 
ten Newton iterations, about 13()n’ flops for the Schur 
vectors method and about 320n^ flops for the generalized 
Schur vectors method. However, the Newton’s method 
u.sually converges to the limiting accuracy solution in at most 
seven-eight iterations, and therefore this method has usually 
the same efficiency as the Schur vectors methods. In passing 
we also note that all memory requirement evaluations for 
various algorithms are in cxces.s with at least In ' storage 
locations. The implementations listed in Varga and Sima 
(1993) illustrate the possibility to implement these methods 
with lower memory usage. 

C’hapter 7 presents methods for computing with sub.spaces 
(sum, intersection, image, pre-image and angles) and for 
determining various subspaccs (controllable, unobservable, 
controlled invariant and controllability subspaccs) arising in 
the geometric theory of linear systems. The algorithms which 
perform such computations arc based on reliable numerical 
lechniques to assess the rank ol matrices (SVD or OR 
decomposition with column pivoting). An interesting 
application of the presented lechniques is the computation of 
the Kalman decomposition of a linear system. Many 
examples are used to illustrate the main computational 
aspects. 

We can formulate several closing remarks. The book offers 
a comprehensive overview of existing numerically reliable 
algorithms for solving the basic systems analysis and design 
problems. I'hc selection of presented algorithms reflects 
clearly the important progress made in the Iasi two decades 
in developing numerically reliable algorithms for control 
systems and in studying the sensitivity of various systems 
analysis and design problems. The authors themselves have 
several notable contributions to this progress. The performed 
error analy.scs (also partly contributions of authors to the 
field) arc useful premises for further research. Carefully 
chosen examples help to clarify many of discussed aspects 
and contribute decisively to the understanding of merits or 
inconveniences of particulars methods. The cxercLses given at 
the end of each paiagraph highlight additional a.spects not 
di.scusscd in the book. Some topics of possible interest 
omitted from the presentation are: the computation of 
.systems zeros, the computation of frequency response, or the 
evaluation of transfer function matrices from state-space 
models. 

The algorithms presented in the book can serve in 
principle for computer implementations, although this 
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aspect, in my opinion, was not the main intention of authors 
in presenting the algorithms. The methods for analysis and 
design of linear systems are rather sophisticated and 
therefore, the development of robust software implementa¬ 
tions is a task for numerical analysis experts. Many 
algorithmic details, not mentioned in the book can further 
improve the efficiency of algorithms and the accuracy of 
results. For almost all of presented methods and for many 
other algorithms only referred to, robust implementations 
are available either in Fortran libraries or in MATLAB 
Toolboxes. 
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Many real engineering and non-engineering problems 
facing researchers of the world are highly complex and 
stochastic in nature. Modelling and controlling of most 
complex systems riiust deal with 'high' dimension, uncer¬ 
tainty, and information structure constraints. One way for 
solving these problems is to use the strategy called 
decentralized control in which system inputs are assigned to a 
given set of local controllers (stations) which observe only 
local system outputs. This approach can avoid difficulties in 
data gathering, storage requirements, computer program 
debugging and geographical separation of system com¬ 
ponents. A great number of papers and books (Jamshidi 
(1983); Siljak (1978); Singh (1981), etc.) have been published 
to indicate concepts and methodologies of decentralized 
control of complex system since the 197()s. Decentralized 
control has been successfully applied to many industrial 
systems, for example, power systems, traffic systems and 
communication systems, and has become one of the most 
important branches of modern systems theory. 

Following the book Large Scale Dynamic Systems ; Stability 
and Structure which presented some ideas about decentral¬ 
ized control, published in 1978, Professor Siljak has written 
another book Decentralized Control of Complex Systems 
which is mainly concerned with the characteristics of complex 
systems: dimensionality, uncertainty, and information struc¬ 
ture constraints, and provides a series of novel concepts and 
methods to handle the problems of decentralized control of 
interconnected subsystems. 

The book summarizes the research achievements de* 
veloped by the author and his colleagues in the decentralized 
control held in the last two decades. Throughout the book 
great care is taken to differentiate between centralized and 
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decentralized control. In this book, a great deal of altenlion 
is paid to the explanation of the substantial problems 
involved in decentralized control systems, such as intercon¬ 
nected subsystems modelling, decomposition, stability, 
optimization and robustness of decentralized controllers. In 
the first chapter of the book the important concept of 
graph-theoretic framework is presented starting from the 
view point of structural modelling for complex .systems. 
Because the decentralized control problems are essentially 
structural, the graph-theoretic framework provides a very 
good environment for the computation of system structural 
fixed modes with arbitrary feedback structure constraints, 
reachability, controllability and ob.servability. It also can be 
used to identify the minimal set of lines in directed graphs 
which are es.sential for preserving input reachability and 
structural controllability of a given sy.stem. The graph- 
theoretic algorithm is computationally attractive because one 
can use the Boolean operations instead of algebraic 
manipulation to get results needed. I think that the 
graph-theoretic framework and its algorithms which arc used 
repeatedly, almost in each chapter, can be considered as the 
cream of the book, it is an excellent part, which is superior to 
other books on the same topic. 

In the second chapter of the book, an effort is made to 
deal with the important problem of the connective stability of 
complex systems under structural perturbations. The book 
provides an Ml-matrix condition based on Vector Lyapunov 
Function for testing the connective stability of decentralized 
control .systems. Having this result, the control strategy may 
become very simple: stabilize each subsystem when 
decoupled, and then check stability of the connective 
closed-loop subsystems using M-matrix conditions. The 
condition is further modified in the successive chapters and 
frequently applied to study inherent properties of decentral¬ 
ized control systems, such as robustness, .suboptimization and 
decentralized stabilizability of interconnected subsystems. 
The entire second chapter should be considered as the 
theoretical fundamentals of the later chapters. 

The third part of the book is concerned with the 
optimization of decentralized control systems. Due to the 
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nonciasRical information structure constraints, the standard 
optimization concepts and methods cannot he extended to 
formulate decentralized control strategies. The most 
important factor to be considered is the interconnection 
between the subsystems. In this context, the book regards 
interconnections as perturbations of subsystems that arc 
controlled by decentralized LQ feedback and introduces a 
new idea of suboptimality index to measure the cost of 
robustness to structural perturbations. The book constructs 
and solves the inverse optimal problem of decentralized 
control which leads to an interesting result that the global 
optimality can be recovered from local optimal LQ control 
subsystems (even with nonlinear interconnections) if the 
performance index is modified. This is a very useful 
conclusion for the design of decentralized control laws, and 
also is an important contribution to the theory and 
application of decentralized control. 

Based on the graph-theoretic framework, interpreting the 
interconnections as perturbations of subsystems and using 
M -matrix test techniques the book extend the main results 
made in Chapters 1-3 to Chapters 4 and 5, in which 
stochastic control, dynamic feedback control, and adaptive 
control of complex systems arc analysed and discussed. In 
these parts an effort is made to derive many important 
theorems which establish the basis of arguments, such as the 
sufficient conditions for the existence of decentralized 
asymptotic observers, suhoptimal estimator, and optimal 
decentralized control laws foi a given system. I’he book 
provides several methods for designing decentralized 
observers, estimators and controllers which can be carried 
out by using a parallel processing scheme. 

In order to control a complex system decenirally, the 
prerequisite is that the system can and must be decomposed 
into .several subsystems. But how is this done'^ The book 
gives sufficient space (three chapters) to introduce three 
decomposition approaches; LHT (Low Block Triangular) 
decomposition, nested Epsilon decomposition and overlap¬ 
ping decomposition, which can be easily achieved using the 
algorithms listed in the Appendix demonstrating the inherent 
efficiency of these algorithms and their easy implementation 
in a structural language, such as PASCAL and C, 

The last part of the book is concerned with the reliability 
of control which is the basic requirement in design ol 
complex systems So far as reliability is concerned, it has 
been discussed from different view points in relevant 


disciplines and there exist various techniques to increase the 
reliability of control systems from both hardware and 
software. In the book, the emphasis is placed on the control 
structure and multiple control systems. Using the Inclusion 
Principle and overlapping decomposition made in the 
preceding chapter it gives an example to illustrate the design 
of reliable control. The contents discussed in the last chapter 
arc not detailed and may be regarded as a brief introduction 
to the reliable control of complex systems. 

Finally it is worthy to point out that at end of each chapter 
the book includes a section of notes and references, which 
not only provide readers with a large amount of relevant 
literature, but al.so introduce the evolution of some novel 
concepts and methodology obviously beneficial to those who 
wish further to pursue studies in some subjects of 
decentralized control of complex systems. Another merit of 
the book is that each chapter contains many examples of 
both analysis and design which makes the content of the 
book to be understood easily and well-readable. In my 
opinion the book is tailored to the needs of the readers who 
have acquired a degree ol proficiency in modern control 
systems theory. 
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